Tagged articles
37 articles
Page 1 of 1
AI Algorithm Path
AI Algorithm Path
Feb 19, 2026 · Artificial Intelligence

A Practical Guide to Industrial Defect Detection with Pre‑trained Neural Networks

The article explains how manufacturers can shift from defect‑specific vision models to anomaly detection by leveraging pre‑trained object‑detection networks, visualising feature maps, and applying memory‑bank methods such as PaDiM and PatchCore, with the open‑source Anomalib library as a ready‑to‑use solution.

AnomalibPaDiMPatchCore
0 likes · 7 min read
A Practical Guide to Industrial Defect Detection with Pre‑trained Neural Networks
Data Party THU
Data Party THU
Oct 3, 2025 · Artificial Intelligence

Unlocking Pre‑trained Graph Models: The UniPrompt Approach to Graph Prompt Learning

This article analyzes the current limitations of Graph Prompt Learning, reveals that representation‑level prompts are essentially equivalent to fine‑tuning a downstream classifier, and introduces UniPrompt—a method that leverages pre‑trained graph models while preserving input graph structure for superior in‑domain and cross‑domain performance.

Representation PromptUniPromptcross-domain
0 likes · 5 min read
Unlocking Pre‑trained Graph Models: The UniPrompt Approach to Graph Prompt Learning
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Apr 8, 2024 · Artificial Intelligence

PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers

The article introduces PreFLMR, an open‑source, general‑purpose pre‑trained multimodal retriever that leverages fine‑grained late‑interaction to boost retrieval‑augmented generation for knowledge‑intensive visual tasks, describes its M2KR benchmark, training stages, and strong experimental results across multiple tasks.

AIFLMRKnowledge Retrieval
0 likes · 11 min read
PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers
AntTech
AntTech
Dec 13, 2023 · Artificial Intelligence

IEEE ICDM 2023 Graph Learning Challenge: Community Detection and Fraud Group Mining

The IEEE ICDM 2023 Graph Learning Challenge, co‑hosted by Ant Group and Zhejiang University, showcased deep graph learning approaches for community detection and fraud‑group mining, highlighting the winning team's Risk‑DCRN method and emphasizing the importance of pretrained models in large‑scale network analysis.

ICDMcommunity-detectionfraud detection
0 likes · 5 min read
IEEE ICDM 2023 Graph Learning Challenge: Community Detection and Fraud Group Mining
DataFunSummit
DataFunSummit
Jun 24, 2023 · Artificial Intelligence

From Model to Service: Alibaba Cloud Machine Learning PAI One‑Stop Model Development and Deployment Practice

This article presents an end‑to‑end overview of Alibaba Cloud’s Machine Learning PAI platform, detailing the three‑stage ML workflow, challenges in model development, the role of pre‑trained and open‑source models, PAI’s architecture, a hands‑on demo, and MLOps best practices for efficient model deployment.

Alibaba CloudMLOpsModel Deployment
0 likes · 11 min read
From Model to Service: Alibaba Cloud Machine Learning PAI One‑Stop Model Development and Deployment Practice
DataFunTalk
DataFunTalk
Jun 17, 2023 · Artificial Intelligence

Research on Text Generation for Structured Data

This article reviews the rapidly evolving field of structured‑data text generation, covering AI development stages, core concepts, model architectures from pipeline to pretrained transformers, key challenges such as content selection, numeric representation, reasoning and style control, and outlines recent research directions and Q&A insights.

AIStructured DataText Generation
0 likes · 21 min read
Research on Text Generation for Structured Data
DataFunTalk
DataFunTalk
Jun 10, 2023 · Artificial Intelligence

Financial Event Analysis and Applications Based on Pre-trained Models

This article introduces the tasks, techniques, and frameworks for financial event analysis using pre‑trained language models, covering unstructured data parsing, event semantics, graph construction, detection, extraction, and prediction, and presents the TDE‑GTEE model that achieves state‑of‑the‑art performance even in few‑shot scenarios.

AIEvent ExtractionFew‑Shot Learning
0 likes · 18 min read
Financial Event Analysis and Applications Based on Pre-trained Models
NetEase LeiHuo Testing Center
NetEase LeiHuo Testing Center
Jun 2, 2023 · Artificial Intelligence

AI Techniques for a Global Search Platform: Word Segmentation, Text Similarity, Image Retrieval, and Multimodal Models

This article shares the development of a global search platform that leverages AI technologies such as Chinese word segmentation, part‑of‑speech tagging, text similarity via Simhash and Synonyms, image similarity using histogram, Hamming distance and ResNet‑50, and multimodal CLIP‑based models to improve search efficiency and accuracy.

AINLPimage retrieval
0 likes · 12 min read
AI Techniques for a Global Search Platform: Word Segmentation, Text Similarity, Image Retrieval, and Multimodal Models
DataFunTalk
DataFunTalk
May 18, 2023 · Artificial Intelligence

Query Intent Recognition in Enterprise Search: Knowledge‑Enhanced and Pretrained Model Approaches

This article explains how Alibaba's enterprise search system tackles query intent recognition by combining knowledge‑enhanced techniques, short‑text classification, and pretrained language models such as StructBERT and prompt‑learning, and it shares two real‑world case studies, experimental results, and future research directions.

Enterprise searchKnowledge EnhancementNLP
0 likes · 19 min read
Query Intent Recognition in Enterprise Search: Knowledge‑Enhanced and Pretrained Model Approaches
Sohu Tech Products
Sohu Tech Products
Mar 22, 2023 · Artificial Intelligence

An Overview of Prompt Learning in Natural Language Processing

This article reviews the evolution of NLP training paradigms, explains why prompt learning is needed, defines its core concepts, and surveys major hard‑template and soft‑template methods such as PET, LM‑BFF, P‑tuning, and Prefix‑tuning, highlighting their advantages for few‑shot and zero‑shot scenarios.

Few-ShotNLPPrompt Tuning
0 likes · 10 min read
An Overview of Prompt Learning in Natural Language Processing
Baidu Geek Talk
Baidu Geek Talk
Feb 17, 2023 · Artificial Intelligence

How PGLBox Achieves 27× Faster GPU‑Powered Large‑Scale Graph Learning

PGLBox, Baidu’s GPU‑based large‑scale graph training framework, delivers up to 27× speedup over CPU clusters by fully GPU‑accelerating storage, sampling, and training, supporting billions of nodes, advanced GNN algorithms, multi‑level storage, and seamless integration of massive pretrained models.

GPULarge-Scale TrainingPGLBox
0 likes · 7 min read
How PGLBox Achieves 27× Faster GPU‑Powered Large‑Scale Graph Learning
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Feb 16, 2023 · Artificial Intelligence

Intelligent Creative Generation and Optimization for Xiaohongshu Advertising

Xiaohongshu’s end‑to‑end intelligent creative platform extracts high‑quality images, generates diverse titles with RED‑pretrained GPT‑2/T5 models, and selects the best ads using a UCB‑based multi‑armed bandit that balances CTR uplift, revenue and user‑experience, while employing position‑corrected metrics and a scalable dual‑tower DNN to boost long‑tail performance and overall revenue.

AIAdvertisingNLP
0 likes · 18 min read
Intelligent Creative Generation and Optimization for Xiaohongshu Advertising
DataFunTalk
DataFunTalk
Jan 28, 2023 · Artificial Intelligence

Industry Search: Background, Technologies, and Real‑World Applications

This article presents a comprehensive overview of industry search, covering its background, core retrieval and ranking technologies—including sparse and dense retrieval, pre‑trained language models, tokenization, NER, adaptive multi‑task training, and re‑ranking models—followed by detailed case studies such as address analysis, family‑ID unification, emergency call handling, education photo‑search, and power‑knowledge‑base integration.

NLPaddress analysisindustry search
0 likes · 13 min read
Industry Search: Background, Technologies, and Real‑World Applications
Baidu Geek Talk
Baidu Geek Talk
Nov 16, 2022 · Artificial Intelligence

How Baidu’s Ernie‑SimCSE Uses Contrastive Learning to Crush Spam Promotion

This article explains how Baidu's anti‑spam team tackled large‑scale promotional spam on Baidu Zhidao by combining the Ernie pretrained model with SimCSE contrastive learning, detailing the problem background, traditional methods, text‑representation stages, the SimCSE approach, training pipeline, optimizations, and experimental results.

ErnieNLPSimCSE
0 likes · 15 min read
How Baidu’s Ernie‑SimCSE Uses Contrastive Learning to Crush Spam Promotion
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Oct 18, 2022 · Artificial Intelligence

Practical Implementation of Vision Transformer (ViT) for Image Classification in PyTorch

This article walks readers through building, training, and evaluating a Vision Transformer (ViT) model for a five‑class flower classification task, providing detailed code snippets, model architecture explanations, training script adjustments, and experimental results that highlight the importance of pre‑trained weights.

Deep LearningImage ClassificationPyTorch
0 likes · 13 min read
Practical Implementation of Vision Transformer (ViT) for Image Classification in PyTorch
ELab Team
ELab Team
Sep 23, 2022 · Artificial Intelligence

Fine‑Tune a Chinese BERT Model for Cloze Tasks in 30 Minutes

This tutorial walks you through NLP fundamentals, the evolution of BERT, the concept of pre‑trained models, and a step‑by‑step guide to fine‑tune a Chinese BERT on a cloze‑style task, complete with code snippets and verification results.

BERTChineseCloze Task
0 likes · 13 min read
Fine‑Tune a Chinese BERT Model for Cloze Tasks in 30 Minutes
DataFunTalk
DataFunTalk
Aug 28, 2022 · Artificial Intelligence

Emerging Paths Toward General AI: Trends in Large‑Scale Pretrained Models

The article reviews how the Transformer breakthrough, the rapid scaling of large language models such as GPT‑3, Switch Transformer, and Alibaba's AliceMind and M6, together with multimodal research, are shaping the next phase of artificial intelligence toward more general, collaborative, and open AI systems.

AI trendsartificial intelligencelarge language models
0 likes · 5 min read
Emerging Paths Toward General AI: Trends in Large‑Scale Pretrained Models
AntTech
AntTech
Jul 7, 2022 · Artificial Intelligence

Ant Group Insurance Technology Wins First Place in Fine‑Grained Dialogue Social Bias Detection at NLPCC 2023

Ant Group's insurance technology team secured the top spot in the fine‑grained dialogue social bias detection task at the 11th CCF NLPCC conference, showcasing their AI‑driven bias‑mitigation methods, a proprietary pre‑trained model AntInsBert, and a claim‑automation system that boosts insurance service fairness and efficiency.

AntInsBertBias DetectionInsurance AI
0 likes · 3 min read
Ant Group Insurance Technology Wins First Place in Fine‑Grained Dialogue Social Bias Detection at NLPCC 2023
DataFunTalk
DataFunTalk
May 22, 2022 · Artificial Intelligence

Advances in Information‑Flow Recommendation: Pre‑trained Models and Multimodal User‑Interface Modeling

This article reviews Huawei Noah's Ark Lab's work on modern information‑flow recommendation, covering the evolution from collaborative filtering to deep learning, the application of BERT‑based pre‑training for news ranking, multimodal user‑interface modeling, practical deployment challenges, and future research directions.

AIBERTHuawei
0 likes · 19 min read
Advances in Information‑Flow Recommendation: Pre‑trained Models and Multimodal User‑Interface Modeling
DataFunTalk
DataFunTalk
Mar 17, 2022 · Artificial Intelligence

A Survey of Text Classification and Intent Recognition: Industrial and Research Perspectives

This article reviews recent developments in text classification and intent recognition, comparing industrial practices such as business‑coupled feature engineering with research trends like pretrained language models, and provides references and practical insights for building effective NLP solutions.

NLPindustry applicationsintent recognition
0 likes · 13 min read
A Survey of Text Classification and Intent Recognition: Industrial and Research Perspectives
ITPUB
ITPUB
Mar 2, 2022 · Artificial Intelligence

Leveraging Giant AI Models for Startup Success: Opportunities and Pitfalls

This article examines how startups can harness massive pre‑trained AI models such as GPT‑3, outlining the historical context, benefits of transfer learning, the steep costs and data‑alignment challenges, and strategic considerations when using cloud APIs versus self‑hosting.

AICloud APIsmachine learning
0 likes · 14 min read
Leveraging Giant AI Models for Startup Success: Opportunities and Pitfalls
DataFunSummit
DataFunSummit
Feb 28, 2022 · Artificial Intelligence

UGC Sentiment Analysis Solutions and Applications in Taobao

This article presents a comprehensive overview of Taobao's user‑generated content (UGC) sentiment analysis pipeline, covering background, task definition, challenges, model architecture—including RoBERTa‑based extraction, sentiment‑knowledge pre‑training, and graph augmentation—personalized impression ranking, business impact cases, and future research directions.

Sentiment AnalysisUGCaspect extraction
0 likes · 16 min read
UGC Sentiment Analysis Solutions and Applications in Taobao
DataFunTalk
DataFunTalk
Feb 2, 2022 · Artificial Intelligence

UGC Sentiment Analysis Solutions and Applications in Taobao

This article presents a comprehensive overview of Taobao's user‑generated content sentiment analysis pipeline, covering task definition, challenges, model architecture with RoBERTa‑based extraction, sentiment‑knowledge pre‑training, graph augmentation, personalized ranking, business impact metrics, and future research directions.

Deep LearningKnowledge GraphSentiment Analysis
0 likes · 16 min read
UGC Sentiment Analysis Solutions and Applications in Taobao
DataFunSummit
DataFunSummit
Jan 13, 2022 · Artificial Intelligence

DeltaLM: A Multilingual Pretrained Encoder‑Decoder Model for Neural Machine Translation

DeltaLM is a multilingual pretrained encoder‑decoder model that leverages cross‑lingual transfer from a pretrained encoder and novel decoder architecture, employs span‑corruption and translation‑pair pretraining tasks, and uses a two‑stage fine‑tuning strategy to achieve strong zero‑shot and supervised translation performance across over 100 languages.

Cross-Lingual TransferDeltaLMNeural Machine Translation
0 likes · 12 min read
DeltaLM: A Multilingual Pretrained Encoder‑Decoder Model for Neural Machine Translation
Baidu Geek Talk
Baidu Geek Talk
Nov 29, 2021 · Artificial Intelligence

Pretrained Models for First-Stage Information Retrieval: A Comprehensive Review

This comprehensive review by Dr. Fan Yixing surveys how pretrained language models have transformed first‑stage information retrieval, tracing the shift from traditional term‑based methods to neural sparse, dense, and hybrid approaches, and discussing key challenges such as hard‑negative mining, joint indexing‑representation learning, and generative‑discriminative training.

Hybrid RetrievalNeural IRSparse Retrieval
0 likes · 15 min read
Pretrained Models for First-Stage Information Retrieval: A Comprehensive Review
Meituan Technology Team
Meituan Technology Team
Oct 21, 2021 · Artificial Intelligence

Meituan's End-to-End Sentiment Analysis Technology and the ASAP Dataset

Meituan’s NLP Center introduced the ASAP dataset—the largest real‑world Chinese attribute‑level sentiment corpus—to date, and the article traces the progression from document‑level regression models upgraded with MT‑BERT, through multi‑task attribute‑level ABSA and opinion‑triplet extraction, to scalable real‑time and batch services, while outlining future transfer‑learning and few‑shot research.

MeituanNLPSentiment Analysis
0 likes · 25 min read
Meituan's End-to-End Sentiment Analysis Technology and the ASAP Dataset
58 Tech
58 Tech
Aug 5, 2021 · Artificial Intelligence

Exploration and Practice of Text Representation Algorithms in the 58 Security Scenario

This article presents a comprehensive study of text representation techniques—from weighted word‑vector methods to supervised SimBert and unsupervised contrastive learning models—applied to large‑scale unstructured data in 58's information‑security workflows, evaluating their effectiveness for classification and content‑recall tasks.

BERTSimCSEcontrastive learning
0 likes · 11 min read
Exploration and Practice of Text Representation Algorithms in the 58 Security Scenario
DataFunTalk
DataFunTalk
Jul 1, 2021 · Artificial Intelligence

Pre‑Trained Models: Past, Present, and Future – A Comprehensive Survey

This article surveys the evolution of pre‑trained models, covering the origins of transfer and self‑supervised learning, the rise of transformer‑based PTMs such as BERT and GPT, efficient architecture designs, multimodal and multilingual extensions, theoretical analyses, and future research directions for scalable and robust AI systems.

AI researchefficient traininglarge language models
0 likes · 27 min read
Pre‑Trained Models: Past, Present, and Future – A Comprehensive Survey
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
May 8, 2021 · Artificial Intelligence

How Huawei’s Pangu Pre‑trained Models Slash Development Costs and Boost Vision AI

In a detailed interview, Huawei Cloud experts explain how the ultra‑large Pangu CV and NLP models—trained on billions of parameters and terabytes of data—achieve top benchmark scores, simplify developer workflows, and deliver industry‑wide deployments that dramatically cut labeling effort and iteration time.

AI DevelopmentHuawei Cloudpretrained models
0 likes · 9 min read
How Huawei’s Pangu Pre‑trained Models Slash Development Costs and Boost Vision AI
JD Cloud Developers
JD Cloud Developers
Feb 5, 2021 · Artificial Intelligence

2020 NLP Milestones & Future Trends: Insights from JD’s AI Scientist

In an InfoQ interview, JD Technology senior algorithm scientist Wu Youzheng reviews the rapid advances of natural language processing in 2020—including GPT‑3, multimodal dialogue, knowledge‑enhanced pre‑training, and knowledge graphs—while outlining the most promising research directions and practical challenges for the coming year.

AI applicationsGPT-3Knowledge Graph
0 likes · 18 min read
2020 NLP Milestones & Future Trends: Insights from JD’s AI Scientist
AntTech
AntTech
Sep 27, 2020 · Artificial Intelligence

Question Directed Graph Attention Network for Numerical Reasoning over Text (QDGAT)

The paper introduces QDGAT, a question‑directed graph attention network that enhances numerical reasoning in reading comprehension by explicitly modeling relationships among numbers, entities, and questions, and demonstrates its effectiveness through extensive experiments on the DROP dataset.

DROPKnowledge GraphNLP
0 likes · 7 min read
Question Directed Graph Attention Network for Numerical Reasoning over Text (QDGAT)
DataFunTalk
DataFunTalk
Jul 7, 2020 · Artificial Intelligence

Optimizing Pretrained Language Model Inference: Lessons from the NLPCC Small Model Competition and Deployment at Xiaomi

This article shares the Xiaomi AI Lab NLP team's experience in the NLPCC lightweight language model competition, discusses efficiency challenges of large pretrained models like BERT, and details practical inference optimizations—including model distillation, batching, FP16 quantization, and FasterTransformer integration—that dramatically reduce latency and hardware costs in production.

AIBERTInference Optimization
0 likes · 15 min read
Optimizing Pretrained Language Model Inference: Lessons from the NLPCC Small Model Competition and Deployment at Xiaomi
DataFunTalk
DataFunTalk
May 6, 2020 · Artificial Intelligence

Application of Large-Scale Pretrained Models in Alibaba Machine Translation

This article reviews how large‑scale pretrained language models have reshaped NLP, outlines the challenges of applying them to machine translation, introduces the APT framework and the GRET architecture for better encoder‑decoder integration, and reports experimental gains and future research directions.

AIAPT frameworkGRET
0 likes · 10 min read
Application of Large-Scale Pretrained Models in Alibaba Machine Translation
DataFunTalk
DataFunTalk
Apr 16, 2020 · Artificial Intelligence

Comprehensive Survey of Pre-trained Models for Natural Language Processing

This article provides a detailed survey of pre‑trained models (PTMs) for natural language processing, classifying them into shallow embeddings and contextual encoders, discussing training paradigms such as knowledge integration and model compression, and offering guidance on transfer learning and future challenges.

knowledge integrationmodel compressionnatural language processing
0 likes · 25 min read
Comprehensive Survey of Pre-trained Models for Natural Language Processing
JD Retail Technology
JD Retail Technology
Aug 8, 2019 · Artificial Intelligence

From Word Representations to Sentiment Analysis – Talk by Dr. Feng Ao

On August 6, Dr. Feng Ao presented a comprehensive overview of the evolution of word representations and sentiment analysis, illustrating the shift from traditional linguistic features to modern pretrained models such as BERT and XLNet, and sharing practical convolutional experiments relevant to industry applications.

NLPSentiment Analysisartificial intelligence
0 likes · 4 min read
From Word Representations to Sentiment Analysis – Talk by Dr. Feng Ao