Tagged articles

Pretrained Models

38 articles · Page 1 of 1

Jun 19, 2026 · Artificial Intelligence

Common NLP Q&A: Key Concepts, Models, and Tools Explained

This article provides concise answers to frequent Natural Language Processing questions, covering the distinction between NLP and NLG, popular pretrained models, deep‑learning architectures, word‑vector techniques, named‑entity recognition, sentiment, semantic and syntax analysis, part‑of‑speech tagging, language models, core tasks, real‑world applications, challenges, future trends, interpretability, and essential tools and libraries.

Deep LearningNLPNamed Entity Recognition

0 likes · 14 min read

Common NLP Q&A: Key Concepts, Models, and Tools Explained

AI Algorithm Path

Feb 19, 2026 · Artificial Intelligence

A Practical Guide to Industrial Defect Detection with Pre‑trained Neural Networks

The article explains how manufacturers can shift from defect‑specific vision models to anomaly detection by leveraging pre‑trained object‑detection networks, visualising feature maps, and applying memory‑bank methods such as PaDiM and PatchCore, with the open‑source Anomalib library as a ready‑to‑use solution.

AnomalibAnomaly DetectionPaDiM

0 likes · 7 min read

A Practical Guide to Industrial Defect Detection with Pre‑trained Neural Networks

Data Party THU

Oct 3, 2025 · Artificial Intelligence

Unlocking Pre‑trained Graph Models: The UniPrompt Approach to Graph Prompt Learning

This article analyzes the current limitations of Graph Prompt Learning, reveals that representation‑level prompts are essentially equivalent to fine‑tuning a downstream classifier, and introduces UniPrompt—a method that leverages pre‑trained graph models while preserving input graph structure for superior in‑domain and cross‑domain performance.

Pretrained ModelsRepresentation PromptUniPrompt

0 likes · 5 min read

Unlocking Pre‑trained Graph Models: The UniPrompt Approach to Graph Prompt Learning

Alibaba Cloud Developer

Jul 24, 2025 · Artificial Intelligence

Unlocking AI Model Choices: From CNNs to Transformers and Fine‑Tuning Strategies

This comprehensive guide walks you through the evolution of AI model architectures—from CNNs and RNNs to Transformers and GANs—explaining their core concepts, typical use cases, and how to select, train, and fine‑tune pre‑trained models using practical code examples.

AIPretrained ModelsPython

0 likes · 25 min read

Unlocking AI Model Choices: From CNNs to Transformers and Fine‑Tuning Strategies

Rare Earth Juejin Tech Community

Apr 8, 2024 · Artificial Intelligence

PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers

The article introduces PreFLMR, an open‑source, general‑purpose pre‑trained multimodal retriever that leverages fine‑grained late‑interaction to boost retrieval‑augmented generation for knowledge‑intensive visual tasks, describes its M2KR benchmark, training stages, and strong experimental results across multiple tasks.

AIFLMRPretrained Models

0 likes · 11 min read

PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers

AntTech

Dec 13, 2023 · Artificial Intelligence

IEEE ICDM 2023 Graph Learning Challenge: Community Detection and Fraud Group Mining

The IEEE ICDM 2023 Graph Learning Challenge, co‑hosted by Ant Group and Zhejiang University, showcased deep graph learning approaches for community detection and fraud‑group mining, highlighting the winning team's Risk‑DCRN method and emphasizing the importance of pretrained models in large‑scale network analysis.

ICDMPretrained Modelscommunity-detection

0 likes · 5 min read

IEEE ICDM 2023 Graph Learning Challenge: Community Detection and Fraud Group Mining

DataFunSummit

Jun 24, 2023 · Artificial Intelligence

From Model to Service: Alibaba Cloud Machine Learning PAI One‑Stop Model Development and Deployment Practice

This article presents an end‑to‑end overview of Alibaba Cloud’s Machine Learning PAI platform, detailing the three‑stage ML workflow, challenges in model development, the role of pre‑trained and open‑source models, PAI’s architecture, a hands‑on demo, and MLOps best practices for efficient model deployment.

Alibaba CloudMLOpsModel Deployment

0 likes · 11 min read

From Model to Service: Alibaba Cloud Machine Learning PAI One‑Stop Model Development and Deployment Practice

DataFunTalk

Jun 17, 2023 · Artificial Intelligence

Research on Text Generation for Structured Data

This article reviews the rapidly evolving field of structured‑data text generation, covering AI development stages, core concepts, model architectures from pipeline to pretrained transformers, key challenges such as content selection, numeric representation, reasoning and style control, and outlines recent research directions and Q&A insights.

AIPretrained ModelsStructured Data

0 likes · 21 min read

Research on Text Generation for Structured Data

DataFunTalk

Jun 10, 2023 · Artificial Intelligence

Financial Event Analysis and Applications Based on Pre-trained Models

This article introduces the tasks, techniques, and frameworks for financial event analysis using pre‑trained language models, covering unstructured data parsing, event semantics, graph construction, detection, extraction, and prediction, and presents the TDE‑GTEE model that achieves state‑of‑the‑art performance even in few‑shot scenarios.

AIEvent ExtractionFinancial NLP

0 likes · 18 min read

Financial Event Analysis and Applications Based on Pre-trained Models

NetEase LeiHuo Testing Center

Jun 2, 2023 · Artificial Intelligence

AI Techniques for a Global Search Platform: Word Segmentation, Text Similarity, Image Retrieval, and Multimodal Models

This article shares the development of a global search platform that leverages AI technologies such as Chinese word segmentation, part‑of‑speech tagging, text similarity via Simhash and Synonyms, image similarity using histogram, Hamming distance and ResNet‑50, and multimodal CLIP‑based models to improve search efficiency and accuracy.

AIMultimodalNLP

0 likes · 12 min read

AI Techniques for a Global Search Platform: Word Segmentation, Text Similarity, Image Retrieval, and Multimodal Models

DataFunTalk

May 18, 2023 · Artificial Intelligence

Query Intent Recognition in Enterprise Search: Knowledge‑Enhanced and Pretrained Model Approaches

This article explains how Alibaba's enterprise search system tackles query intent recognition by combining knowledge‑enhanced techniques, short‑text classification, and pretrained language models such as StructBERT and prompt‑learning, and it shares two real‑world case studies, experimental results, and future research directions.

NLPPretrained Modelsenterprise search

0 likes · 19 min read

Query Intent Recognition in Enterprise Search: Knowledge‑Enhanced and Pretrained Model Approaches

Sohu Tech Products

Mar 22, 2023 · Artificial Intelligence

An Overview of Prompt Learning in Natural Language Processing

This article reviews the evolution of NLP training paradigms, explains why prompt learning is needed, defines its core concepts, and surveys major hard‑template and soft‑template methods such as PET, LM‑BFF, P‑tuning, and Prefix‑tuning, highlighting their advantages for few‑shot and zero‑shot scenarios.

Few-shotNLPPretrained Models

0 likes · 10 min read

An Overview of Prompt Learning in Natural Language Processing

Baidu Geek Talk

Feb 17, 2023 · Artificial Intelligence

How PGLBox Achieves 27× Faster GPU‑Powered Large‑Scale Graph Learning

PGLBox, Baidu’s GPU‑based large‑scale graph training framework, delivers up to 27× speedup over CPU clusters by fully GPU‑accelerating storage, sampling, and training, supporting billions of nodes, advanced GNN algorithms, multi‑level storage, and seamless integration of massive pretrained models.

Distributed ComputingGPUGraph Neural Networks

0 likes · 7 min read

How PGLBox Achieves 27× Faster GPU‑Powered Large‑Scale Graph Learning

Xiaohongshu Tech REDtech

Feb 16, 2023 · Artificial Intelligence

Intelligent Creative Generation and Optimization for Xiaohongshu Advertising

Xiaohongshu’s end‑to‑end intelligent creative platform extracts high‑quality images, generates diverse titles with RED‑pretrained GPT‑2/T5 models, and selects the best ads using a UCB‑based multi‑armed bandit that balances CTR uplift, revenue and user‑experience, while employing position‑corrected metrics and a scalable dual‑tower DNN to boost long‑tail performance and overall revenue.

AIAdvertisingNLP

0 likes · 18 min read

Intelligent Creative Generation and Optimization for Xiaohongshu Advertising

DataFunTalk

Jan 28, 2023 · Artificial Intelligence

Industry Search: Background, Technologies, and Real‑World Applications

This article presents a comprehensive overview of industry search, covering its background, core retrieval and ranking technologies—including sparse and dense retrieval, pre‑trained language models, tokenization, NER, adaptive multi‑task training, and re‑ranking models—followed by detailed case studies such as address analysis, family‑ID unification, emergency call handling, education photo‑search, and power‑knowledge‑base integration.

NLPPretrained Modelsaddress analysis

0 likes · 13 min read

Industry Search: Background, Technologies, and Real‑World Applications

Baidu Geek Talk

Nov 16, 2022 · Artificial Intelligence

How Baidu’s Ernie‑SimCSE Uses Contrastive Learning to Crush Spam Promotion

This article explains how Baidu's anti‑spam team tackled large‑scale promotional spam on Baidu Zhidao by combining the Ernie pretrained model with SimCSE contrastive learning, detailing the problem background, traditional methods, text‑representation stages, the SimCSE approach, training pipeline, optimizations, and experimental results.

ERNIENLPPretrained Models

0 likes · 15 min read

How Baidu’s Ernie‑SimCSE Uses Contrastive Learning to Crush Spam Promotion

Rare Earth Juejin Tech Community

Oct 18, 2022 · Artificial Intelligence

Practical Implementation of Vision Transformer (ViT) for Image Classification in PyTorch

This article walks readers through building, training, and evaluating a Vision Transformer (ViT) model for a five‑class flower classification task, providing detailed code snippets, model architecture explanations, training script adjustments, and experimental results that highlight the importance of pre‑trained weights.

Deep LearningPretrained ModelsPyTorch

0 likes · 13 min read

Practical Implementation of Vision Transformer (ViT) for Image Classification in PyTorch

ELab Team

Sep 23, 2022 · Artificial Intelligence

Fine‑Tune a Chinese BERT Model for Cloze Tasks in 30 Minutes

This tutorial walks you through NLP fundamentals, the evolution of BERT, the concept of pre‑trained models, and a step‑by‑step guide to fine‑tune a Chinese BERT on a cloze‑style task, complete with code snippets and verification results.

BERTChineseCloze Task

0 likes · 13 min read

Fine‑Tune a Chinese BERT Model for Cloze Tasks in 30 Minutes

DataFunTalk

Aug 28, 2022 · Artificial Intelligence

Emerging Paths Toward General AI: Trends in Large‑Scale Pretrained Models

The article reviews how the Transformer breakthrough, the rapid scaling of large language models such as GPT‑3, Switch Transformer, and Alibaba's AliceMind and M6, together with multimodal research, are shaping the next phase of artificial intelligence toward more general, collaborative, and open AI systems.

AI trendsArtificial IntelligenceLarge Language Models

0 likes · 5 min read

Emerging Paths Toward General AI: Trends in Large‑Scale Pretrained Models

AntTech

Jul 7, 2022 · Artificial Intelligence

Ant Group Insurance Technology Wins First Place in Fine‑Grained Dialogue Social Bias Detection at NLPCC 2023

Ant Group's insurance technology team secured the top spot in the fine‑grained dialogue social bias detection task at the 11th CCF NLPCC conference, showcasing their AI‑driven bias‑mitigation methods, a proprietary pre‑trained model AntInsBert, and a claim‑automation system that boosts insurance service fairness and efficiency.

AntInsBertBias DetectionNLP

0 likes · 3 min read

Ant Group Insurance Technology Wins First Place in Fine‑Grained Dialogue Social Bias Detection at NLPCC 2023

DataFunTalk

May 22, 2022 · Artificial Intelligence

Advances in Information‑Flow Recommendation: Pre‑trained Models and Multimodal User‑Interface Modeling

This article reviews Huawei Noah's Ark Lab's work on modern information‑flow recommendation, covering the evolution from collaborative filtering to deep learning, the application of BERT‑based pre‑training for news ranking, multimodal user‑interface modeling, practical deployment challenges, and future research directions.

AIBERTHuawei

0 likes · 19 min read

Advances in Information‑Flow Recommendation: Pre‑trained Models and Multimodal User‑Interface Modeling

DataFunTalk

Mar 17, 2022 · Artificial Intelligence

A Survey of Text Classification and Intent Recognition: Industrial and Research Perspectives

This article reviews recent developments in text classification and intent recognition, comparing industrial practices such as business‑coupled feature engineering with research trends like pretrained language models, and provides references and practical insights for building effective NLP solutions.

Intent RecognitionNLPPretrained Models

0 likes · 13 min read

A Survey of Text Classification and Intent Recognition: Industrial and Research Perspectives

ITPUB

Mar 2, 2022 · Artificial Intelligence

Leveraging Giant AI Models for Startup Success: Opportunities and Pitfalls

This article examines how startups can harness massive pre‑trained AI models such as GPT‑3, outlining the historical context, benefits of transfer learning, the steep costs and data‑alignment challenges, and strategic considerations when using cloud APIs versus self‑hosting.

AICloud APIsPretrained Models

0 likes · 14 min read

Leveraging Giant AI Models for Startup Success: Opportunities and Pitfalls

DataFunSummit

Feb 28, 2022 · Artificial Intelligence

UGC Sentiment Analysis Solutions and Applications in Taobao

This article presents a comprehensive overview of Taobao's user‑generated content (UGC) sentiment analysis pipeline, covering background, task definition, challenges, model architecture—including RoBERTa‑based extraction, sentiment‑knowledge pre‑training, and graph augmentation—personalized impression ranking, business impact cases, and future research directions.

Pretrained ModelsSentiment AnalysisUGC

0 likes · 16 min read

UGC Sentiment Analysis Solutions and Applications in Taobao

DataFunTalk

Feb 2, 2022 · Artificial Intelligence

UGC Sentiment Analysis Solutions and Applications in Taobao

This article presents a comprehensive overview of Taobao's user‑generated content sentiment analysis pipeline, covering task definition, challenges, model architecture with RoBERTa‑based extraction, sentiment‑knowledge pre‑training, graph augmentation, personalized ranking, business impact metrics, and future research directions.

Deep LearningKnowledge GraphPretrained Models

0 likes · 16 min read

DataFunSummit

Jan 13, 2022 · Artificial Intelligence

DeltaLM: A Multilingual Pretrained Encoder‑Decoder Model for Neural Machine Translation

DeltaLM is a multilingual pretrained encoder‑decoder model that leverages cross‑lingual transfer from a pretrained encoder and novel decoder architecture, employs span‑corruption and translation‑pair pretraining tasks, and uses a two‑stage fine‑tuning strategy to achieve strong zero‑shot and supervised translation performance across over 100 languages.

Cross-Lingual TransferDeltaLMNeural Machine Translation

0 likes · 12 min read

DeltaLM: A Multilingual Pretrained Encoder‑Decoder Model for Neural Machine Translation

Baidu Geek Talk

Nov 29, 2021 · Artificial Intelligence

Pretrained Models for First-Stage Information Retrieval: A Comprehensive Review

This comprehensive review by Dr. Fan Yixing surveys how pretrained language models have transformed first‑stage information retrieval, tracing the shift from traditional term‑based methods to neural sparse, dense, and hybrid approaches, and discussing key challenges such as hard‑negative mining, joint indexing‑representation learning, and generative‑discriminative training.

Hybrid RetrievalInformation RetrievalNeural IR

0 likes · 15 min read

Pretrained Models for First-Stage Information Retrieval: A Comprehensive Review

Meituan Technology Team

Oct 21, 2021 · Artificial Intelligence

Meituan's End-to-End Sentiment Analysis Technology and the ASAP Dataset

Meituan’s NLP Center introduced the ASAP dataset—the largest real‑world Chinese attribute‑level sentiment corpus—to date, and the article traces the progression from document‑level regression models upgraded with MT‑BERT, through multi‑task attribute‑level ABSA and opinion‑triplet extraction, to scalable real‑time and batch services, while outlining future transfer‑learning and few‑shot research.

MeituanNLPPretrained Models

0 likes · 25 min read

Meituan's End-to-End Sentiment Analysis Technology and the ASAP Dataset

Sohu Tech Products

Sep 1, 2021 · Artificial Intelligence

2021 Sohu Text Matching Competition: Model Design, Tricks, and Performance Analysis

This article details the authors' approach to the 2021 Sohu Text Matching competition, describing the task definition, data splits, model architectures (cross‑encoder and bi‑encoder), pretrained language models used, various training tricks, ensemble strategies, and the resulting evaluation scores.

AINLPPretrained Models

0 likes · 8 min read

2021 Sohu Text Matching Competition: Model Design, Tricks, and Performance Analysis

58 Tech

Aug 5, 2021 · Artificial Intelligence

Exploration and Practice of Text Representation Algorithms in the 58 Security Scenario

This article presents a comprehensive study of text representation techniques—from weighted word‑vector methods to supervised SimBert and unsupervised contrastive learning models—applied to large‑scale unstructured data in 58's information‑security workflows, evaluating their effectiveness for classification and content‑recall tasks.

BERTPretrained ModelsSimCSE

0 likes · 11 min read

Exploration and Practice of Text Representation Algorithms in the 58 Security Scenario

DataFunTalk

Jul 1, 2021 · Artificial Intelligence

Pre‑Trained Models: Past, Present, and Future – A Comprehensive Survey

This article surveys the evolution of pre‑trained models, covering the origins of transfer and self‑supervised learning, the rise of transformer‑based PTMs such as BERT and GPT, efficient architecture designs, multimodal and multilingual extensions, theoretical analyses, and future research directions for scalable and robust AI systems.

AI researchEfficient TrainingLarge Language Models

0 likes · 27 min read

Pre‑Trained Models: Past, Present, and Future – A Comprehensive Survey

Huawei Cloud Developer Alliance

May 8, 2021 · Artificial Intelligence

How Huawei’s Pangu Pre‑trained Models Slash Development Costs and Boost Vision AI

In a detailed interview, Huawei Cloud experts explain how the ultra‑large Pangu CV and NLP models—trained on billions of parameters and terabytes of data—achieve top benchmark scores, simplify developer workflows, and deliver industry‑wide deployments that dramatically cut labeling effort and iteration time.

Huawei CloudPretrained Modelsai-development

0 likes · 9 min read

How Huawei’s Pangu Pre‑trained Models Slash Development Costs and Boost Vision AI

JD Cloud Developers

Feb 5, 2021 · Artificial Intelligence

2020 NLP Milestones & Future Trends: Insights from JD’s AI Scientist

In an InfoQ interview, JD Technology senior algorithm scientist Wu Youzheng reviews the rapid advances of natural language processing in 2020—including GPT‑3, multimodal dialogue, knowledge‑enhanced pre‑training, and knowledge graphs—while outlining the most promising research directions and practical challenges for the coming year.

AI ApplicationsGPT-3Knowledge Graph

0 likes · 18 min read

2020 NLP Milestones & Future Trends: Insights from JD’s AI Scientist

AntTech

Sep 27, 2020 · Artificial Intelligence

Question Directed Graph Attention Network for Numerical Reasoning over Text (QDGAT)

The paper introduces QDGAT, a question‑directed graph attention network that enhances numerical reasoning in reading comprehension by explicitly modeling relationships among numbers, entities, and questions, and demonstrates its effectiveness through extensive experiments on the DROP dataset.

DROPKnowledge GraphNLP

0 likes · 7 min read

Question Directed Graph Attention Network for Numerical Reasoning over Text (QDGAT)

DataFunTalk

Jul 7, 2020 · Artificial Intelligence

Optimizing Pretrained Language Model Inference: Lessons from the NLPCC Small Model Competition and Deployment at Xiaomi

This article shares the Xiaomi AI Lab NLP team's experience in the NLPCC lightweight language model competition, discusses efficiency challenges of large pretrained models like BERT, and details practical inference optimizations—including model distillation, batching, FP16 quantization, and FasterTransformer integration—that dramatically reduce latency and hardware costs in production.

AIBERTInference Optimization

0 likes · 15 min read

Optimizing Pretrained Language Model Inference: Lessons from the NLPCC Small Model Competition and Deployment at Xiaomi

DataFunTalk

May 6, 2020 · Artificial Intelligence

Application of Large-Scale Pretrained Models in Alibaba Machine Translation

This article reviews how large‑scale pretrained language models have reshaped NLP, outlines the challenges of applying them to machine translation, introduces the APT framework and the GRET architecture for better encoder‑decoder integration, and reports experimental gains and future research directions.

AIAPT frameworkGRET

0 likes · 10 min read

Application of Large-Scale Pretrained Models in Alibaba Machine Translation

DataFunTalk

Apr 16, 2020 · Artificial Intelligence

Comprehensive Survey of Pre-trained Models for Natural Language Processing

This article provides a detailed survey of pre‑trained models (PTMs) for natural language processing, classifying them into shallow embeddings and contextual encoders, discussing training paradigms such as knowledge integration and model compression, and offering guidance on transfer learning and future challenges.

Knowledge IntegrationPretrained Modelsmodel compression

0 likes · 25 min read

Comprehensive Survey of Pre-trained Models for Natural Language Processing

JD Retail Technology

Aug 8, 2019 · Artificial Intelligence

From Word Representations to Sentiment Analysis – Talk by Dr. Feng Ao

On August 6, Dr. Feng Ao presented a comprehensive overview of the evolution of word representations and sentiment analysis, illustrating the shift from traditional linguistic features to modern pretrained models such as BERT and XLNet, and sharing practical convolutional experiments relevant to industry applications.

Artificial IntelligenceNLPPretrained Models

0 likes · 4 min read

From Word Representations to Sentiment Analysis – Talk by Dr. Feng Ao