Tagged articles
507 articles
Page 2 of 6
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 18, 2023 · Artificial Intelligence

Are Large Language Models Really a Silver Bullet? Costs, Limits, and Alternatives

While the hype around large language models suggests they are a universal solution, this article examines their high operational costs, slow response times, unnecessary features, legal risks, and compares them with traditional NLP techniques, arguing that they are not a silver bullet but one tool among many.

AI limitationsNLPcost analysis
0 likes · 9 min read
Are Large Language Models Really a Silver Bullet? Costs, Limits, and Alternatives
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
Jul 29, 2023 · Artificial Intelligence

Getting Started with GPT: How Generative Pre‑Training and Discriminative Fine‑Tuning Work

This article explains GPT's two‑stage learning—unsupervised generative pre‑training on large raw corpora followed by discriminative fine‑tuning on labeled tasks—detailing the underlying Transformer decoder architecture, loss functions, and task‑specific input transformations.

Fine-tuningGPTGenerative Pre‑Training
0 likes · 5 min read
Getting Started with GPT: How Generative Pre‑Training and Discriminative Fine‑Tuning Work
DataFunSummit
DataFunSummit
Jul 28, 2023 · Big Data

User Path Analysis and SessionAnalytics: Business Practices, Technical Architecture, and Open‑Source Framework

This article introduces user path analysis and the SessionAnalytics open‑source framework, covering business scenarios, data processing techniques, algorithmic mining methods, technical architecture, implementation details, comparisons with event‑based analysis, and a comprehensive Q&A for practical deployment.

Big DataNLPdata engineering
0 likes · 19 min read
User Path Analysis and SessionAnalytics: Business Practices, Technical Architecture, and Open‑Source Framework
Huolala Tech
Huolala Tech
Jul 28, 2023 · Artificial Intelligence

How HuoLala Leverages AI to Revolutionize Service Quality Inspection

This article details HuoLala's AI‑driven intelligent quality inspection system, covering its NLP‑based semantic understanding pipeline, data denoising, confidence learning, contrastive learning, model acceleration techniques such as pruning, knowledge distillation, quantization, and interpretability methods to improve coverage, recall and risk detection.

NLPcontrastive learningdata denoising
0 likes · 23 min read
How HuoLala Leverages AI to Revolutionize Service Quality Inspection
Sohu Tech Products
Sohu Tech Products
Jul 26, 2023 · Artificial Intelligence

Attention Mechanism, Transformer Architecture, and BERT: An In-Depth Overview

This article provides a comprehensive overview of the attention mechanism, its mathematical foundations, the transformer model architecture—including encoder and decoder components—and the BERT pre‑training model, detailing their principles, implementations, and applications in natural language processing.

Attention MechanismBERTEncoder-Decoder
0 likes · 13 min read
Attention Mechanism, Transformer Architecture, and BERT: An In-Depth Overview
DataFunTalk
DataFunTalk
Jul 24, 2023 · Artificial Intelligence

Session Analytics: User Path Analysis, Data Processing, and Algorithm Mining

This article introduces user path analysis and the SessionAnalytics open‑source framework, covering business scenarios, technical architecture, data integration, session segmentation, data cleaning, sampling, graph structures, NLP‑based mining, clustering, and visualization techniques for extracting insights from large‑scale user behavior data.

NLPdata miningsession analytics
0 likes · 19 min read
Session Analytics: User Path Analysis, Data Processing, and Algorithm Mining
DataFunSummit
DataFunSummit
Jul 17, 2023 · Artificial Intelligence

Introduction to ModelScope Community's Fundamental NLP Models and Their Applications

This article introduces the ModelScope community's suite of foundational NLP models—including tokenization, POS tagging, NER, and text representation—detailing their architectures, performance, application scenarios, while also highlighting research contributions such as the ACE framework and retrieval‑enhanced techniques.

Artificial IntelligenceModelScopeNLP
0 likes · 21 min read
Introduction to ModelScope Community's Fundamental NLP Models and Their Applications
HelloTech
HelloTech
Jun 21, 2023 · Artificial Intelligence

Overview of Haro Intelligent Customer Service: Algorithms, Challenges, and AI Solutions

Haro’s intelligent customer service combines a smart FAQ recommender and a conversational chatbot that leverages matching‑based intent recognition, large‑scale domain pre‑training, metric‑learning for new intents, and fine‑tuned generative LLMs, achieving 82 % top‑1 accuracy while reducing human workload and outlining future API‑orchestrated, multimodal AI enhancements.

NLPailarge language model
0 likes · 10 min read
Overview of Haro Intelligent Customer Service: Algorithms, Challenges, and AI Solutions
NetEase LeiHuo Testing Center
NetEase LeiHuo Testing Center
Jun 2, 2023 · Artificial Intelligence

AI Techniques for a Global Search Platform: Word Segmentation, Text Similarity, Image Retrieval, and Multimodal Models

This article shares the development of a global search platform that leverages AI technologies such as Chinese word segmentation, part‑of‑speech tagging, text similarity via Simhash and Synonyms, image similarity using histogram, Hamming distance and ResNet‑50, and multimodal CLIP‑based models to improve search efficiency and accuracy.

MultimodalNLPai
0 likes · 12 min read
AI Techniques for a Global Search Platform: Word Segmentation, Text Similarity, Image Retrieval, and Multimodal Models
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
May 24, 2023 · Artificial Intelligence

COPNER: Contrastive Learning with Prompt Guidance for Few‑Shot Named Entity Recognition

The article introduces COPNER, a contrastive‑learning framework that uses class‑specific prompt words to guide sentence encoders, addressing the limited semantic capture of existing few‑shot NER methods and demonstrating superior performance across multiple benchmark datasets and K‑shot settings.

COPNERNLPcontrastive learning
0 likes · 4 min read
COPNER: Contrastive Learning with Prompt Guidance for Few‑Shot Named Entity Recognition
DataFunTalk
DataFunTalk
May 18, 2023 · Artificial Intelligence

Query Intent Recognition in Enterprise Search: Knowledge‑Enhanced and Pretrained Model Approaches

This article explains how Alibaba's enterprise search system tackles query intent recognition by combining knowledge‑enhanced techniques, short‑text classification, and pretrained language models such as StructBERT and prompt‑learning, and it shares two real‑world case studies, experimental results, and future research directions.

Enterprise searchKnowledge EnhancementNLP
0 likes · 19 min read
Query Intent Recognition in Enterprise Search: Knowledge‑Enhanced and Pretrained Model Approaches
DataFunSummit
DataFunSummit
May 17, 2023 · Artificial Intelligence

Event Extraction: Overview, Methods, and the OmniEvent Toolkit

This article reviews the development of event extraction, explains its importance for knowledge graphs, surveys four major algorithmic paradigms, introduces the OmniEvent open‑source toolkit with its unified benchmark and modular design, and outlines future research directions such as document‑level extraction and event relation modeling.

Event ExtractionInformation ExtractionNLP
0 likes · 11 min read
Event Extraction: Overview, Methods, and the OmniEvent Toolkit
Full-Stack Trendsetter
Full-Stack Trendsetter
May 15, 2023 · Artificial Intelligence

Do You Really Understand ChatGPT, the Era‑Defining AI?

This article explains what ChatGPT is, how it builds on natural-language-processing and the Transformer-based GPT series, details its model-size growth, architectural enhancements, multilingual support, and walks through the tokenization-to-generation pipeline that enables coherent AI-driven conversations.

ChatGPTDeep LearningGPT-3
0 likes · 8 min read
Do You Really Understand ChatGPT, the Era‑Defining AI?
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
May 8, 2023 · Artificial Intelligence

Understanding the Principles Behind ChatGPT: NLP, Transformers, and Reinforcement Learning

This article explains how ChatGPT works by covering the fundamentals of natural language processing, generative language models, deep learning, the Transformer architecture, attention mechanisms, few‑shot learning, and the reinforcement‑learning techniques that align its outputs with human preferences.

ChatGPTNLPReinforcement Learning
0 likes · 24 min read
Understanding the Principles Behind ChatGPT: NLP, Transformers, and Reinforcement Learning
DataFunSummit
DataFunSummit
May 6, 2023 · Artificial Intelligence

The Convergence of NLP and Computer Vision: Unified Neural Architectures and Pre‑training Strategies

This talk reviews the recent trend of unifying natural‑language processing and computer‑vision models through shared transformer architectures, masked‑image‑modeling pre‑training, brain‑inspired prediction mechanisms, and practical benefits such as knowledge sharing, multimodal applications, and cost efficiency, while highlighting the evolution of Swin Transformer and its next‑generation variants.

NLPTransformerUnified Architecture
0 likes · 20 min read
The Convergence of NLP and Computer Vision: Unified Neural Architectures and Pre‑training Strategies
DataFunTalk
DataFunTalk
Apr 29, 2023 · Operations

WeChat NLP Algorithm Microservice Governance: Challenges and Solutions

This article examines the governance of WeChat NLP algorithm microservices, outlining the management, performance, and scheduling challenges they pose, and presents solutions including automated CI/CD pipelines, task‑aware auto‑scaling, DAG‑based service composition, custom Python interpreter PyInter, and an improved Joint‑Idle‑Queue load‑balancing algorithm.

MicroservicesNLPOperations
0 likes · 13 min read
WeChat NLP Algorithm Microservice Governance: Challenges and Solutions
JD Tech
JD Tech
Apr 20, 2023 · Artificial Intelligence

Comprehensive Overview of ChatGPT: AI Background, Technical Foundations, and Commercial Applications

This extensive report examines ChatGPT’s origins, the evolution of artificial intelligence and natural language processing, details the underlying Transformer architecture and GPT series, discusses its limitations, and explores the wide-ranging commercial applications and future prospects of generative AI.

AIGCArtificial IntelligenceChatGPT
0 likes · 34 min read
Comprehensive Overview of ChatGPT: AI Background, Technical Foundations, and Commercial Applications
DataFunTalk
DataFunTalk
Apr 12, 2023 · Artificial Intelligence

Prompt Engineering for ChatGPT: Principles, Design Steps, and Practical Cases

This article provides a comprehensive overview of ChatGPT prompt engineering, covering its background, design principles, step‑by‑step workflow, numerous practical examples—including code generation, entity extraction, and style rewriting—and discusses why prompts are crucial for large language model performance.

Artificial IntelligenceChatGPTNLP
0 likes · 30 min read
Prompt Engineering for ChatGPT: Principles, Design Steps, and Practical Cases
Architect
Architect
Apr 9, 2023 · Artificial Intelligence

Evaluating the Commonsense Knowledge and Reasoning Capabilities of ChatGPT and Other Large Language Models

This study systematically evaluates ChatGPT and other large language models on their ability to answer commonsense questions, assess their knowledge awareness, and utilize generated knowledge for reasoning, revealing strong QA performance but notable gaps in social and temporal commonsense and in leveraging contextual knowledge.

ChatGPTLarge Language ModelsNLP
0 likes · 20 min read
Evaluating the Commonsense Knowledge and Reasoning Capabilities of ChatGPT and Other Large Language Models
DataFunSummit
DataFunSummit
Mar 25, 2023 · Artificial Intelligence

How GPT‑4 Has Changed NLP Research: Community Perspectives

A collection of Zhihu answers reflects on how the release of GPT‑4 has reshaped NLP research, dividing the community into LLM‑enthusiasts and skeptics, discussing the relevance of parsing, resource‑driven research directions, and the existential challenges faced by researchers.

Academic CommunityGPT-4LLM
0 likes · 10 min read
How GPT‑4 Has Changed NLP Research: Community Perspectives
Sohu Tech Products
Sohu Tech Products
Mar 22, 2023 · Artificial Intelligence

An Overview of Prompt Learning in Natural Language Processing

This article reviews the evolution of NLP training paradigms, explains why prompt learning is needed, defines its core concepts, and surveys major hard‑template and soft‑template methods such as PET, LM‑BFF, P‑tuning, and Prefix‑tuning, highlighting their advantages for few‑shot and zero‑shot scenarios.

Few-ShotNLPPrompt Tuning
0 likes · 10 min read
An Overview of Prompt Learning in Natural Language Processing
Python Programming Learning Circle
Python Programming Learning Circle
Mar 21, 2023 · Artificial Intelligence

Analyzing WeChat Friend Data with Python: Gender, Avatar, Signature, and Location Insights

This tutorial demonstrates how to use Python libraries such as itchat, jieba, matplotlib, SnowNLP, and Tencent Youtu SDK to collect WeChat friend information and perform data analysis on gender distribution, avatar characteristics, signature text (including word‑cloud and sentiment analysis), and geographic location, presenting the results with visual charts and maps.

NLPWeChatdata-analysis
0 likes · 14 min read
Analyzing WeChat Friend Data with Python: Gender, Avatar, Signature, and Location Insights
Alipay Experience Technology
Alipay Experience Technology
Mar 21, 2023 · Artificial Intelligence

How to Make OpenAI’s API Understand Ultra‑Long Insurance Policies

This article explains how to overcome OpenAI's token limits by splitting massive insurance documents into manageable chunks, vectorizing them with embeddings, using a custom "broccoli" algorithm for intelligent segmentation, and compressing text with dictionary mapping and tokenization techniques to enable accurate question‑answering via the API.

APIDocument SplittingNLP
0 likes · 22 min read
How to Make OpenAI’s API Understand Ultra‑Long Insurance Policies
Sohu Tech Products
Sohu Tech Products
Mar 16, 2023 · Artificial Intelligence

ChatGPT Data Augmentation Methods for NLP

This article introduces various ChatGPT‑based data‑augmentation techniques for natural language processing, explains how to use prompts for synonym, antonym, homophone, random insertion, deletion, and swapping transformations, and provides concrete example prompts and outputs to illustrate each method.

Artificial IntelligenceChatGPTNLP
0 likes · 15 min read
ChatGPT Data Augmentation Methods for NLP
DataFunSummit
DataFunSummit
Mar 16, 2023 · Artificial Intelligence

Construction of Real‑World Medical Knowledge Graphs and Clinical Event Graphs

The article describes how YiduCloud builds real‑world medical knowledge graphs and clinical event graphs from heterogeneous hospital systems (EMR, HIS, LIS, RIS) using data aggregation, de‑identification, quality control, NLP‑driven entity extraction, standardisation, graph construction, cleaning, embedding and various AI‑powered applications such as decision support, intelligent diagnosis, automated medical‑record generation and patient recruitment.

Big DataMedical Knowledge GraphNLP
0 likes · 21 min read
Construction of Real‑World Medical Knowledge Graphs and Clinical Event Graphs
DataFunTalk
DataFunTalk
Mar 1, 2023 · Artificial Intelligence

ACL 2023 Multi‑lingual Document‑grounded Dialogue Competition Overview

The ACL 2023 Multi‑lingual Document‑grounded Dialogue Competition, hosted by Alibaba DAMO Academy and Nanjing University, introduces the first multilingual document‑dialogue dataset, provides a baseline system, offers a $7,000 prize pool, and invites participants to submit papers to the Doc2dial Workshop for Best Paper awards.

ACL2023DatasetNLP
0 likes · 6 min read
ACL 2023 Multi‑lingual Document‑grounded Dialogue Competition Overview
DataFunTalk
DataFunTalk
Feb 27, 2023 · Artificial Intelligence

Exploring ChatGPT: Evolution, Technical Foundations, and Practical Applications

This article reviews the development of ChatGPT from early GPT models, explains its underlying RLHF training, compares it with BERT and GPT‑3, and discusses practical applications such as intelligent writing, customer service, and voice calling, while evaluating performance, cost, and future prospects.

AI applicationsChatGPTNLP
0 likes · 22 min read
Exploring ChatGPT: Evolution, Technical Foundations, and Practical Applications
DataFunSummit
DataFunSummit
Feb 26, 2023 · Artificial Intelligence

Design Philosophy and Industrial Practices of PaddleNLP

This article reviews the development trends of open‑source NLP products, explains PaddleNLP’s design principles—task‑centric, model‑centric, and solution‑centric—along with its modular, ecosystem‑driven, and production‑ready architecture, and showcases several industry case studies demonstrating its practical applications.

AI pipelinesIndustrial ApplicationsNLP
0 likes · 17 min read
Design Philosophy and Industrial Practices of PaddleNLP
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Feb 20, 2023 · Industry Insights

Why Pre‑trained Large Models Are the New Infrastructure for AI Applications

Pre‑trained large models are emerging as the foundational infrastructure for AI across industries; this article analyzes their technical advantages, application trends in NLP, CV and multimodal domains, presents a telecom customer‑service case study with performance benchmarks, and outlines future deployment challenges and research directions.

Computer VisionNLPPrompt Tuning
0 likes · 23 min read
Why Pre‑trained Large Models Are the New Infrastructure for AI Applications
DataFunSummit
DataFunSummit
Feb 19, 2023 · Artificial Intelligence

Intelligent Writing Assistant: TexSmart and Effidit Systems, Multi‑Level Unsupervised Text Rewriting, and the New ParaScore Evaluation Metric

This article presents Tencent AI Lab's intelligent writing assistant, detailing the TexSmart text‑understanding platform, the Effidit writing‑assistant features, a multi‑level controllable unsupervised text‑rewriting method, and a novel ParaScore metric that jointly measures semantic similarity and diversity for paraphrase evaluation.

AI writingEvaluation MetricsNLP
0 likes · 14 min read
Intelligent Writing Assistant: TexSmart and Effidit Systems, Multi‑Level Unsupervised Text Rewriting, and the New ParaScore Evaluation Metric
DataFunSummit
DataFunSummit
Feb 16, 2023 · Artificial Intelligence

Curated Collection of Articles on AI‑Powered Smart Medicine

This guide introduces the challenges in healthcare, explains how artificial intelligence is already reshaping the field, and provides a curated list of recent articles on smart medicine for readers to explore the emerging AI‑healthcare integration.

Big DataHealthcareNLP
0 likes · 4 min read
Curated Collection of Articles on AI‑Powered Smart Medicine
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Feb 16, 2023 · Artificial Intelligence

Intelligent Creative Generation and Optimization for Xiaohongshu Advertising

Xiaohongshu’s end‑to‑end intelligent creative platform extracts high‑quality images, generates diverse titles with RED‑pretrained GPT‑2/T5 models, and selects the best ads using a UCB‑based multi‑armed bandit that balances CTR uplift, revenue and user‑experience, while employing position‑corrected metrics and a scalable dual‑tower DNN to boost long‑tail performance and overall revenue.

AdvertisingNLPUCB
0 likes · 18 min read
Intelligent Creative Generation and Optimization for Xiaohongshu Advertising
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Feb 10, 2023 · Artificial Intelligence

Expert Insights on ChatGPT: Technical Challenges, Applications, and Future Directions

In a REDtech live interview, NLP professor Li Lei and Xiaohongshu engineers examined ChatGPT’s strengths—long, topic‑focused replies and few‑shot learning—and its challenges such as hallucinations, safety, lack of real‑time data, model compression, and multimodal AIGC, outlining how the technology could reshape content creation, customer service, and search while requiring careful risk management.

AI SafetyChatGPTIn-Context Learning
0 likes · 20 min read
Expert Insights on ChatGPT: Technical Challenges, Applications, and Future Directions
Architect's Guide
Architect's Guide
Feb 9, 2023 · Artificial Intelligence

Why ChatGPT Is So Powerful: A Technical Overview of NLP Model Evolution

This article explains why ChatGPT performs so well by tracing the evolution of natural‑language processing from rule‑based grammars through statistical n‑gram models to neural architectures like RNNs, LSTMs, attention mechanisms, Transformers, and the massive data and training methods that power modern large language models.

ChatGPTNLPTransformer
0 likes · 14 min read
Why ChatGPT Is So Powerful: A Technical Overview of NLP Model Evolution
JD Cloud Developers
JD Cloud Developers
Feb 8, 2023 · Operations

Boosting Log Anomaly Detection with NLP and Deep Learning

This article presents a log anomaly detection approach that leverages NLP techniques such as Part‑of‑Speech tagging and Named Entity Recognition combined with deep neural networks, detailing a six‑step model, experimental validation on three datasets, and superior performance compared with existing DeepLog and LogClass methods.

DNNDeep LearningNER
0 likes · 13 min read
Boosting Log Anomaly Detection with NLP and Deep Learning
NewBeeNLP
NewBeeNLP
Feb 7, 2023 · Artificial Intelligence

Mastering ChatGPT Prompt Engineering: Principles, Steps, and Real-World Examples

This article provides a comprehensive guide to ChatGPT prompt engineering, covering background concepts, design principles, step‑by‑step workflows, diverse use‑case examples, model limitations, and references to key research papers, helping readers craft effective prompts for various NLP tasks.

ChatGPTIn-Context LearningNLP
0 likes · 30 min read
Mastering ChatGPT Prompt Engineering: Principles, Steps, and Real-World Examples
DataFunTalk
DataFunTalk
Jan 28, 2023 · Artificial Intelligence

Industry Search: Background, Technologies, and Real‑World Applications

This article presents a comprehensive overview of industry search, covering its background, core retrieval and ranking technologies—including sparse and dense retrieval, pre‑trained language models, tokenization, NER, adaptive multi‑task training, and re‑ranking models—followed by detailed case studies such as address analysis, family‑ID unification, emergency call handling, education photo‑search, and power‑knowledge‑base integration.

NLPaddress analysisindustry search
0 likes · 13 min read
Industry Search: Background, Technologies, and Real‑World Applications
21CTO
21CTO
Jan 16, 2023 · Artificial Intelligence

7 AI Trends Shaping 2023: From Model Governance to the Metaverse

The 2023 AI landscape will be driven by model‑governance reforms, NLP breakthroughs, hyper‑automation, AI‑powered recruiting, metaverse integration, enhanced cybersecurity, and conversational chatbots, all reshaping industries and creating massive economic opportunities worldwide.

2023 trendsMetaverseNLP
0 likes · 8 min read
7 AI Trends Shaping 2023: From Model Governance to the Metaverse
DataFunSummit
DataFunSummit
Jan 15, 2023 · Artificial Intelligence

Intelligent Writing: AIGC Technologies, Models, Evaluation Metrics, and Real‑World Applications

This article surveys the evolution of AI‑generated content for intelligent writing, covering its definition, key technologies from RNN Seq2Seq to Transformer‑based models such as UniLM, T5, BART and GPT series, evaluation datasets and metrics, product deployments by Datagrand, and the remaining challenges and future directions.

AI writingAIGCGPT
0 likes · 25 min read
Intelligent Writing: AIGC Technologies, Models, Evaluation Metrics, and Real‑World Applications
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Jan 10, 2023 · Artificial Intelligence

Sentiment Classification and Topic Clustering for NetEase Cloud Music Comments

To boost NetEase Cloud Music’s comment handling, the authors combine active‑learning‑driven relabeling, domain‑specific MLM pretraining, contrastive‑learning‑based sample expansion, and multi‑task BERT sharing to raise sentiment‑classification precision and recall above 90 % and double moderation clean‑rate, while employing prompt‑generated story themes, IP‑focused classifiers, and hot‑word aggregation for effective short‑text topic clustering and scalable, theme‑aware distribution.

NLPSentiment Analysisactive learning
0 likes · 10 min read
Sentiment Classification and Topic Clustering for NetEase Cloud Music Comments
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 12, 2022 · Artificial Intelligence

How Unified Prompt Tuning Boosts Few-Shot NLP Performance Across Tasks

Unified Prompt Tuning (UPT) is a meta-learning based few‑shot algorithm that converts diverse NLP tasks into a common Prompt‑Options‑Verbalizer format, enabling large pre‑trained language models to achieve higher accuracy with minimal labeled data, as demonstrated on EMNLP‑2022 benchmarks and SuperGLUE datasets.

Few‑Shot LearningMeta LearningNLP
0 likes · 10 min read
How Unified Prompt Tuning Boosts Few-Shot NLP Performance Across Tasks
DataFunTalk
DataFunTalk
Dec 9, 2022 · Artificial Intelligence

POI Recognition and Alias Linking in Travel Search: Challenges, Algorithmic Practices, and Online Impact

The article presents a comprehensive study of POI (point‑of‑interest) recognition and alias linking within travel search, detailing background challenges, a multi‑stage algorithmic framework, extensive offline experiments, and the resulting improvements in online conversion and relevance.

Alias LinkingNLPPOI Recognition
0 likes · 14 min read
POI Recognition and Alias Linking in Travel Search: Challenges, Algorithmic Practices, and Online Impact
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 9, 2022 · Artificial Intelligence

How SpanProto Boosts Few-Shot NER Accuracy with a Two-Stage Span Approach

SpanProto, a two‑stage span‑based prototypical network, dramatically improves few‑shot named entity recognition by extracting candidate spans with a global boundary matrix and classifying them via prototypical and margin learning, achieving notable gains on the Few‑NERD benchmark with minimal labeled data.

EMNLP 2022NLPnamed entity recognition
0 likes · 8 min read
How SpanProto Boosts Few-Shot NER Accuracy with a Two-Stage Span Approach
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 8, 2022 · Artificial Intelligence

KECP: Enhancing Few-Shot Machine Reading Comprehension via Knowledge-Driven Prompt Tuning

KECP, a Knowledge‑Enhanced Contrastive Prompt‑tuning model, achieves strong few‑shot extractive question answering by converting questions to masked statements, injecting external knowledge via gated fusion, and leveraging contrastive learning alongside masked language modeling, as demonstrated on EMNLP‑2022 benchmarks.

NLPcontrastive learningknowledge injection
0 likes · 9 min read
KECP: Enhancing Few-Shot Machine Reading Comprehension via Knowledge-Driven Prompt Tuning
DataFunTalk
DataFunTalk
Nov 26, 2022 · Artificial Intelligence

Human‑Centric Design for AI/NLP Document Extraction and Knowledge‑Graph Deployment

The article explains how combining human expertise with AI techniques—through problem decomposition, model selection, feature engineering, and knowledge‑graph construction—enables practical NLP solutions for document extraction and intelligent Q&A, illustrating the process with contract‑field extraction case studies.

Document ExtractionHuman-in-the-LoopModel Selection
0 likes · 14 min read
Human‑Centric Design for AI/NLP Document Extraction and Knowledge‑Graph Deployment
DataFunTalk
DataFunTalk
Nov 22, 2022 · Artificial Intelligence

NVIDIA's Advances in Multi‑Role Generative Dialogue Modeling and Synthetic Data‑Driven QA

This article reviews NVIDIA's recent work on multi‑role generative dialogue modeling using GPT‑2‑based architectures and on enhancing question‑answering systems with synthetic data pipelines, covering model design, data preparation from Reddit, extensive experiments, scaling effects, and practical Q&A insights.

GPT-2Generative DialogueModel Scaling
0 likes · 17 min read
NVIDIA's Advances in Multi‑Role Generative Dialogue Modeling and Synthetic Data‑Driven QA
DataFunTalk
DataFunTalk
Nov 21, 2022 · Artificial Intelligence

Research on Information Extraction from a Graph Perspective

This presentation reviews the background, significance, current research status, objectives, and key contributions of a graph‑based approach to information extraction, covering entity recognition, relation extraction, event extraction, open‑domain extraction, and the proposed unified modeling framework with experimental results.

Graph ModelingInformation ExtractionNLP
0 likes · 27 min read
Research on Information Extraction from a Graph Perspective
DataFunSummit
DataFunSummit
Nov 20, 2022 · Artificial Intelligence

NLP Technology Applications and Research in Voice Assistants

This article presents an in‑depth overview of NLP techniques used in voice assistants, covering the end‑to‑end conversational AI pipeline, intent and slot modeling, multi‑turn dialog management, model deployment pipelines, quantization methods, and self‑learning strategies for continuous improvement.

Conversational AIModel QuantizationNLP
0 likes · 30 min read
NLP Technology Applications and Research in Voice Assistants
Baidu Geek Talk
Baidu Geek Talk
Nov 16, 2022 · Artificial Intelligence

How Baidu’s Ernie‑SimCSE Uses Contrastive Learning to Crush Spam Promotion

This article explains how Baidu's anti‑spam team tackled large‑scale promotional spam on Baidu Zhidao by combining the Ernie pretrained model with SimCSE contrastive learning, detailing the problem background, traditional methods, text‑representation stages, the SimCSE approach, training pipeline, optimizations, and experimental results.

ErnieNLPSimCSE
0 likes · 15 min read
How Baidu’s Ernie‑SimCSE Uses Contrastive Learning to Crush Spam Promotion
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Nov 11, 2022 · Artificial Intelligence

Language Model as a Service and Black‑Box Optimization: Insights from Prof. Qiu Xipeng’s Talk

Prof. Qiu Xipeng’s talk highlighted how large language models can be offered as a service and efficiently adapted via in‑context learning, lightweight label‑tuning, and gradient‑free black‑box optimization, showcasing a unified asymmetric Transformer (CPT) that handles understanding, generation, ABSA and NER tasks while reducing resource demands.

Black-Box OptimizationLLMLanguage Model
0 likes · 15 min read
Language Model as a Service and Black‑Box Optimization: Insights from Prof. Qiu Xipeng’s Talk
Zuoyebang Tech Team
Zuoyebang Tech Team
Nov 9, 2022 · Artificial Intelligence

Boost Data Annotation Efficiency with VAPAL: Active Learning Meets Virtual Adversarial Perturbation

This article explains how a pool‑based active learning framework that combines uncertainty sampling (using BADGE, ALPS, or virtual adversarial perturbations) with diversity‑driven clustering can dramatically cut labeling costs for Transformer‑based NLP models, and presents experimental results showing VAPAL’s competitive performance and early‑stage advantages.

NLPactive learningdata annotation
0 likes · 10 min read
Boost Data Annotation Efficiency with VAPAL: Active Learning Meets Virtual Adversarial Perturbation
Youzan Coder
Youzan Coder
Oct 24, 2022 · Artificial Intelligence

Knowledge Base Retrieval Matching: Algorithm and Engineering Service Practice

The article outlines a comprehensive knowledge‑base retrieval matching solution—combining PageRank‑enhanced DSL rewriting, keyword and dual‑tower vector recall, contrastive fine‑ranking, and optimized vector‑based ranking—implemented via offline DP training and Sunfish online inference on Milvus, with applications in enterprise search and recommendations and future plans for graph‑neural embeddings.

InfoNCEMilvusNLP
0 likes · 12 min read
Knowledge Base Retrieval Matching: Algorithm and Engineering Service Practice
DataFunTalk
DataFunTalk
Oct 18, 2022 · Artificial Intelligence

Large‑Model and Small‑Model Interaction: Knowledge Distillation and Reverse Distillation Techniques

This article explains how large‑scale NLP models can be paired with smaller models through task‑related and task‑unrelated knowledge distillation, progressive multi‑stage distillation, and reverse distillation, thereby reducing training costs, accelerating inference, and even allowing small models to improve large‑model training via sample‑value assessment.

NLPreverse distillationsample selection
0 likes · 11 min read
Large‑Model and Small‑Model Interaction: Knowledge Distillation and Reverse Distillation Techniques
Ctrip Technology
Ctrip Technology
Oct 13, 2022 · Artificial Intelligence

Chinese New Word Discovery: From Traditional Unsupervised Methods to CNN‑Based Deep Learning

The article examines the challenge of out‑of‑vocabulary terms in Chinese e‑commerce NLP, reviews classic unsupervised metrics such as frequency, cohesion and neighbor entropy, and proposes a lightweight fully‑convolutional network inspired by image‑segmentation techniques to automatically detect new words.

CNNDeep LearningNLP
0 likes · 10 min read
Chinese New Word Discovery: From Traditional Unsupervised Methods to CNN‑Based Deep Learning
DataFunTalk
DataFunTalk
Oct 13, 2022 · Artificial Intelligence

Multimodal Attribute-Level Sentiment Analysis for Social Media: Background, Tasks, and Recent Advances

This article reviews the rapid development of multimodal attribute-level sentiment analysis on social media, outlining its background, defining four core sub‑tasks, summarizing representative recent models—including unified multimodal transformers, coarse‑to‑fine image‑target matching, and vision‑language pre‑training—and discussing experimental results and future research directions.

Deep LearningNLPaspect based sentiment
0 likes · 21 min read
Multimodal Attribute-Level Sentiment Analysis for Social Media: Background, Tasks, and Recent Advances
DataFunTalk
DataFunTalk
Oct 10, 2022 · Artificial Intelligence

Model Compression and Deployment of Pre‑trained Language Models at Meituan

This article presents Meituan's practical experience with compressing large pre‑trained language models—covering challenges of large‑model deployment, compression techniques such as knowledge distillation, pruning and quantization, the AutoDisc assistant‑model approach, multi‑teacher and iterative distillation, and real‑world applications in search advertising, intelligent assistants, and dual‑tower semantic matching.

MeituanNLPpretrained language models
0 likes · 17 min read
Model Compression and Deployment of Pre‑trained Language Models at Meituan
HaoDF Tech Team
HaoDF Tech Team
Oct 8, 2022 · Artificial Intelligence

Exploring Transformer Technology and Its Applications in NLP, Computer Vision, and OCR at Haodf.com

This article introduces the Transformer architecture, explains its attention mechanism, details its adaptations for natural language processing, computer vision, and OCR tasks, and presents experimental results of various models such as BERT, ELECTRA, Swin Transformer, and CRNN-BCN on large-scale medical data from Haodf.com.

Model EvaluationNLPOCR
0 likes · 39 min read
Exploring Transformer Technology and Its Applications in NLP, Computer Vision, and OCR at Haodf.com
DataFunTalk
DataFunTalk
Oct 3, 2022 · Artificial Intelligence

Building Real‑World Medical Knowledge Graphs and Clinical Event Graphs: Methods, Pipelines, and Applications

This article explains how YiduCore processes heterogeneous hospital data (EMR, HIS, LIS, RIS, literature) to construct real‑world medical knowledge graphs and clinical event graphs, detailing pipelines for entity extraction, normalization, graph cleaning, PSR scoring, graph embedding, and showcasing applications such as intelligent diagnosis, question answering, automated medical record generation, and clinical trial patient recruitment.

Big DataMedical Knowledge GraphNLP
0 likes · 21 min read
Building Real‑World Medical Knowledge Graphs and Clinical Event Graphs: Methods, Pipelines, and Applications
ELab Team
ELab Team
Sep 23, 2022 · Artificial Intelligence

Fine‑Tune a Chinese BERT Model for Cloze Tasks in 30 Minutes

This tutorial walks you through NLP fundamentals, the evolution of BERT, the concept of pre‑trained models, and a step‑by‑step guide to fine‑tune a Chinese BERT on a cloze‑style task, complete with code snippets and verification results.

BERTChineseCloze Task
0 likes · 13 min read
Fine‑Tune a Chinese BERT Model for Cloze Tasks in 30 Minutes
Zuoyebang Tech Team
Zuoyebang Tech Team
Sep 23, 2022 · Artificial Intelligence

How AI Powers K‑12 Education: Insights from a Chief Algorithm Expert

In this interview, the chief algorithm expert at Zuoyebang discusses how AI technologies such as NLP, speech recognition, large‑model pre‑training, and knowledge‑graph construction are applied to K‑12 education, covering practical challenges, deployment strategies, and future research directions.

Education TechnologyNLPai
0 likes · 27 min read
How AI Powers K‑12 Education: Insights from a Chief Algorithm Expert
DataFunTalk
DataFunTalk
Sep 21, 2022 · Artificial Intelligence

XiaoAi Intelligent QA: Information Extraction, Event Extraction, and Knowledge Graph Question Answering

This presentation details the XiaoAi intelligent assistant’s QA system, covering its application scenarios, information extraction techniques (including relation and event extraction with SPO/PSO models), graph‑based question answering methods, cross‑domain slot extraction, path retrieval, and practical Q&A insights.

Graph QANLPai
0 likes · 18 min read
XiaoAi Intelligent QA: Information Extraction, Event Extraction, and Knowledge Graph Question Answering
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Sep 21, 2022 · Artificial Intelligence

Unlocking PEGASUS: How EasyNLP Simplifies Text Summarization with Pre‑Training

This article explains the importance of text generation, introduces the PEGASUS model’s gap‑sentence pre‑training for abstractive summarization, and shows how the EasyNLP framework integrates PEGASUS and other Chinese and English summarization models with step‑by‑step installation, data preparation, and training commands.

EasyNLPNLPPEGASUS
0 likes · 22 min read
Unlocking PEGASUS: How EasyNLP Simplifies Text Summarization with Pre‑Training
DataFunTalk
DataFunTalk
Sep 20, 2022 · Artificial Intelligence

Graph4NLP: An Open‑Source Graph Neural Network Library for Natural Language Processing

Graph4NLP is a PyTorch‑ and DGL‑based open‑source library that provides a full pipeline—from static and dynamic graph construction to embedding, learning, prediction, and inference—for applying graph neural networks to a wide range of NLP tasks, with extensive documentation, demos, and future scalability plans.

DGLGraph4NLPNLP
0 likes · 13 min read
Graph4NLP: An Open‑Source Graph Neural Network Library for Natural Language Processing
Zuoyebang Tech Team
Zuoyebang Tech Team
Sep 15, 2022 · Artificial Intelligence

How We Replaced BERT with a Lightweight TextCNN to Slash GPU Costs

This article describes the production challenges of using BERT for large‑scale text classification at Zuoyebang, explores lightweight alternatives such as knowledge distillation, pruning and quantization, and details a teacher‑student‑active‑learning pipeline that trains a TextCNN model to match BERT performance while dramatically reducing GPU consumption and improving throughput.

BERTModel DeploymentNLP
0 likes · 13 min read
How We Replaced BERT with a Lightweight TextCNN to Slash GPU Costs
DataFunSummit
DataFunSummit
Sep 8, 2022 · Artificial Intelligence

GAST: Graph Adaptive Semantic Transfer Model for Cross‑Domain Sentiment Analysis

This article introduces GAST, a graph‑adaptive semantic transfer framework that combines POS‑based Transformers and hybrid graph attention to improve cross‑domain sentiment analysis, presents related work, details the model architecture, reports extensive experiments showing state‑of‑the‑art results, and discusses future directions.

GAST modelNLPPOS tagging
0 likes · 13 min read
GAST: Graph Adaptive Semantic Transfer Model for Cross‑Domain Sentiment Analysis
HomeTech
HomeTech
Sep 8, 2022 · Artificial Intelligence

Concept Tag Mining for Recommendation Systems: Methods, Challenges, and Solutions

This article presents a comprehensive overview of concept tag mining for recommendation systems, describing unsupervised pattern‑matching, semi‑supervised AutoPhase, and supervised NER approaches, analyzing their advantages and drawbacks, and offering practical solutions to tag duplication and quality issues.

NERNLPSemi-supervised Learning
0 likes · 11 min read
Concept Tag Mining for Recommendation Systems: Methods, Challenges, and Solutions
Youzan Coder
Youzan Coder
Sep 5, 2022 · Artificial Intelligence

Inside Youzan’s Query Parser: Architecture, Plugins, and Real‑World Impact

This article explains the role of Youzan’s Query Parser (QP) in search, walks through its overall and layered architecture, details each algorithmic plugin—from preprocessing to synonym handling—and shows concrete code examples and results that improve search relevance across multiple retail scenarios.

NLPSystem ArchitectureYouzan
0 likes · 12 min read
Inside Youzan’s Query Parser: Architecture, Plugins, and Real‑World Impact
Programmer DD
Programmer DD
Aug 30, 2022 · Artificial Intelligence

How to Build a Custom HanLP Analyzer Plugin for Elasticsearch with Nginx

This guide walks through setting up a Java GraalVM 17 environment, installing Nginx to serve static dictionary files, configuring a HanLP‑based Elasticsearch analyzer plugin, packaging and deploying it, and testing the analyzer with JUnit5 and curl commands.

ElasticsearchHanLPNLP
0 likes · 14 min read
How to Build a Custom HanLP Analyzer Plugin for Elasticsearch with Nginx
DataFunSummit
DataFunSummit
Jul 27, 2022 · Artificial Intelligence

DataFun 2022 Natural Language Processing Summit – Leading Experts Discuss Large‑Scale Language Models, Multimodal Understanding, Dialogue Systems and AI Applications

The DataFun 2022 NLP Summit, held on July 30, brings together top researchers and industry leaders from Alibaba, Baidu, Microsoft, Amazon, and more to present the latest advances in large‑scale pre‑training, multimodal perception, information extraction, dialogue interaction, machine translation, and practical AI deployments, with live streaming and free registration via QR code.

Dialogue SystemsInformation ExtractionLarge Language Models
0 likes · 44 min read
DataFun 2022 Natural Language Processing Summit – Leading Experts Discuss Large‑Scale Language Models, Multimodal Understanding, Dialogue Systems and AI Applications
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Jul 19, 2022 · Artificial Intelligence

How NER Dominated NLPCC 2022: Techniques Behind the Winning Model

This article reviews the recent NLPCC 2022 NER competition, explains the evolution of named entity recognition, details the five major modeling paradigms, and describes the winning team’s relation‑classification approach, data‑augmentation strategy, experimental results, and its practical deployment in NetEase Cloud Commerce services.

Artificial IntelligenceDeep LearningNLP
0 likes · 13 min read
How NER Dominated NLPCC 2022: Techniques Behind the Winning Model
Bitu Technology
Bitu Technology
Jul 8, 2022 · Artificial Intelligence

Applying NLP and Machine Learning to Classify Tubi User Feedback

This article explains how Tubi leverages natural‑language processing, sentence embeddings (USE and BERT), and LightGBM models to automatically categorize large volumes of Net Promoter Score comments and customer‑support tickets, enabling data‑driven product decisions and workflow automation.

LightGBMNLPTubi
0 likes · 11 min read
Applying NLP and Machine Learning to Classify Tubi User Feedback
DataFunTalk
DataFunTalk
Jul 8, 2022 · Artificial Intelligence

Civil Aviation QA Competition (CCL2022‑DQAB): Task Description, Data, Evaluation Metrics, and Prizes

The CCL2022‑DQAB competition, organized by Beihang University and AVIC Mobile Technology, invites participants to develop reading‑comprehension models for extracting accurate question‑answer pairs from civil aviation texts, offering detailed task definitions, evaluation criteria, dataset statistics, a prize structure, and a competition schedule.

Civil AviationDatasetEvaluation Metrics
0 likes · 5 min read
Civil Aviation QA Competition (CCL2022‑DQAB): Task Description, Data, Evaluation Metrics, and Prizes
AntTech
AntTech
Jul 7, 2022 · Artificial Intelligence

Ant Group Insurance Technology Wins First Place in Fine‑Grained Dialogue Social Bias Detection at NLPCC 2023

Ant Group's insurance technology team secured the top spot in the fine‑grained dialogue social bias detection task at the 11th CCF NLPCC conference, showcasing their AI‑driven bias‑mitigation methods, a proprietary pre‑trained model AntInsBert, and a claim‑automation system that boosts insurance service fairness and efficiency.

AntInsBertBias DetectionInsurance AI
0 likes · 3 min read
Ant Group Insurance Technology Wins First Place in Fine‑Grained Dialogue Social Bias Detection at NLPCC 2023
政采云技术
政采云技术
Jul 5, 2022 · Artificial Intelligence

Overview of Natural Language Processing Techniques and Their Evolution

This article provides a comprehensive overview of natural language processing, covering its definition, historical development from one‑hot encoding to modern models such as word2vec, ELMo, GPT, and BERT, and discusses the advantages, limitations, and key concepts of each technique.

Artificial IntelligenceNLPWord Embedding
0 likes · 23 min read
Overview of Natural Language Processing Techniques and Their Evolution
Airbnb Technology Team
Airbnb Technology Team
Jul 4, 2022 · Artificial Intelligence

Intelligent Customer Service Product: Overview, History, Architecture, and Future Trends

The article outlines the evolution, architecture, and core value of intelligent customer service systems—detailing their GUI‑based chatbot interface, triage and dialogue modes, knowledge‑base management, and operator benefits—while highlighting future trends such as richer human‑like interactions, 5G‑enabled channels, and continuous feedback‑driven improvement.

ChatbotIntelligent Customer ServiceNLP
0 likes · 12 min read
Intelligent Customer Service Product: Overview, History, Architecture, and Future Trends
DataFunTalk
DataFunTalk
Jun 30, 2022 · Artificial Intelligence

OBERT: A Billion‑Parameter Pretrained Language Model for Large‑Scale NLP Applications

The OPPO XiaoBu team introduced OBERT, a series of 100M‑, 300M‑, and 1B‑parameter pretrained language models that leverage massive TB‑scale corpora, multi‑granular masking, retrieval‑augmented training, and distributed acceleration to achieve state‑of‑the‑art results on CLUE and KgCLUE benchmarks while enabling efficient industrial deployment.

Fine-tuningKnowledge augmentationNLP
0 likes · 12 min read
OBERT: A Billion‑Parameter Pretrained Language Model for Large‑Scale NLP Applications
DataFunSummit
DataFunSummit
Jun 26, 2022 · Artificial Intelligence

Applying Knowledge Graphs to Recruitment: Construction, Tag Mining, and Recommendation at 58.com

58.com’s NLP senior engineer explains how a recruitment knowledge graph is built—through multi‑dimensional tag systems, tag mining, and relation extraction—and how it enhances bidirectional matching and recommendation efficiency, addressing challenges such as weak expression, cold start, and supply‑demand imbalance.

NLPaidata augmentation
0 likes · 17 min read
Applying Knowledge Graphs to Recruitment: Construction, Tag Mining, and Recommendation at 58.com
AntTech
AntTech
Jun 21, 2022 · Artificial Intelligence

FinQA Competition Winning Model by Ant Risk AI: Architecture, Dataset, and Experimental Results

Ant Risk AI’s team secured the FinQA competition champion by presenting a comprehensive model that combines a retriever and program generator, detailed dataset analysis, domain-specific language design, and extensive experiments demonstrating superior execution and program accuracy on financial numerical reasoning tasks.

Dataset AnalysisFinQANLP
0 likes · 16 min read
FinQA Competition Winning Model by Ant Risk AI: Architecture, Dataset, and Experimental Results
JD Retail Technology
JD Retail Technology
Jun 16, 2022 · Artificial Intelligence

2022 Global AI Technology Innovation Competition – Algorithm Challenge: Connecting AI with E‑commerce

The 2022 Global AI Technology Innovation Competition – Algorithm Challenge, co‑hosted by JD Retail and academic partners, brought together 12 finalist teams from over 3,000 entrants to tackle e‑commerce‑focused AI problems such as multimodal image‑text matching and product‑title entity recognition, highlighting real‑world business impact and fostering talent exchange.

AI competitionJD RetailMultimodal
0 likes · 8 min read
2022 Global AI Technology Innovation Competition – Algorithm Challenge: Connecting AI with E‑commerce
Ctrip Technology
Ctrip Technology
Jun 16, 2022 · Artificial Intelligence

Entity Linking System for Travel Knowledge Graph at Ctrip AI R&D

The article presents Ctrip's travel AI team's end‑to‑end entity linking solution built on a large‑scale tourism knowledge graph, detailing its background, technical architecture, core modules—including mention detection, candidate generation, and disambiguation using BERT and prefix‑tree techniques—and real‑world applications such as search, intelligent客服, and POI data maintenance.

BERTNLPentity linking
0 likes · 18 min read
Entity Linking System for Travel Knowledge Graph at Ctrip AI R&D
Baidu Geek Talk
Baidu Geek Talk
Jun 15, 2022 · Artificial Intelligence

CCL2022 Video Highlight Extraction Challenge Overview

The article describes the CCL2022 Video Highlight Extraction Challenge, a competition at the 21st China Conference on Computational Linguistics organized by Baidu, inviting participants worldwide to generate timestamped concise summaries of video segments, with registration details, eligibility, task description, example inputs/outputs, and evaluation metrics based on timing accuracy and ROUGE-L.

CCL2022Evaluation MetricsNLP
0 likes · 6 min read
CCL2022 Video Highlight Extraction Challenge Overview
DataFunTalk
DataFunTalk
Jun 13, 2022 · Artificial Intelligence

JD Technology Financial Causal Knowledge Graph: Construction, Causal Extraction, and Alignment Techniques

This article presents JD Technology's recent research on financial causal knowledge graphs, detailing the overall knowledge‑graph architecture, data layers, causal relation extraction, argument extraction, and graph‑alignment methods, and discusses their applications in finance, intelligent research reports, and industry‑leader recommendation.

Financial AIGraph AlignmentNLP
0 likes · 18 min read
JD Technology Financial Causal Knowledge Graph: Construction, Causal Extraction, and Alignment Techniques
Meituan Technology Team
Meituan Technology Team
Jun 9, 2022 · Artificial Intelligence

FSL++: A Few-Shot Learning Model for Chinese Language Understanding that Tops the FewCLUE Benchmark

FSL++—a RoBERTa‑large‑based few‑shot model enhanced with domain‑adaptive pre‑training, prompt learning, diverse embedding‑level augmentations, and ensemble self‑training—topped the Chinese FewCLUE benchmark, beating human accuracy on news and scientific classification tasks and delivering measurable gains across multiple Meituan product scenarios.

Chinese language understandingFew‑Shot LearningNLP
0 likes · 23 min read
FSL++: A Few-Shot Learning Model for Chinese Language Understanding that Tops the FewCLUE Benchmark
Alimama Tech
Alimama Tech
Jun 8, 2022 · Artificial Intelligence

CTR-Driven Advertising Text Generation and Bundle Creative Optimization (CREATER & CONNA)

Alibaba’s advertising team introduces CREATER, a CTR‑driven text generator that leverages user reviews, aspect control codes, and contrastive fine‑tuning, and CONNA, a non‑autoregressive bundle creator that predicts heterogeneous ad elements with set‑based loss, both delivering substantial online CTR gains and CPC reductions through dynamic creative optimization.

CTRDynamic creative optimizationNLP
0 likes · 25 min read
CTR-Driven Advertising Text Generation and Bundle Creative Optimization (CREATER & CONNA)
Python Programming Learning Circle
Python Programming Learning Circle
Jun 8, 2022 · Artificial Intelligence

Leveraging PaddleNLP UIE for Zero‑Shot Logistic Parcel Information Extraction

This article explains how PaddleNLP's Universal Information Extraction (UIE) model can dramatically reduce labeling effort and improve accuracy for logistics parcel data extraction, showcasing a five‑sample experiment that boosts F1 by 18 points to 93% and providing a zero‑shot Python example.

Information ExtractionLogisticsNLP
0 likes · 5 min read
Leveraging PaddleNLP UIE for Zero‑Shot Logistic Parcel Information Extraction
Meituan Technology Team
Meituan Technology Team
May 26, 2022 · Information Security

Building and Deploying Software Composition Analysis (SCA) for Enterprise Security

The article analyzes the rising threat of open‑source components, explains Software Composition Analysis (SCA) and SBOM generation, outlines the three‑stage process for building an in‑house SCA capability, discusses practical challenges such as data quality and integration, and looks ahead to future standards and open‑source tools.

DevSecOpsNLPSBOM
0 likes · 37 min read
Building and Deploying Software Composition Analysis (SCA) for Enterprise Security
Tencent Cloud Developer
Tencent Cloud Developer
May 19, 2022 · Industry Insights

What Does the Future Hold for AI? Insights from Industry Leaders

In a TVP forum hosted by Li Kaifu and Shen Chunhua, experts trace AI’s 70‑year journey, discuss the origins of the book “AI Future in Progress,” analyze investment stages, AI‑cloud synergy, NLP breakthroughs, medical applications, societal impacts, data privacy, and the challenges facing traditional enterprises.

AI cloudHealthcare AINLP
0 likes · 23 min read
What Does the Future Hold for AI? Insights from Industry Leaders