Tagged articles
507 articles
Page 3 of 6
Code DAO
Code DAO
May 19, 2022 · Artificial Intelligence

Semi‑Supervised Training Methods for Transformers

This article explains an end‑to‑end semi‑supervised training pipeline for Transformer‑based NLP models, detailing the unsupervised language‑model pre‑training, supervised fine‑tuning, and the internal architecture of embeddings, encoder layers, and downstream tasks such as text classification and NER.

BERTFine-tuningMasked Language Model
0 likes · 9 min read
Semi‑Supervised Training Methods for Transformers
Sohu Tech Products
Sohu Tech Products
May 18, 2022 · Artificial Intelligence

Design and Implementation of the Internal Intelligent QA Chatbot “Jarvis”

This article describes the motivation, micro‑service architecture, code implementation, V1.0 browser‑based NLP prototype, V2.0 AI‑enhanced version with BM25 and BERT, integration with ChatUI, DingTalk bot, command‑based automation, and future plans for the internal intelligent QA chatbot named Jarvis.

AIChatbotMicroservices
0 likes · 18 min read
Design and Implementation of the Internal Intelligent QA Chatbot “Jarvis”
DataFunTalk
DataFunTalk
May 7, 2022 · Artificial Intelligence

Intelligent Recommendation Selling Point Generation: Architecture, Core AI Techniques, Model Development, and Product Impact

This article explains how JD's intelligent recommendation selling point system leverages NLP, BERT, Transformer and pointer‑generator models to automatically create short, personalized product highlights, describing the technical background, system architecture, model training pipeline, online/offline monitoring, and the resulting business benefits.

BERTNLPRecommendation Systems
0 likes · 13 min read
Intelligent Recommendation Selling Point Generation: Architecture, Core AI Techniques, Model Development, and Product Impact
DataFunTalk
DataFunTalk
May 5, 2022 · Artificial Intelligence

NLP Evolution: Symbolic Deep Parsing vs Neural Pre‑trained Models, Low‑Code Trends, and Semi‑Automated Applications

The article reviews the history and current state of NLP, compares symbolic deep‑parsing and neural pre‑trained approaches, discusses the knowledge‑bottleneck and low‑code trend, and illustrates semi‑automated, low‑code NLP deployment in the financial domain while pondering future integration of symbolic and neural methods.

Knowledge EngineeringNLPSemi-Automated
0 likes · 23 min read
NLP Evolution: Symbolic Deep Parsing vs Neural Pre‑trained Models, Low‑Code Trends, and Semi‑Automated Applications
DataFunTalk
DataFunTalk
May 1, 2022 · Artificial Intelligence

Graph Deep Learning for Natural Language Processing: Methods, Models, and the Graph4NLP Library

This talk introduces graph deep learning techniques for natural language processing, covering the motivation for graph representations, traditional graph-based NLP methods, fundamentals of graph neural networks, static and dynamic graph construction, representation learning, and showcases the open‑source Graph4NLP Python library with example applications.

Graph RepresentationGraph4NLPNLP
0 likes · 16 min read
Graph Deep Learning for Natural Language Processing: Methods, Models, and the Graph4NLP Library
Tencent Tech
Tencent Tech
Apr 29, 2022 · Artificial Intelligence

Tencent’s Hunyuan AI Model Tops CLUE Leaderboard with Record Score

Tencent’s Hunyuan AI large model shattered records by scoring 80.888 to claim first place on the CLUE benchmark, showcasing its advanced natural language processing, multimodal abilities, curriculum‑learning training approach, and real‑world deployments in WeChat Search and advertising.

AICLUEHunyuan
0 likes · 3 min read
Tencent’s Hunyuan AI Model Tops CLUE Leaderboard with Record Score
Zuoyebang Tech Team
Zuoyebang Tech Team
Apr 15, 2022 · Artificial Intelligence

Zuoyebang’s NLP Platforms: Boosting Online Education with AI

In this interview, Zuoyebang’s NLP lead explains how the company built self‑developed platforms like IQC and FTP to automate text quality inspection and intelligent labeling, outlines their architecture, shares practical deep‑learning applications such as translation and grammar correction, and discusses future research directions in large‑scale multi‑label classification, few‑shot learning, and multimodal models.

AI PlatformsNLPmachine learning
0 likes · 11 min read
Zuoyebang’s NLP Platforms: Boosting Online Education with AI
TAL Education Technology
TAL Education Technology
Apr 14, 2022 · Artificial Intelligence

Intelligent Call Recording Quality Inspection Using Dual‑Mode Detection

This article proposes a dual‑mode detection solution for call‑recording quality inspection that combines rule‑based semantic similarity matching with BERT‑based sentence segmentation and RoBERTa multi‑label classification to achieve high accuracy, fast task adaptation, and strong generalization for customer‑service scenarios.

BERTNLPRoBERTa
0 likes · 7 min read
Intelligent Call Recording Quality Inspection Using Dual‑Mode Detection
政采云技术
政采云技术
Apr 12, 2022 · Artificial Intelligence

Design and Implementation of the Internal Intelligent QA Chatbot “Jarvis”

This article describes the end‑to‑end design, architecture, code implementation, and deployment steps for an internal intelligent QA chatbot named “Jarvis”, covering its V1.0 browser‑based prototype, V2.0 AI‑enhanced version, DingTalk integration, automation features, and future roadmap.

AIBackendChatbot
0 likes · 19 min read
Design and Implementation of the Internal Intelligent QA Chatbot “Jarvis”
DaTaobao Tech
DaTaobao Tech
Apr 12, 2022 · Artificial Intelligence

ArcCSE: Angular Margin Contrastive Learning for Self‑Supervised Text Representation

ArcCSE introduces an angular‑margin contrastive loss and both pairwise (dropout‑augmented) and triple‑wise (span‑masked) relationship modeling to self‑supervise text embeddings, yielding tighter decision boundaries, higher alignment and uniformity, and superior performance on unsupervised STS, SentEval, and Alibaba’s retrieval and recommendation systems.

NLPangular margincontrastive learning
0 likes · 8 min read
ArcCSE: Angular Margin Contrastive Learning for Self‑Supervised Text Representation
Code DAO
Code DAO
Apr 10, 2022 · Artificial Intelligence

A Comprehensive Overview of Relation Extraction Techniques

This article surveys relation extraction, defining the task, categorizing its five main forms, and detailing key approaches such as entity position encoding, dependency‑tree methods like shortest dependency path and BRCNN, as well as distant supervision with multi‑instance learning and selective attention.

NLPdependency parsingdistant supervision
0 likes · 12 min read
A Comprehensive Overview of Relation Extraction Techniques
Xueersi Online School Tech Team
Xueersi Online School Tech Team
Apr 2, 2022 · Artificial Intelligence

Design and Implementation of the Xiaosi Intelligent Customer Service Bot for Internal IM

This article details the design, architecture, key technologies, and deployment of Xiaosi, an AI‑powered intelligent customer service chatbot built on the internal IM platform, highlighting its problem‑diagnosis accuracy, manpower savings, adapter mechanism, high‑availability backend, and practical use cases.

AI chatbotBackendDjango
0 likes · 13 min read
Design and Implementation of the Xiaosi Intelligent Customer Service Bot for Internal IM
Youku Technology
Youku Technology
Apr 2, 2022 · Artificial Intelligence

Constrained Sequence-to-Tree Generation for Hierarchical Text Classification

At SIGIR 2022, the authors present a constrained Seq2Tree model that transforms hierarchical label taxonomies into preorder sequences and applies dynamic‑dictionary decoding to ensure label consistency, achieving superior hierarchical text classification performance on benchmark datasets and real‑world deployment within Alibaba Entertainment’s AI Brain.

Artificial IntelligenceEncoder-DecoderHierarchical Text Classification
0 likes · 5 min read
Constrained Sequence-to-Tree Generation for Hierarchical Text Classification
Yuewen Technology
Yuewen Technology
Apr 1, 2022 · Artificial Intelligence

Detecting Emerging Terms in Web Novels: PMI, Entropy, and TF‑IDF Methods

This article explores how to automatically discover new words in Chinese web novels by combining n‑gram statistics, pointwise mutual information, information entropy, and TF‑IDF filtering, presenting a practical, unsupervised pipeline that improves tokenization and search recall without manual labeling.

Chinese text miningNLPPMI
0 likes · 14 min read
Detecting Emerging Terms in Web Novels: PMI, Entropy, and TF‑IDF Methods
DaTaobao Tech
DaTaobao Tech
Mar 31, 2022 · Artificial Intelligence

Intelligent Copy Generation for Taobao Push: Design, Implementation, and Evaluation

The 2021 Taobao Push project introduced an AI‑driven copy‑generation platform that combines template extraction and fine‑tuned Unilm models with diverse beam search, creating diverse, high‑quality push messages, cutting manual costs, and delivering a 10 % click‑through lift and higher material adoption.

AICopy GenerationNLP
0 likes · 18 min read
Intelligent Copy Generation for Taobao Push: Design, Implementation, and Evaluation
JD Cloud Developers
JD Cloud Developers
Mar 21, 2022 · Artificial Intelligence

How JD’s AI Generates Multimodal Product Summaries to Boost E‑Commerce

The article explains how rapid internet growth created information overload, leading to concise summary services, and how recent AI advances—especially large language models like GPT‑3—enable platforms such as JD.com to automatically generate high‑quality, multimodal product copy that drives sales and supports diverse creative tasks.

AINLPText Generation
0 likes · 4 min read
How JD’s AI Generates Multimodal Product Summaries to Boost E‑Commerce
Meituan Technology Team
Meituan Technology Team
Mar 17, 2022 · Artificial Intelligence

Tsinghua University & Meituan Digital Life Joint Research Institute Academic Salon: Large Model Technologies and Challenges

The Tsinghua‑Meituan Digital Life Joint Research Institute’s Academic Salon on March 23 featured Associate Professor Liu Zhiyuan presenting the latest advances and ten key challenges in large‑model technologies, aiming to foster industry‑academia collaboration and drive innovation in representation learning, knowledge graphs, and social computing.

Academic SeminarArtificial IntelligenceLarge Language Models
0 likes · 4 min read
Tsinghua University & Meituan Digital Life Joint Research Institute Academic Salon: Large Model Technologies and Challenges
DataFunTalk
DataFunTalk
Mar 17, 2022 · Artificial Intelligence

A Survey of Text Classification and Intent Recognition: Industrial and Research Perspectives

This article reviews recent developments in text classification and intent recognition, comparing industrial practices such as business‑coupled feature engineering with research trends like pretrained language models, and provides references and practical insights for building effective NLP solutions.

NLPindustry applicationsintent recognition
0 likes · 13 min read
A Survey of Text Classification and Intent Recognition: Industrial and Research Perspectives
ELab Team
ELab Team
Mar 16, 2022 · Artificial Intelligence

Reverse Dictionary Made Easy: Harness WantWords and Hugging Face for Quick NLP Model Integration

This article introduces the open‑source WantWords reverse‑dictionary project, explains its token‑based processing pipeline, walks through Python implementation and model invocation with Hugging Face’s Transformers, reviews NLP’s historical evolution, and shows how front‑end developers can quickly integrate NLP models into products.

Artificial IntelligenceBERTHugging Face
0 likes · 13 min read
Reverse Dictionary Made Easy: Harness WantWords and Hugging Face for Quick NLP Model Integration
Baobao Algorithm Notes
Baobao Algorithm Notes
Feb 10, 2022 · Artificial Intelligence

Winning Kaggle’s Jigsaw Toxicity Challenge with Transfer Learning and Zero‑Shot Classification

This article breaks down the evolution of Kaggle’s Jigsaw toxic comment competitions and presents a three‑step solution—training on historic data, using a genetic algorithm to weight multi‑label predictions, and ensembling fifteen models—to achieve high‑accuracy zero‑shot text classification.

Artificial IntelligenceKaggleNLP
0 likes · 5 min read
Winning Kaggle’s Jigsaw Toxicity Challenge with Transfer Learning and Zero‑Shot Classification
Baobao Algorithm Notes
Baobao Algorithm Notes
Jan 28, 2022 · Artificial Intelligence

How Pre‑Training Evolved: From word2vec to MAE Across NLP and CV

This article traces the history of deep‑learning pre‑training techniques, comparing the parallel developments in natural‑language processing and computer vision—from early word2vec and bag‑of‑words models through ELMo and BERT to recent transformer‑based vision models like iGPT, ViT, BEiT and MAE—highlighting key innovations, challenges, and the convergence of the two fields.

Deep LearningMAENLP
0 likes · 20 min read
How Pre‑Training Evolved: From word2vec to MAE Across NLP and CV
DataFunTalk
DataFunTalk
Jan 26, 2022 · Artificial Intelligence

Exploring and Practicing Generative Chat in OPPO's XiaoBu Assistant

This article presents a comprehensive overview of OPPO's XiaoBu Assistant, detailing its research background, chat skill architecture, evolution from retrieval and rule‑based methods to generative models, industry model comparisons, decoding and ranking strategies, safety mechanisms, performance optimizations, and evaluation results.

ChatbotDialogue SystemsModel Optimization
0 likes · 20 min read
Exploring and Practicing Generative Chat in OPPO's XiaoBu Assistant
Code DAO
Code DAO
Jan 15, 2022 · Artificial Intelligence

Compressing Unsupervised fastText Models 300× Smaller with Near‑Identical NLP Performance

This article shows how the compress‑fasttext Python library can shrink a 7 GB fastText word‑embedding model to about 21 MB—a 300‑fold reduction—while preserving almost the same accuracy on downstream NLP tasks, and explains the underlying compression techniques, usage examples, and evaluation results.

NLPcompress-fasttextfastText
0 likes · 9 min read
Compressing Unsupervised fastText Models 300× Smaller with Near‑Identical NLP Performance
Baobao Algorithm Notes
Baobao Algorithm Notes
Jan 14, 2022 · Artificial Intelligence

Boosting BERT Text Classification with Label Embedding: How It Works

The paper proposes a simple yet effective method that fuses label embeddings into BERT, enhancing text‑classification performance without increasing computational cost, and validates the approach across six benchmark datasets, also exploring tf‑idf‑based label augmentation and the impact of using [SEP] versus no‑[SEP] inputs.

BERTDeep LearningNLP
0 likes · 8 min read
Boosting BERT Text Classification with Label Embedding: How It Works
Beike Product & Technology
Beike Product & Technology
Jan 7, 2022 · Artificial Intelligence

Beike Real Estate NLP Team Wins First Place in CCIR Cup 2021 Intelligent Human‑Computer Interaction Track

The Beike Real Estate NLP team secured first place in the CCIR Cup 2021 Intelligent Human‑Computer Interaction track by applying semi‑supervised and transfer learning techniques to small‑sample intent recognition and slot filling, and also presented the large‑scale Mandarin dialect speech benchmark KeSpeech at NeurIPS 2021.

AI competitionBERTNLP
0 likes · 5 min read
Beike Real Estate NLP Team Wins First Place in CCIR Cup 2021 Intelligent Human‑Computer Interaction Track
JD Cloud Developers
JD Cloud Developers
Jan 4, 2022 · Artificial Intelligence

How JD’s Vega v1 Model Dominated GLUE Benchmark, Surpassing Human Performance

JD Explore’s Vega v1 model topped the GLUE benchmark with a 91.3 average score, outperforming Microsoft, Facebook, and Stanford across multiple NLP tasks, including first‑ever human‑level results on sentiment analysis and coreference, showcasing JD’s leading position in deep‑learning research.

AI researchDeep LearningGLUE benchmark
0 likes · 3 min read
How JD’s Vega v1 Model Dominated GLUE Benchmark, Surpassing Human Performance
Ctrip Technology
Ctrip Technology
Dec 30, 2021 · Artificial Intelligence

Semantic Matching Techniques for Intelligent Customer Service at Ctrip

This article presents Ctrip's intelligent customer service system, detailing the evolution of semantic matching methods from traditional lexical models to deep learning approaches such as BERT and ESIM, and describing multi‑stage retrieval, multilingual transfer learning, and KBQA techniques for improving query understanding and response accuracy.

BERTNLPcustomer-service
0 likes · 16 min read
Semantic Matching Techniques for Intelligent Customer Service at Ctrip
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 23, 2021 · Artificial Intelligence

How Pre‑Training Evolved: From word2vec to MAE Across NLP & Vision

This article traces the evolution of deep‑learning pre‑training techniques, starting with word2vec in NLP, moving through ELMo and BERT, then shifting to computer‑vision models such as iGPT, ViT, BEiT, and MAE, highlighting key innovations, challenges, and the convergence of NLP and CV paradigms.

BERTMAENLP
0 likes · 21 min read
How Pre‑Training Evolved: From word2vec to MAE Across NLP & Vision
DataFunSummit
DataFunSummit
Dec 21, 2021 · Artificial Intelligence

Large‑Scale Pretrained Model Compression and Distillation: AdaBERT, L2A, and Meta‑KD

This talk presents Alibaba DAMO Academy’s recent work on compressing large pretrained language models, covering task‑adaptive AdaBERT, data‑augmented L2A, and meta‑knowledge distillation Meta‑KD, describing their motivations, architectures, NAS‑based search, loss designs, and experimental results across multiple NLP tasks.

NLPNeural Architecture Searchknowledge distillation
0 likes · 13 min read
Large‑Scale Pretrained Model Compression and Distillation: AdaBERT, L2A, and Meta‑KD
Python Programming Learning Circle
Python Programming Learning Circle
Dec 16, 2021 · Artificial Intelligence

Part-of-Speech Tagging with Jieba in Python

This article explains how to perform Chinese part-of-speech tagging using the jieba.posseg library in Python, including loading stop words, extracting article content via Newspaper3k, applying precise mode segmentation, filtering, and presenting results in a pandas DataFrame.

NLPPOS taggingPython
0 likes · 3 min read
Part-of-Speech Tagging with Jieba in Python
Code DAO
Code DAO
Dec 12, 2021 · Artificial Intelligence

How to Boost Text Analysis Accuracy on a 2‑Billion‑Word Corpus

This article explains practical techniques for improving NLP model accuracy on massive corpora, covering challenges of multi‑field text, word‑embedding choices, a fasttext‑based regression demo with book‑review data, feature engineering tricks, and a comparison with tf‑idf + LASSO.

NLPPythonWord2Vec
0 likes · 13 min read
How to Boost Text Analysis Accuracy on a 2‑Billion‑Word Corpus
Laiye Technology Team
Laiye Technology Team
Dec 10, 2021 · Artificial Intelligence

Best Practices for Building an Entity‑Relationship Annotation Tool at Laiye AI R&D Center

This article details Laiye Technology’s AI R&D team’s end‑to‑end approach to designing and optimizing a custom entity‑relationship annotation tool, covering data‑labeling challenges, shortcomings of Excel and off‑the‑shelf solutions, architectural requirements, line‑breaking and mark‑position algorithms, performance improvements, and real‑world results.

JavaScriptNLPdata annotation
0 likes · 12 min read
Best Practices for Building an Entity‑Relationship Annotation Tool at Laiye AI R&D Center
Meituan Technology Team
Meituan Technology Team
Dec 9, 2021 · Artificial Intelligence

Fine-Grained Aspect-Based Sentiment Analysis for Meituan's To‑Restaurant Business

To enhance decision‑making for users and quality monitoring for merchants, Meituan’s to‑restaurant platform implements fine‑grained aspect‑based sentiment analysis that extracts dish, attribute, opinion and polarity tuples from reviews, employing both a BERT‑CRF pipeline and a joint Dual‑MRC model which raise F1 scores from 0.61 to 0.68, and are deployed in dashboards and review‑management tools, with future work targeting efficiency and broader four‑tuple extraction.

ABSABERTNLP
0 likes · 28 min read
Fine-Grained Aspect-Based Sentiment Analysis for Meituan's To‑Restaurant Business
Code DAO
Code DAO
Dec 7, 2021 · Artificial Intelligence

How to Cluster Text with TF‑IDF, KMeans and PCA in Python

This article walks through a complete Python workflow that loads the 20 Newsgroups dataset, preprocesses the documents, vectorizes them with TF‑IDF, groups them using KMeans, reduces dimensions with PCA, and visualizes the resulting clusters, illustrating each step with code and plots.

KMeansNLPPCA
0 likes · 13 min read
How to Cluster Text with TF‑IDF, KMeans and PCA in Python
Alibaba Cloud Native
Alibaba Cloud Native
Dec 7, 2021 · Operations

How Information Entropy Powers AI‑Driven Alert Noise Reduction in Cloud‑Native Operations

This article explains how Shannon's information entropy and NLP are combined in Alibaba Cloud's ARMS intelligent noise reduction to quantify alert uncertainty, filter redundant notifications, and automatically prioritize critical incidents, offering a practical, self‑learning solution for modern monitoring environments.

Alert Noise ReductionNLPinformation entropy
0 likes · 11 min read
How Information Entropy Powers AI‑Driven Alert Noise Reduction in Cloud‑Native Operations
DataFunTalk
DataFunTalk
Nov 29, 2021 · Artificial Intelligence

Text Mining for User Research: Architecture, Labeling, and Application Cases at JD.com

The presentation explains how JD.com leverages large‑scale text mining and NLP techniques—including data cleaning, multi‑level labeling, sentiment classification with models such as TextCNN, RoBERTa, and USE—to transform unstructured customer feedback into actionable product insights across various e‑commerce scenarios.

AINLPSentiment Analysis
0 likes · 18 min read
Text Mining for User Research: Architecture, Labeling, and Application Cases at JD.com
DataFunTalk
DataFunTalk
Nov 26, 2021 · Artificial Intelligence

Solving Model Prediction Errors: A Comprehensive Bad‑Case Treatment Methodology

This article presents a step‑by‑step methodology for diagnosing and fixing model prediction errors—especially bad cases—in NLP and search systems, covering sample bias, threshold selection, preprocessing, post‑processing, validation cycles, and guidance on when to replace the model.

NLPPostProcessingPreprocessing
0 likes · 11 min read
Solving Model Prediction Errors: A Comprehensive Bad‑Case Treatment Methodology
Programmer DD
Programmer DD
Nov 26, 2021 · Artificial Intelligence

Leverage DDParser for COVID‑19 Vaccine Data Extraction and Open‑Source Tools

This article introduces several pandemic‑related open‑source resources—including a nationwide COVID‑19 vaccine record lookup, a tracker for ineffective vaccine distribution, and Baidu's DDParser NLP tool—detailing their purpose, usage, and installation to help developers build better vaccine‑related applications.

COVID-19DDParserNLP
0 likes · 5 min read
Leverage DDParser for COVID‑19 Vaccine Data Extraction and Open‑Source Tools
Baidu Geek Talk
Baidu Geek Talk
Nov 17, 2021 · Artificial Intelligence

Fast Video Editing: Architecture and AI‑Powered Subtitle & Redundant Segment Detection

Baidu’s Fast Editing tool automates video trimming by using NLP to recognize subtitles, tone markers and duplicate sentences, then aligns them with the timeline for one‑click removal, employing character, Levenshtein and cosine similarity algorithms within a three‑module architecture (Plugin, Window, Caption) and planning on‑device PaddlePaddle analysis to cut latency and cost.

AINLPSwift
0 likes · 11 min read
Fast Video Editing: Architecture and AI‑Powered Subtitle & Redundant Segment Detection
Kuaishou Tech
Kuaishou Tech
Nov 16, 2021 · Artificial Intelligence

KuaiSearch's PERKS Pre‑trained Language Model Sets New Record on the CLUE Benchmark

The KuaiSearch research team introduced PERKS, a large‑scale Chinese pre‑trained language model that achieved an 80.618 score on the CLUE 1.1 language classification task, narrowing the gap to human annotation and demonstrating significant advances in multi‑stage training, model optimization, and real‑world search applications.

CLUE benchmarkKuaiSearchNLP
0 likes · 7 min read
KuaiSearch's PERKS Pre‑trained Language Model Sets New Record on the CLUE Benchmark
DataFunSummit
DataFunSummit
Nov 14, 2021 · Artificial Intelligence

Overview of Pre‑training Models and the UER‑py Framework for Natural Language Processing

This article introduces the importance of pre‑training in natural language processing, reviews classic pre‑training models such as Skip‑thoughts, BERT, GPT‑2 and T5, presents the modular UER‑py framework and its Chinese resources, compares it with Huggingface Transformers, and outlines practical deployment steps in industry.

NLPUER-pylanguage models
0 likes · 22 min read
Overview of Pre‑training Models and the UER‑py Framework for Natural Language Processing
DataFunTalk
DataFunTalk
Nov 12, 2021 · Artificial Intelligence

Xiaomi Xiao AI Intelligent Question‑Answering System: Architecture, Techniques, and Applications

This article presents a comprehensive overview of Xiaomi's Xiao AI intelligent QA system, detailing its background, three core answering modules—knowledge‑graph QA, retrieval‑based FAQ, and reading‑comprehension—and the underlying methods such as template matching, cross‑domain semantic parsing, path‑based reasoning, semantic retrieval, and neural matching, while also discussing performance results and practical trade‑offs.

AINLPReading Comprehension
0 likes · 18 min read
Xiaomi Xiao AI Intelligent Question‑Answering System: Architecture, Techniques, and Applications
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Nov 11, 2021 · Artificial Intelligence

Transforming B2B Customer Service: Table QA via Multi‑Turn Dialogue

This article explores how table‑based question answering can be integrated into B2B intelligent customer service by converting table queries into entity‑attribute recognition and multi‑turn dialogue, comparing end‑to‑end NL2SQL and slot‑filling approaches, and presenting NetEase Qiyu's practical implementation with its benefits and use cases.

NL2SQLNLPattribute extraction
0 likes · 10 min read
Transforming B2B Customer Service: Table QA via Multi‑Turn Dialogue
Meituan Technology Team
Meituan Technology Team
Nov 4, 2021 · Artificial Intelligence

Knowledge-based Question Answering (KBQA) System at Meituan: Design, Challenges, and Solutions

Meituan’s knowledge‑based question answering system tackles diverse, constraint‑rich, multi‑hop queries across pre‑sale, in‑sale and post‑sale scenarios by integrating fine‑grained query understanding, relation recognition, sub‑graph retrieval and answer ranking, employing optimized BERT models, pre‑training tasks, and domain‑specific enhancements to boost response speed, conversion rates, and benchmark performance, while acknowledging remaining challenges in long‑tail and complex queries.

KBQAMeituanNLP
0 likes · 24 min read
Knowledge-based Question Answering (KBQA) System at Meituan: Design, Challenges, and Solutions
Meituan Technology Team
Meituan Technology Team
Oct 21, 2021 · Artificial Intelligence

Meituan's End-to-End Sentiment Analysis Technology and the ASAP Dataset

Meituan’s NLP Center introduced the ASAP dataset—the largest real‑world Chinese attribute‑level sentiment corpus—to date, and the article traces the progression from document‑level regression models upgraded with MT‑BERT, through multi‑task attribute‑level ABSA and opinion‑triplet extraction, to scalable real‑time and batch services, while outlining future transfer‑learning and few‑shot research.

MeituanNLPSentiment Analysis
0 likes · 25 min read
Meituan's End-to-End Sentiment Analysis Technology and the ASAP Dataset
DataFunTalk
DataFunTalk
Oct 20, 2021 · Artificial Intelligence

Building an Industry Chain Knowledge Graph: Theory, Architecture, and Key Methods

This article presents a comprehensive overview of constructing an industry‑chain knowledge graph for the financial sector, covering its theoretical background, architectural design, automated building pipeline, key NLP techniques, and practical applications such as visualization, IPO review, and investment analysis.

Industry ChainNLPfinancial technology
0 likes · 22 min read
Building an Industry Chain Knowledge Graph: Theory, Architecture, and Key Methods
Yuewen Technology
Yuewen Technology
Oct 15, 2021 · Artificial Intelligence

How Yuedu's TTS Platform Automates High‑Quality Audiobook Production

This article explains how Yuedu's TTS synthesis platform tackles the booming audiobook market by using AI‑driven text preprocessing, role graph construction, content structuring, emotion and effect recognition, and a streamlined post‑processing workflow to efficiently generate multi‑character, emotionally rich audio books at scale.

Audio SynthesisEmotion RecognitionNLP
0 likes · 13 min read
How Yuedu's TTS Platform Automates High‑Quality Audiobook Production
Amap Tech
Amap Tech
Oct 14, 2021 · Artificial Intelligence

CCF Big Data & Computing Intelligence Contest – POI Name Generation Challenge

The 9th CCF Big Data & Computing Intelligence Contest partners with Gaode Map to launch a POI Name Generation challenge, requiring participants to fuse image, signboard detection, and OCR data to automatically produce accurate, fluent place names, with a ¥50,000 prize pool, weekly vouchers, and recruitment opportunities for global teams.

AINLPPOI
0 likes · 7 min read
CCF Big Data & Computing Intelligence Contest – POI Name Generation Challenge
DataFunTalk
DataFunTalk
Oct 12, 2021 · Artificial Intelligence

Intelligent Grading: Technical Exploration and Practice in AI‑Powered Education

This presentation by Tencent senior researcher Li Chao outlines the background, typical challenges, and multi‑layer technical solutions for intelligent grading in education, covering AI‑driven classroom, homework, review, and exam scenarios, multimodal spell‑checking, essay evaluation, and adaptive learning pipelines.

AIEducation TechnologyEssay Evaluation
0 likes · 25 min read
Intelligent Grading: Technical Exploration and Practice in AI‑Powered Education
DataFunTalk
DataFunTalk
Oct 12, 2021 · Artificial Intelligence

PaddleNLP v2.1 Release: Taskflow One‑Click NLP, Few‑Shot Learning Enhancements, and 28× Text Generation Acceleration

PaddleNLP v2.1 introduces an industrial‑grade Taskflow for eight NLP scenarios, a three‑line few‑shot learning paradigm that boosts small‑sample performance, and a FasterTransformer‑based inference engine that delivers up to 28‑fold speedup for text generation, all backed by extensive model and algorithm integrations.

Few‑Shot LearningNLPPaddleNLP
0 likes · 7 min read
PaddleNLP v2.1 Release: Taskflow One‑Click NLP, Few‑Shot Learning Enhancements, and 28× Text Generation Acceleration
DataFunSummit
DataFunSummit
Oct 12, 2021 · Artificial Intelligence

Intelligent Grading: Technical Exploration and Practice in AI‑Powered Education

This article presents a comprehensive overview of AI‑driven intelligent grading technologies, covering background, typical educational challenges, multimodal NLP solutions for essay, spelling and grammar correction, adaptive learning, and related research, illustrating how deep learning and multimodal models improve automated assessment across K‑12 scenarios.

AIEducation TechnologyEssay Scoring
0 likes · 24 min read
Intelligent Grading: Technical Exploration and Practice in AI‑Powered Education
58 Tech
58 Tech
Oct 12, 2021 · Artificial Intelligence

Seq2Seq Approaches for Phone Number Extraction from Two‑Speaker Voice Dialogues

This article presents a practical study of extracting phone numbers from two‑speaker voice dialogues using Seq2Seq models—including LSTM, GRU with attention and feature fusion, and Transformer—detailing data characteristics, model architectures, training strategies, experimental results, and comparative analysis showing the GRU‑Attention approach achieving the best performance.

GRULSTMNLP
0 likes · 13 min read
Seq2Seq Approaches for Phone Number Extraction from Two‑Speaker Voice Dialogues
DataFunSummit
DataFunSummit
Oct 2, 2021 · Artificial Intelligence

Joint Entity and Relation Extraction: Methods and Document‑Level Approaches

This presentation reviews the importance of entity‑relation extraction for knowledge‑graph construction, compares sentence‑level and complex contexts, and surveys joint extraction techniques—including sequence labeling, table filling, and seq2seq models—as well as document‑level graph‑based methods and future research directions.

NLPdocument-levelentity-relation extraction
0 likes · 15 min read
Joint Entity and Relation Extraction: Methods and Document‑Level Approaches
DataFunSummit
DataFunSummit
Sep 26, 2021 · Artificial Intelligence

Contrastive Learning and Its Applications in Weibo Content Representation

This article explains the fundamentals of contrastive learning, reviews typical models such as SimCLR, MoCo, SwAV, BYOL, SimSiam and Barlow Twins, and demonstrates how these methods are applied to Weibo text and multimodal (text‑image) representation tasks like hashtag generation and image‑text matching.

MultimodalNLPWeibo
0 likes · 18 min read
Contrastive Learning and Its Applications in Weibo Content Representation
Ctrip Technology
Ctrip Technology
Aug 26, 2021 · Artificial Intelligence

Applying Snorkel Weak Supervision to Automate Event Summaries in Ctrip Customer Service

The article explains how Ctrip’s hotel customer‑service team uses the Snorkel weak‑supervision framework to generate large‑scale labeled data for training models that automatically produce structured event summaries, detailing the workflow, labeling functions, generative and discriminative model training, and performance improvements.

Labeling FunctionsNLPSnorkel
0 likes · 14 min read
Applying Snorkel Weak Supervision to Automate Event Summaries in Ctrip Customer Service
DataFunSummit
DataFunSummit
Aug 21, 2021 · Artificial Intelligence

My Journey in Text2SQL Research: From Paper Reading to Winning a Global Competition

This article recounts the author's six‑month Text2SQL research experience, detailing how systematic paper reading, leveraging existing engineering solutions, and fully utilizing academic, human, and hardware resources led to a successful thesis, a patent, a paper, and a second‑place finish in Yale's global Text2SQL competition.

AINLPPaper Reading
0 likes · 9 min read
My Journey in Text2SQL Research: From Paper Reading to Winning a Global Competition
Meituan Technology Team
Meituan Technology Team
Aug 19, 2021 · Artificial Intelligence

Few-Shot Learning Methods and Applications in Meituan NLP

Meituan’s NLP team leverages few‑shot learning—using data‑augmentation, semi‑supervised, ensemble/self‑training, and domain‑adaptation techniques—to cut annotation costs, achieving 1–2 percentage‑point accuracy gains on internal benchmarks and deploying high‑performing models for tasks such as topic classification, fake‑review detection, and sentiment analysis, while planning broader platform and model extensions.

Few‑Shot LearningNLPSemi-supervised Learning
0 likes · 29 min read
Few-Shot Learning Methods and Applications in Meituan NLP
Meituan Technology Team
Meituan Technology Team
Aug 5, 2021 · Artificial Intelligence

Overview of Meituan's ACL 2021 Accepted Papers

Meituan’s 2021 ACL contributions comprise seven accepted papers—six long and one short—introducing novel approaches to event argument decoding, cross‑domain slot transfer, contrastive out‑of‑domain detection, novel slot discovery, self‑supervised sentence representation, unsupervised semantic parsing, and pseudo‑query‑enhanced dense retrieval, inviting further research and collaboration.

ACLEvent ExtractionMeituan
0 likes · 22 min read
Overview of Meituan's ACL 2021 Accepted Papers
Sohu Tech Products
Sohu Tech Products
Aug 4, 2021 · Artificial Intelligence

Technical Summary of the 2021 Sohu Campus Text Matching Algorithm Competition

This article presents a comprehensive technical summary of the 2021 Sohu Campus Text Matching Algorithm Competition, detailing data characteristics, preprocessing strategies, tokenization choices, positional encoding methods, model architectures using relative encodings such as WoBERT and RoFormer, experimental results, and reflections on future improvements.

Model DesignNLPcompetition
0 likes · 9 min read
Technical Summary of the 2021 Sohu Campus Text Matching Algorithm Competition
DataFunSummit
DataFunSummit
Aug 3, 2021 · Artificial Intelligence

Content Understanding for Personalized Recommendation: Interest Graph, Concept Mining, and Semantic Matching at Tencent

The article explains how Tencent addresses the limitations of traditional content understanding methods in personalized recommendation by introducing an interest‑graph framework that combines classification, concept, entity, and event layers, and details the associated mining, matching, and online evaluation techniques.

EmbeddingNLPcontent understanding
0 likes · 13 min read
Content Understanding for Personalized Recommendation: Interest Graph, Concept Mining, and Semantic Matching at Tencent
DataFunTalk
DataFunTalk
Jul 30, 2021 · Artificial Intelligence

Fundamentals of Natural Language Processing: Language Models, Smoothing, and Basic Tasks

This article provides a comprehensive overview of natural language processing fundamentals, covering the challenges of language modeling, N‑gram and Markov assumptions, smoothing techniques such as discounting and add‑one, evaluation via perplexity, basic tasks like Chinese word segmentation, subword tokenization, POS tagging, syntactic and semantic parsing, and a range of downstream applications including information extraction, sentiment analysis, question answering, machine translation, and dialogue systems.

AILanguage ModelNLP
0 likes · 29 min read
Fundamentals of Natural Language Processing: Language Models, Smoothing, and Basic Tasks
iQIYI Technical Product Team
iQIYI Technical Product Team
Jul 30, 2021 · Artificial Intelligence

iQIYI Search Ranking Algorithm Practice – NLP and Search Integration

At iQIYI’s iTech Conference, Zhang Zhigang detailed a full‑stack search ranking system that combines NLP‑driven query analysis, hierarchical indexing, multi‑stage coarse‑to‑fine ranking, Transformer‑based re‑ranking, sparse‑feature DNN enhancements and LIME/SE‑Block explainability, delivering measurable gains in CTR and NDCG for the platform’s video search.

NLPiQIYIinformation retrieval
0 likes · 20 min read
iQIYI Search Ranking Algorithm Practice – NLP and Search Integration
Ctrip Technology
Ctrip Technology
Jul 29, 2021 · Artificial Intelligence

NLP Techniques for Classifying Ctrip Ticket Customer Service Conversations

This article presents the background, problem analysis, data preprocessing, modeling approaches and optimization results of applying various NLP methods—including statistical models, word embeddings, attention mechanisms and pretrained language models such as BERT—to improve the accuracy of classifying Ctrip ticket customer service dialogues.

BERTDeep LearningNLP
0 likes · 13 min read
NLP Techniques for Classifying Ctrip Ticket Customer Service Conversations
DataFunTalk
DataFunTalk
Jul 22, 2021 · Artificial Intelligence

Joint Entity and Relation Extraction: Methods, Challenges, and Document‑Level Approaches

This article reviews the fundamentals of entity‑relation extraction, surveys joint extraction techniques such as sequence labeling, table‑filling and seq2seq models, discusses document‑level graph‑based methods, highlights experimental findings, and outlines future research directions in knowledge‑graph construction.

NLPdocument-levelentity-relation extraction
0 likes · 17 min read
Joint Entity and Relation Extraction: Methods, Challenges, and Document‑Level Approaches
Meituan Technology Team
Meituan Technology Team
Jul 15, 2021 · Artificial Intelligence

Local Life Comprehensive Demand Knowledge Graph: Design, Algorithms, and Applications

The Local Life Comprehensive Demand Knowledge Graph (GENE) reorients Meituan’s supply‑demand matching by building a multi‑layer, user‑centric graph that captures intent and consideration, employing BERT, Word2Vec, ELECTRA, and reinforcement‑learning models to generate concrete and scene‑based demand nodes, now powering parent‑child, leisure, medical‑beauty, and education services.

AIDemand ModelingNLP
0 likes · 34 min read
Local Life Comprehensive Demand Knowledge Graph: Design, Algorithms, and Applications
DataFunTalk
DataFunTalk
Jul 13, 2021 · Artificial Intelligence

NLP‑Driven Scenario Tagging and Experience Management Platform for Douyin App

This article describes how Douyin built an AI‑powered feedback management platform that uses NLP to automatically tag and cluster user comments, maps them to business scenarios, defines quantitative experience metrics, and creates a closed‑loop system for rapid problem discovery and product improvement.

AIDouyinNLP
0 likes · 15 min read
NLP‑Driven Scenario Tagging and Experience Management Platform for Douyin App
DataFunTalk
DataFunTalk
Jul 2, 2021 · Artificial Intelligence

Vector Retrieval for Community Forum Search Using Milvus at Dingxiangyuan

This article describes how Dingxiangyuan's algorithm team adopted Milvus for distributed vector indexing to improve semantic search in their community forum, detailing the background, retrieval workflow, various embedding models—including Bi‑Encoder, Spherical Embedding, and Knowledge Embedding—and summarizing the benefits and future applications.

EmbeddingMilvusNLP
0 likes · 10 min read
Vector Retrieval for Community Forum Search Using Milvus at Dingxiangyuan
DataFunTalk
DataFunTalk
Jun 22, 2021 · Artificial Intelligence

Survey of Graph Neural Networks for Natural Language Processing

This comprehensive survey reviews the latest research on graph neural networks applied to natural language processing, covering graph construction methods, graph representation learning techniques, encoder‑decoder models, static and dynamic graph building, and discusses challenges, benchmarks, and future directions in the field.

Encoder-DecoderGraph ConstructionNLP
0 likes · 57 min read
Survey of Graph Neural Networks for Natural Language Processing
58 Tech
58 Tech
Jun 18, 2021 · Artificial Intelligence

Bidding Document Classification and Entity Extraction Using BERT-based Models

This article describes how 58.com built an end‑to‑end bidding service that crawls tender documents, classifies them into multiple categories with BERT‑based models (including softmax, sigmoid and ensemble approaches) and extracts key entities using BERT‑CRF and reading‑comprehension techniques, achieving over 90% overall accuracy and dramatically improving recall and precision.

BERTNLPdocument classification
0 likes · 15 min read
Bidding Document Classification and Entity Extraction Using BERT-based Models
Python Crawling & Data Mining
Python Crawling & Data Mining
Jun 16, 2021 · Artificial Intelligence

Master Chinese Text Segmentation with jieba: Installation, Modes, and Advanced Tricks

This tutorial walks you through installing the jieba Python library, explains its three segmentation modes—precise, full, and search—demonstrates how to add or delete words, manage custom dictionaries, handle stop words, perform weight analysis, adjust word frequencies, and retrieve token positions, all with clear code examples and visual output.

NLPPythonchinese segmentation
0 likes · 10 min read
Master Chinese Text Segmentation with jieba: Installation, Modes, and Advanced Tricks
MaGe Linux Operations
MaGe Linux Operations
Jun 13, 2021 · Fundamentals

7 Fun Python Projects: Web Scraping, Chatbots, Poetry Classification and More

This article presents seven practical Python scripts—from a concise web scraper for Zhihu images and a chatbot conversation loop to a Naive Bayes poem author classifier, a lottery number generator, an automated essay writer, a screen‑capture tool, and a GIF creator—demonstrating how to avoid reinventing the wheel while exploring diverse automation tasks.

ChatbotData GenerationNLP
0 likes · 8 min read
7 Fun Python Projects: Web Scraping, Chatbots, Poetry Classification and More
58 Tech
58 Tech
Jun 4, 2021 · Artificial Intelligence

Architecture and Evolution of the 58 Intelligent Q&A Chatbot System

This article details the design, iterative development, and performance optimizations of 58's AI‑driven intelligent Q&A chatbot, covering its overall three‑layer architecture, the QABot, TaskBot, and answer‑recommendation modules, as well as dynamic strategy adjustment, caching mechanisms, and real‑world deployment results.

AIChatbotDeep Learning
0 likes · 16 min read
Architecture and Evolution of the 58 Intelligent Q&A Chatbot System
Meituan Technology Team
Meituan Technology Team
Jun 3, 2021 · Artificial Intelligence

ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

ConSERT is a contrastive self‑supervised framework that fine‑tunes BERT with augmented sentence views and NT‑Xent loss to overcome embedding collapse, delivering roughly 8% higher STS performance than prior methods, remaining robust in few‑shot and supervised scenarios, and now deployed in Meituan’s NLP pipelines.

BERTNLPcontrastive learning
0 likes · 20 min read
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
Meituan Technology Team
Meituan Technology Team
May 27, 2021 · Artificial Intelligence

Standardizing Food Delivery Dish Names: Knowledge Graph Construction and Applications

The paper outlines an end‑to‑end pipeline that standardizes highly personalized food‑delivery dish names by combining rule‑based and BERT‑DSSM text synonym detection with EfficientNet image classification, constructing a multi‑level taxonomy that improves aggregation, supply‑demand analysis, recall ranking and merchant tagging.

Computer VisionNLPentity extraction
0 likes · 17 min read
Standardizing Food Delivery Dish Names: Knowledge Graph Construction and Applications
Python Programming Learning Circle
Python Programming Learning Circle
May 24, 2021 · Artificial Intelligence

Useful Python Libraries for Data Science Beyond Pandas and NumPy

This article introduces a curated selection of lesser‑known Python libraries for data‑science tasks—including data acquisition, date‑time handling, imbalanced‑learning, fast keyword extraction, fuzzy string matching, time‑series modeling, 3‑D visualization, web‑app building, and reinforcement‑learning—providing installation commands and concise usage examples.

Data ScienceNLPTime Series
0 likes · 10 min read
Useful Python Libraries for Data Science Beyond Pandas and NumPy
TiPaiPai Technical Team
TiPaiPai Technical Team
May 21, 2021 · Artificial Intelligence

How AI Powers Automatic Homework Grading: Challenges and Solutions

Automatic homework grading leverages AI to transform captured images into graded results through preprocessing, layout analysis, OCR, answer matching, and strategy modules, while addressing three question categories—logical, text‑rich, and graphic—each presenting distinct technical challenges and future research directions.

AIEducation TechnologyImage Processing
0 likes · 7 min read
How AI Powers Automatic Homework Grading: Challenges and Solutions
Meituan Technology Team
Meituan Technology Team
May 20, 2021 · Artificial Intelligence

CKBQA Task for Chinese Knowledge Graph Question Answering

The CKBQA task introduced at CCKS2021 challenges participants to build Chinese knowledge‑graph question answering systems that combine Meituan’s life‑services KG with open‑domain data from PKUBASE, requiring accurate, efficient and explainable handling of both domain‑specific and open‑domain queries to advance real‑world applications such as smart customer service and e‑commerce.

AICCKS2021KBQA
0 likes · 4 min read
CKBQA Task for Chinese Knowledge Graph Question Answering
Python Programming Learning Circle
Python Programming Learning Circle
May 5, 2021 · Fundamentals

Python Scripts for Various Automation Tasks: Web Scraping, Chatbots, Poem Classification, Lottery Generation, Apology Writing, Screen Capture, and GIF Creation

This article presents a collection of Python 3.6.4 scripts that demonstrate how to scrape Zhihu images, converse with chatbots, classify Tang poems using NLP, generate random lottery numbers, automatically compose apology letters, capture screen images, and create animated GIFs, providing practical code examples for each task.

ChatbotGIFNLP
0 likes · 9 min read
Python Scripts for Various Automation Tasks: Web Scraping, Chatbots, Poem Classification, Lottery Generation, Apology Writing, Screen Capture, and GIF Creation
Cyber Elephant Tech Team
Cyber Elephant Tech Team
Apr 28, 2021 · Artificial Intelligence

Understanding BERT: From Encoder-Decoder to Transformer and Attention

This article explains the BERT model by first reviewing the Encoder-Decoder framework, then detailing the attention mechanism—including self-attention and multi-head attention—before describing the full Transformer architecture and finally outlining BERT’s encoder-only design, training stages, and fine-tuning applications.

BERTEncoder-DecoderNLP
0 likes · 15 min read
Understanding BERT: From Encoder-Decoder to Transformer and Attention
NetEase Media Technology Team
NetEase Media Technology Team
Apr 13, 2021 · Artificial Intelligence

Applying BERT for News Timeliness Classification at NetEase

The article describes how NetEase adapts a pre‑trained BERT model to classify news articles into ultra‑short, short, or long timeliness categories by combining rule‑based strong and weak time cues, key‑sentence extraction, domain‑embedding fusion and multi‑layer semantic aggregation, achieving accurate and interpretable predictions for its platform.

Artificial IntelligenceBERTModel Fusion
0 likes · 12 min read
Applying BERT for News Timeliness Classification at NetEase
DataFunTalk
DataFunTalk
Apr 6, 2021 · Artificial Intelligence

Advances in Text Summarization: Pointer-Generator, Coverage Mechanisms, Entity Knowledge Integration, and Non-Autoregressive Models

This article reviews recent advances in abstractive summarization, covering pointer‑generator networks with coverage loss, integration of entity knowledge, strategies to mitigate repetition such as unlikelihood training and nucleus sampling, and emerging non‑autoregressive approaches like the Levenshtein Transformer.

NLPPointer-Generatorcoverage
0 likes · 15 min read
Advances in Text Summarization: Pointer-Generator, Coverage Mechanisms, Entity Knowledge Integration, and Non-Autoregressive Models
DataFunTalk
DataFunTalk
Apr 5, 2021 · Artificial Intelligence

Summary of Methods and Findings from the NLP Chinese Pre‑training Model Generalization Challenge

The article reviews the Chinese NLP pre‑training model generalization competition, detailing data preprocessing, augmentation, external data usage, model scaling and architecture tweaks, loss functions, learning‑rate and adversarial training strategies, regularization techniques, post‑processing optimizations, and ineffective methods, highlighting their impact on performance metrics.

Loss FunctionsModel OptimizationNLP
0 likes · 15 min read
Summary of Methods and Findings from the NLP Chinese Pre‑training Model Generalization Challenge
Youku Technology
Youku Technology
Mar 23, 2021 · Artificial Intelligence

Text-Video Alignment Algorithm for Automated Short Video Production at Youku

Youku’s new text‑video alignment system automatically generates short video summaries by extracting multimodal video and linguistic features, matching sentences to clips through embedding and tag‑level models, and enabling AI‑driven auto‑editing that cuts production time from days to minutes.

BERTNLPcross-modal matching
0 likes · 10 min read
Text-Video Alignment Algorithm for Automated Short Video Production at Youku
DataFunSummit
DataFunSummit
Mar 7, 2021 · Artificial Intelligence

A Comprehensive Overview of Multi‑Task Learning in AI: Concepts, Applications, and Practical Tips

This article provides an in‑depth introduction to multi‑task learning (MTL), explaining its core concepts, why it is widely used in recommendation systems, NLP, CV and reinforcement learning, and offering guidance on model architectures, loss design, auxiliary tasks, and practical deployment tips.

MTLNLPRecommendation Systems
0 likes · 19 min read
A Comprehensive Overview of Multi‑Task Learning in AI: Concepts, Applications, and Practical Tips
58 Tech
58 Tech
Mar 5, 2021 · Artificial Intelligence

Intelligent Job Title Generation with Pipeline and Seq2Seq Approaches

This article presents a comprehensive study on generating recruitment job titles by combining a rule‑based pipeline with advanced seq2seq models—including BiLSTM‑Attention, Pointer‑Generator, and a Field‑Gate Dual‑Attention architecture—demonstrating significant performance gains on real‑world hiring data.

NLPPipelinePointer-Generator
0 likes · 14 min read
Intelligent Job Title Generation with Pipeline and Seq2Seq Approaches