Tagged articles
507 articles
Page 5 of 6
HomeTech
HomeTech
Nov 20, 2019 · Artificial Intelligence

Query Understanding and Intent Recognition in Search: Methods, Taxonomy, and Applications

This article explains how query understanding (QP) transforms user search queries into structured semantic blocks and intent categories using rule‑based NLP, entity recognition, and post‑processing, and describes its taxonomy, implementation details, and practical impact on search engine results.

Knowledge GraphNLPQuery Understanding
0 likes · 16 min read
Query Understanding and Intent Recognition in Search: Methods, Taxonomy, and Applications
DataFunTalk
DataFunTalk
Nov 15, 2019 · Artificial Intelligence

MT-BERT: Domain‑Adapted BERT Pre‑training and Fine‑tuning for Meituan‑Dianping NLP Tasks

This article describes the development of MT‑BERT, a BERT‑based language model pre‑trained on Meituan‑Dianping business data, its distributed mixed‑precision training pipeline, domain adaptation, knowledge‑graph integration, model compression techniques, and the wide range of downstream NLP applications achieved in the platform.

BERTKnowledge GraphMeituan
0 likes · 31 min read
MT-BERT: Domain‑Adapted BERT Pre‑training and Fine‑tuning for Meituan‑Dianping NLP Tasks
Meituan Technology Team
Meituan Technology Team
Nov 14, 2019 · Artificial Intelligence

MT-BERT: Pre‑training and Fine‑tuning Practices at Meituan‑Dianping

MT‑BERT at Meituan‑Dianping combines mixed‑precision, domain‑adapted continual pre‑training, knowledge‑graph‑aware masking, and extensive compression techniques to produce fast, accurate BERT models that power fine‑grained sentiment analysis, intent classification, recommendation reasoning, and other NLP tasks across the platform.

BERTKnowledge GraphMT-BERT
0 likes · 33 min read
MT-BERT: Pre‑training and Fine‑tuning Practices at Meituan‑Dianping
360 Tech Engineering
360 Tech Engineering
Nov 13, 2019 · Artificial Intelligence

Text Anti‑Spam Techniques and TextCNN Model for Real‑Time Spam Detection on the Huajiao Platform

This article introduces the Huajiao platform's text anti‑spam architecture, analyzes spam categories and challenges, compares rule‑based and machine‑learning approaches, details traditional NLP methods and the TextCNN deep‑learning model, provides its TensorFlow implementation, and describes the online deployment workflow.

CNNNLPTensorFlow
0 likes · 14 min read
Text Anti‑Spam Techniques and TextCNN Model for Real‑Time Spam Detection on the Huajiao Platform
HomeTech
HomeTech
Nov 13, 2019 · Artificial Intelligence

Sequence Labeling for Entity Recognition in Automotive Search: Techniques and Applications

This article examines how sequence labeling methods such as pattern matching, CRF, and deep‑learning models like BiLSTM‑CRF and BERT are applied to automotive search tasks—including car‑series, model, and location/entity recognition—detailing their development, implementation challenges, and performance results.

BERTCRFNLP
0 likes · 11 min read
Sequence Labeling for Entity Recognition in Automotive Search: Techniques and Applications
vivo Internet Technology
vivo Internet Technology
Nov 12, 2019 · Artificial Intelligence

Elasticsearch Retrieval Optimization in Gitee: Interview with Chen Xin

In an interview, Gitee’s chief architect Chen Xin explains why Elasticsearch was chosen for code search, outlines how combining search with NLP can both aid semantic understanding and enrich repository queries, and shares his views on the platform’s fast‑evolving ecosystem and upcoming community meetup.

ElasticsearchGiteeNLP
0 likes · 4 min read
Elasticsearch Retrieval Optimization in Gitee: Interview with Chen Xin
JD Tech Talk
JD Tech Talk
Nov 5, 2019 · Artificial Intelligence

GeoBERT: A Multi‑Task Pre‑trained Language Model for Chinese Address Text

This article introduces GeoBERT, a novel pre‑training method for Chinese address strings that leverages seven jointly constrained tasks to capture spatial semantics, administrative hierarchy, and similarity relationships, enabling downstream address classification, segmentation, POI extraction, similarity comparison, and authenticity verification with reduced annotation dependence.

Chinese LanguageGeoBERTGeocoding
0 likes · 15 min read
GeoBERT: A Multi‑Task Pre‑trained Language Model for Chinese Address Text
JD Tech Talk
JD Tech Talk
Oct 30, 2019 · Artificial Intelligence

Solution Overview for the Scientific Paper Recommendation Matching Competition

This article presents a comprehensive solution to a competition that requires matching description paragraphs with the three most relevant papers from a 200,000‑paper corpus, detailing background, task definition, evaluation metrics, modeling strategy, and core algorithms such as SIF, InferSent, Bi‑LSTM, and BERT.

BERTNLPcompetition
0 likes · 9 min read
Solution Overview for the Scientific Paper Recommendation Matching Competition
DataFunTalk
DataFunTalk
Oct 21, 2019 · Artificial Intelligence

Challenges and Future Directions of Chatbots: From Virtual Life to Multimodal Interaction

This talk analyzes the technical and practical challenges of current chatbots, introduces the next‑generation paradigm of virtual life, and details how AI technologies such as speech, vision, NLP, and knowledge graphs can be combined with multimodal interaction to create more intelligent and engaging conversational agents.

AIChatbotKnowledge Graph
0 likes · 25 min read
Challenges and Future Directions of Chatbots: From Virtual Life to Multimodal Interaction
Ctrip Technology
Ctrip Technology
Oct 11, 2019 · Artificial Intelligence

Intelligent Content Extraction and Generation Practices on Ctrip's Marco Polo AI Platform

This article details Ctrip's AI‑driven Marco Polo platform, describing how large‑scale NLP pipelines combine extraction, richness evaluation, semantic matching and deep‑learning generation (CopyNet, TA‑seq2seq) to produce high‑quality recommendation reasons across multiple product scenarios.

Content ExtractionNLPRecommendation Systems
0 likes · 16 min read
Intelligent Content Extraction and Generation Practices on Ctrip's Marco Polo AI Platform
DataFunTalk
DataFunTalk
Oct 9, 2019 · Artificial Intelligence

Multilingual Content Understanding in UC International Feed Recommendation

This article presents a comprehensive overview of the challenges, requirements, and technical solutions for multilingual content understanding in UC's international information‑flow recommendation system, covering structured signal construction, low‑resource NLP techniques, transfer learning, quality modeling, and image‑based signal integration.

NLPRecommendation Systemscontent understanding
0 likes · 14 min read
Multilingual Content Understanding in UC International Feed Recommendation
DataFunTalk
DataFunTalk
Sep 26, 2019 · Artificial Intelligence

NLP Algorithm Practices in Alibaba's Brand Advertising

This article presents a comprehensive overview of Alibaba's brand advertising business model, its technical architecture, and the practical application of NLP algorithms—specifically brand intent recognition and short‑text relevance—detailing model evolution, evaluation results, and future research directions.

BERTBrand AdvertisingNLP
0 likes · 14 min read
NLP Algorithm Practices in Alibaba's Brand Advertising
Qunar Tech Salon
Qunar Tech Salon
Sep 12, 2019 · Artificial Intelligence

A Comprehensive Overview of Attention Mechanisms in Deep Learning

This article systematically reviews the history, core concepts, variants, and practical implementations of attention mechanisms—from early additive and multiplicative forms to self‑attention, multi‑head attention, and recent transformer‑based models—highlighting why attention has become fundamental in modern AI research.

Deep LearningNLPSelf-Attention
0 likes · 16 min read
A Comprehensive Overview of Attention Mechanisms in Deep Learning
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 11, 2019 · Artificial Intelligence

How Cognitive Concept Graphs Power Modern Search Understanding

This article explains the motivation, challenges, architecture, and algorithms behind building a large‑scale cognitive concept graph for search, detailing data construction, concept mining, fusion, confidence scoring, hierarchical structuring, validation, service algorithms, platform access, and real‑world applications such as intent recognition and entity recommendation.

Knowledge GraphNLPcognitive concept graph
0 likes · 19 min read
How Cognitive Concept Graphs Power Modern Search Understanding
Xianyu Technology
Xianyu Technology
Aug 20, 2019 · Artificial Intelligence

How Xianyu Boosted Sales with an AI-Powered Auto-Reply Chatbot

Xianyu tackled slow seller responses, inconsistent answers, and bargaining friction by building an AI-driven auto-reply system that extracts product attributes, generates knowledge bases, and uses intent recognition and response generation models, resulting in faster replies, higher conversion rates, and reduced seller workload.

AIAlibabaChatbot
0 likes · 12 min read
How Xianyu Boosted Sales with an AI-Powered Auto-Reply Chatbot
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 19, 2019 · Artificial Intelligence

How RE2 Boosts FAQ Chatbot Accuracy: A Deep Dive into Text Matching Models

This article explains the design and evaluation of RE2, a lightweight yet expressive text‑matching framework for FAQ‑style chatbots, detailing its five‑layer architecture, block‑wise residual connections, experimental results on SNLI, MultiNLI, SciTail, Quora and WikiQA datasets, and its significant performance improvements in Alibaba’s DingXiaoMi service.

Deep LearningFAQ chatbotIndustrial AI
0 likes · 13 min read
How RE2 Boosts FAQ Chatbot Accuracy: A Deep Dive into Text Matching Models
Youku Technology
Youku Technology
Aug 14, 2019 · Artificial Intelligence

Technical Analysis of “Chang'an” – The Beidou Star System for Reducing Content Uncertainty and Boosting Hit Potential

The talk details how Youku’s Beidou Star AI platform deconstructs the drama “Chang’an Twelve Hours” with NLP, computer‑vision, knowledge graphs and multi‑task deep models to quantify script, character and emotion uncertainty, enabling predictive scoring that lifted the series’ daily index above one million and outlines future hybrid decision‑engine research.

AIComputer VisionContent Analytics
0 likes · 12 min read
Technical Analysis of “Chang'an” – The Beidou Star System for Reducing Content Uncertainty and Boosting Hit Potential
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 9, 2019 · Artificial Intelligence

Demystifying Attention: A Beginner’s Guide to History, Types, and Applications

This article provides a comprehensive, beginner‑friendly overview of attention mechanisms—from their origins in early neural machine translation papers to modern self‑attention, multi‑head attention, and transformer variants—explaining core concepts, common variants, and why attention has become essential across NLP and vision tasks.

NLPattention
0 likes · 18 min read
Demystifying Attention: A Beginner’s Guide to History, Types, and Applications
JD Retail Technology
JD Retail Technology
Aug 8, 2019 · Artificial Intelligence

From Word Representations to Sentiment Analysis – Talk by Dr. Feng Ao

On August 6, Dr. Feng Ao presented a comprehensive overview of the evolution of word representations and sentiment analysis, illustrating the shift from traditional linguistic features to modern pretrained models such as BERT and XLNet, and sharing practical convolutional experiments relevant to industry applications.

NLPSentiment Analysisartificial intelligence
0 likes · 4 min read
From Word Representations to Sentiment Analysis – Talk by Dr. Feng Ao
Didi Tech
Didi Tech
Aug 2, 2019 · Artificial Intelligence

How Didi’s Open‑Source DELTA Platform Accelerates NLP and Speech Model Development

At ACL 2019, Didi unveiled DELTA, an open‑source TensorFlow‑based training framework that unifies NLP and speech tasks, offers configurable pipelines, benchmark models, and seamless deployment, enabling AI developers to quickly move from research to production while leveraging Didi’s extensive open‑source ecosystem.

AI PlatformModel TrainingNLP
0 likes · 6 min read
How Didi’s Open‑Source DELTA Platform Accelerates NLP and Speech Model Development
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 9, 2019 · Artificial Intelligence

Demystifying Attention: A Clear Guide to Its History, Types, and Why It Works

This article systematically reviews the evolution of attention mechanisms—from early additive and multiplicative forms to self‑attention and multi‑head variants—explaining their core three‑step framework, key differences, and why they have become essential across NLP, vision, and broader AI applications.

Deep LearningNLPSelf-Attention
0 likes · 19 min read
Demystifying Attention: A Clear Guide to Its History, Types, and Why It Works
Beike Product & Technology
Beike Product & Technology
Jun 28, 2019 · Artificial Intelligence

Building a Comprehensive Tagging System for Real‑Estate Recommendation at Beike

This article explains how Beike, China’s largest residential service platform, leverages its massive house, client, and text data to design a multi‑layered tag architecture, detailing data sources, tag construction methods—including classification, keyword, geographic, anonymous topic, and temporal tags—and their application to improve personalized house search and recommendation.

NLPReal EstateTagging
0 likes · 14 min read
Building a Comprehensive Tagging System for Real‑Estate Recommendation at Beike
DataFunTalk
DataFunTalk
Jun 23, 2019 · Artificial Intelligence

Understanding XLNet: Differences from BERT, Innovations, and Experimental Analysis

This article examines XLNet, contrasting it with BERT by detailing its novel permutation language modeling, dual‑stream attention, and larger pre‑training data, and analyzes experimental results that show XLNet’s superior performance on reading‑comprehension, GLUE, and other NLP tasks, especially for long documents.

BERTNLPPermutation Language Model
0 likes · 27 min read
Understanding XLNet: Differences from BERT, Innovations, and Experimental Analysis
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 18, 2019 · Artificial Intelligence

From Word2Vec to Quick-Thought: A Complete Guide to Modern Embeddings

This article reviews the evolution of word and sentence embeddings, covering foundational theories like vector semantics and distributional hypothesis, practical models such as Word2Vec, GloVe, fastText, Skip‑Thought, Quick‑Thought, and evaluation techniques, while offering implementation tips and real‑world use cases.

GloVeNLPWord2Vec
0 likes · 21 min read
From Word2Vec to Quick-Thought: A Complete Guide to Modern Embeddings
DataFunTalk
DataFunTalk
Jun 10, 2019 · Artificial Intelligence

BERT Applications Across NLP Domains: Progress, Challenges, and Future Directions

This article surveys the rapid proliferation of BERT-based research over the past six months, analyzing its impact on various NLP tasks such as question answering, information retrieval, dialog systems, summarization, data augmentation, classification, and sequence labeling, while also discussing the model's strengths, limitations, and future research opportunities.

BERTNLPdata augmentation
0 likes · 52 min read
BERT Applications Across NLP Domains: Progress, Challenges, and Future Directions
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 5, 2019 · Artificial Intelligence

Tracing the Evolution of Language Models: From N‑grams to GPT‑2

This article reviews the historical development of natural language processing language models, covering expert rule‑based systems, statistical n‑grams, smoothing techniques, neural network models such as NNLM, RNN, word2vec, GloVe, ELMo, and the transformer‑based breakthroughs of GPT, BERT and GPT‑2, and summarizes their impact on modern NLP tasks.

BERTDeep LearningGPT
0 likes · 25 min read
Tracing the Evolution of Language Models: From N‑grams to GPT‑2
58 Tech
58 Tech
May 31, 2019 · Artificial Intelligence

Deep Learning Approaches for Chinese Word Segmentation: BiLSTM‑CRF and BERT

This article reviews modern deep‑learning methods for Chinese word segmentation, comparing traditional CRF‑based approaches with BiLSTM‑CRF and BERT models, describing their architectures, training procedures, experimental results, and practical considerations for deployment.

BERTBiLSTMCRF
0 likes · 17 min read
Deep Learning Approaches for Chinese Word Segmentation: BiLSTM‑CRF and BERT
DataFunTalk
DataFunTalk
May 29, 2019 · Artificial Intelligence

General‑Domain Conversational QA: Technologies, Challenges, and Alibaba UC’s Practice

This article reviews the evolution, architecture, and key technical challenges of general‑domain conversational QA systems, describing Alibaba UC’s search background, dialogue bot types, data pipelines, and advanced methods such as transfer learning, few‑shot learning, and multi‑dimensional dialogue management.

AlibabaDialogue SystemsFew‑Shot Learning
0 likes · 12 min read
General‑Domain Conversational QA: Technologies, Challenges, and Alibaba UC’s Practice
Alibaba Cloud Developer
Alibaba Cloud Developer
May 27, 2019 · Artificial Intelligence

From Neurons to BERT: Tracing the Evolution of Deep Learning in NLP

This article walks through the development of deep learning for natural language processing, starting with basic neural cells and shallow networks, then exploring CNNs, RNNs, LSTMs, TextCNN, ESIM, ELMo, and culminating with the Transformer‑based BERT model, its training objectives, fine‑tuning strategies, and performance comparisons.

BERTCNNDeep Learning
0 likes · 19 min read
From Neurons to BERT: Tracing the Evolution of Deep Learning in NLP
Beike Product & Technology
Beike Product & Technology
May 23, 2019 · Artificial Intelligence

Practical Applications and Challenges of Machine Learning and AI at QCon Beijing 2019

At QCon Beijing 2019, four Beike technology experts presented the practical use and challenges of machine learning for user profiling, deep‑learning‑based house‑quality scoring, intelligent customer‑service systems, and AI‑driven floor‑plan generation, summarizing the architecture, data pipelines, model evolution, and future improvement directions.

AIDeep LearningGAN
0 likes · 16 min read
Practical Applications and Challenges of Machine Learning and AI at QCon Beijing 2019
Qunar Tech Salon
Qunar Tech Salon
Apr 29, 2019 · Artificial Intelligence

Multi‑Level Deep Model Fusion for Fake News Detection Using BERT – Winning Solution of WSDM Cup 2019

The article details the Travel team's award‑winning solution for the WSDM Cup 2019 fake‑news detection task, describing data analysis, preprocessing, label‑propagation augmentation, a BERT‑based baseline, a three‑stage multi‑level model‑fusion framework, experimental results, and future directions.

BERTModel FusionNLP
0 likes · 12 min read
Multi‑Level Deep Model Fusion for Fake News Detection Using BERT – Winning Solution of WSDM Cup 2019
Youku Technology
Youku Technology
Apr 22, 2019 · Artificial Intelligence

Exploring the Construction of an Entertainment Brain: AI and Big Data Practices in the Fish Brain Platform

The talk introduces Alibaba’s Fish Brain platform, an AI‑powered decision‑support system for entertainment that combines a three‑layer data‑model, AI‑processed basic data, and application models, leveraging NLP, computer‑vision, custom embeddings, loss functions and predictive hybrid networks to analyze content, user behavior, and forecast performance.

AIBig DataEmbedding
0 likes · 12 min read
Exploring the Construction of an Entertainment Brain: AI and Big Data Practices in the Fish Brain Platform
DataFunTalk
DataFunTalk
Apr 8, 2019 · Artificial Intelligence

AI Scientific Frontier Conference 2019 – Program, Speakers, and Schedule

The AI Scientific Frontier Conference 2019, co‑hosted by the Chinese Academy of Sciences AI Alliance and Beijing Institute of Technology, gathers leading researchers to present cutting‑edge talks on AI theory, deep learning, computer vision, robotics, NLP, big data, and related applications, with detailed schedules, speaker bios, venue information, and registration details provided.

AIData ScienceDeep Learning
0 likes · 60 min read
AI Scientific Frontier Conference 2019 – Program, Speakers, and Schedule
Hulu Beijing
Hulu Beijing
Apr 4, 2019 · Artificial Intelligence

How BERT, GPT, and ELMo Revolutionize Language Feature Representation

Natural language processing, a cornerstone of AI, relies on language models to capture linguistic features; this article reviews classic pre‑training models—ELMo, GPT, and BERT—explaining their architectures, training objectives, and how they boost downstream NLP tasks despite data‑scarcity challenges.

BERTDeep LearningELMo
0 likes · 10 min read
How BERT, GPT, and ELMo Revolutionize Language Feature Representation
Python Crawling & Data Mining
Python Crawling & Data Mining
Apr 4, 2019 · Artificial Intelligence

Build and Visualize Three Kingdoms Character Networks with Python

This article demonstrates how to extract the character relationships from the classic novel Romance of the Three Kingdoms using Python, building entity dictionaries, constructing a co‑occurrence social network with HarvestText and NetworkX, visualizing subgraphs, ranking important figures, detecting communities, and creating an animated view of the network’s evolution over the story.

HarvestTextNLPPython
0 likes · 13 min read
Build and Visualize Three Kingdoms Character Networks with Python
DataFunTalk
DataFunTalk
Mar 15, 2019 · Artificial Intelligence

Designing Personalized, Dynamic, and Multimodal Knowledge Graphs for Chatbots

The article explores how chatbots require personalized dense knowledge graphs, dynamic temporal graphs, subjective emotion modeling, integration with external services, and multimodal media support, while also promoting a new NLP book and a related giveaway for readers.

AIChatbotDynamic Graph
0 likes · 9 min read
Designing Personalized, Dynamic, and Multimodal Knowledge Graphs for Chatbots
Meituan Technology Team
Meituan Technology Team
Mar 14, 2019 · Artificial Intelligence

Beauty Guide: How Meituan's Information Flow Content Team Uses Text Generation Technology for Creative Optimization

Meituan’s information flow content team leverages advanced text‑generation techniques—including extractive and generative models, context‑aware representations, and Seq2Seq with attention—to overcome creative constraints, improve evaluation, and boost business metrics such as click‑through rates and user engagement through optimized titles and merchant copywriting.

AIInformation FlowMeituan
0 likes · 32 min read
Beauty Guide: How Meituan's Information Flow Content Team Uses Text Generation Technology for Creative Optimization
DataFunTalk
DataFunTalk
Mar 13, 2019 · Artificial Intelligence

A Comprehensive Overview of NLP Development and Deep Learning Models

This article reviews the history of natural language processing, explains key deep‑learning models such as NNLM, Word2vec, CNN, RNN, attention mechanisms, and Transformers, and discusses their applications, future trends, and practical considerations in NLP tasks.

NLPTransformerattention
0 likes · 38 min read
A Comprehensive Overview of NLP Development and Deep Learning Models
Mafengwo Technology
Mafengwo Technology
Mar 8, 2019 · Artificial Intelligence

How Dynamic Template Matching Transforms User Review Tag Extraction

This article explains a flexible template‑matching approach that dynamically extracts concise, user‑friendly tags from online travel reviews, detailing the system architecture, key concepts, step‑by‑step implementation, and matching rules that improve recall and relevance.

AINLPTemplate Matching
0 likes · 15 min read
How Dynamic Template Matching Transforms User Review Tag Extraction
DataFunTalk
DataFunTalk
Feb 25, 2019 · Artificial Intelligence

NLP Research and Practice at Hulu: From Historical Milestones to Product Development

This article recounts a Hulu NLP research engineer's experience, reviewing key milestones such as NNLM, Word2vec, Transformer and BERT, and then contrasting academic research with product development while illustrating real-world projects like news personalization and content embedding, and describing the supporting AI platform architecture.

AIBERTHulu
0 likes · 23 min read
NLP Research and Practice at Hulu: From Historical Milestones to Product Development
Vipshop Quality Engineering
Vipshop Quality Engineering
Feb 22, 2019 · Artificial Intelligence

How Vipshop Built an AI‑Powered Sentiment Analysis System for Real‑Time Customer Feedback

Vipshop's in‑house sentiment monitoring platform integrates web‑scraped reviews, WeChat comments and internal service messages, applying lexical sentiment scoring, dictionary‑based Chinese word segmentation, TF‑IDF keyword ranking and lightweight classification to deliver real‑time insights, alerts and actionable reports for thousands of daily user comments.

Big DataNLPSentiment Analysis
0 likes · 17 min read
How Vipshop Built an AI‑Powered Sentiment Analysis System for Real‑Time Customer Feedback
Meituan Technology Team
Meituan Technology Team
Feb 21, 2019 · Artificial Intelligence

Fake News Detection with Multi‑level BERT Fusion at WSDM Cup 2019

In the WSDM Cup 2019 fake-news detection challenge, the Meituan Travel team secured second place by combining extensive data analysis, Chinese-English BERT fine-tuning, label-propagation augmentation, and a three-level fusion framework—blending, stacking, and linear regression—that lifted weighted accuracy to 0.88156.

BERTModel FusionNLP
0 likes · 16 min read
Fake News Detection with Multi‑level BERT Fusion at WSDM Cup 2019
Meituan Technology Team
Meituan Technology Team
Jan 25, 2019 · Artificial Intelligence

Interview with Dr. Wang Zhongyuan: AI Research, Knowledge Graphs, and Career Journey

Dr. Wang Zhongyuan, a Renmin University graduate turned AI visionary, built a world‑class NLP center and a 1.8‑billion‑entity knowledge graph at Meituan after pioneering knowledge‑graph research at Microsoft Research Asia and leading Facebook’s entity‑linking service, emphasizing solid fundamentals, data integration, and patient, continuous learning for AI impact.

AINLPcareer
0 likes · 27 min read
Interview with Dr. Wang Zhongyuan: AI Research, Knowledge Graphs, and Career Journey
Meituan Technology Team
Meituan Technology Team
Jan 25, 2019 · Artificial Intelligence

Fine-grained User Review Sentiment Classification: AI Challenger 2018 Champion's Approach

Cheng Huige’s winning AI Challenger 2018 solution treated fine‑grained Chinese review sentiment as a 20‑aspect multi‑class task, combining a high‑capacity LSTM encoder with self‑attention, word‑and‑character embeddings, simplified ELMo pre‑training, diverse tokenizations and a weighted seven‑model ensemble (including BERT), which together delivered the competition’s top F1 performance.

BERTDeep LearningELMo
0 likes · 14 min read
Fine-grained User Review Sentiment Classification: AI Challenger 2018 Champion's Approach
58 Tech
58 Tech
Jan 22, 2019 · Artificial Intelligence

Chinese Word Segmentation: Challenges, Methods, and Practical Practices

The article explains why Chinese word segmentation is essential for NLP tasks, outlines its fundamental difficulties such as ambiguity and out‑of‑vocabulary words, reviews dictionary‑based, statistical, and CRF approaches, and shares practical experiences from 58 Search’s production system.

CRFLanguage ModelNLP
0 likes · 21 min read
Chinese Word Segmentation: Challenges, Methods, and Practical Practices
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 22, 2019 · Artificial Intelligence

How Tmall’s “Most Concerned” Feature Uses AI to Match Reviews with Consumer Questions

The article explains how Tmall’s new “Most Concerned” module leverages NLP techniques, fastText embeddings, Bi‑LSTM classifiers, and a custom clustering algorithm to filter, group, and link consumer questions with relevant product reviews, improving the shopping experience across many product categories.

AINLPclustering
0 likes · 9 min read
How Tmall’s “Most Concerned” Feature Uses AI to Match Reviews with Consumer Questions
Meituan Technology Team
Meituan Technology Team
Jan 10, 2019 · Artificial Intelligence

Deep Learning and Ranking Model Evolution for Hotel Search at Meituan

The talk explains how Meituan transformed its O2O hotel search by layering a multi‑stage retrieval pipeline with intent‑aware NLP, then progressively upgrading ranking—from XGBoost to MLPs, feature‑embedding networks, and finally a Wide‑Deep multi‑task model—while tackling data sparsity, diverse scenarios, and deploying the system via TensorFlow‑Serving and the in‑house MLX platform.

Deep LearningMeituanNLP
0 likes · 33 min read
Deep Learning and Ranking Model Evolution for Hotel Search at Meituan
iQIYI Technical Product Team
iQIYI Technical Product Team
Jan 4, 2019 · Artificial Intelligence

NLP-based Text Opinion Extraction and Sentiment Analysis for iQIYI Video Comments

iQIYI’s NLP pipeline—combining CRF‑based segmentation, bidirectional LSTM/GRU models with attention and a CNN classifier—automatically extracts opinion targets, sentiment words and polarity from unstructured video comments, aggregates them across users to reveal collective attitudes toward actors, plot, and visual effects, and guides future work on implicit opinions and broader sentiment domains.

Deep LearningNLPiQIYI
0 likes · 12 min read
NLP-based Text Opinion Extraction and Sentiment Analysis for iQIYI Video Comments
DataFunTalk
DataFunTalk
Dec 27, 2018 · Artificial Intelligence

Construction and Application of a Tourism Knowledge Graph

This article explains what a tourism knowledge graph is, discusses its architecture, construction methods, practical applications such as QA and recommendation, and explores future directions integrating knowledge graphs with deep learning and multi‑domain fusion.

AIKnowledge GraphNLP
0 likes · 10 min read
Construction and Application of a Tourism Knowledge Graph
DataFunTalk
DataFunTalk
Dec 24, 2018 · Artificial Intelligence

Application Scenarios and Practical Implementation of NLP in Yuewen's Content Mining Platform

This article details the business background, technical architecture, and practical deployments of natural language processing for content mining at Yuewen, covering tag construction, knowledge‑graph building, role analysis, recommendation generation, porn and plagiarism detection, and summarizing lessons learned.

Content MiningNLPtext analysis
0 likes · 15 min read
Application Scenarios and Practical Implementation of NLP in Yuewen's Content Mining Platform
Tencent Cloud Developer
Tencent Cloud Developer
Dec 21, 2018 · Artificial Intelligence

Tencent Xiaowei Conversational AI Platform: Architecture, Models, and Applications

Tencent Xiaowei is an open, easy‑to‑integrate conversational AI platform that combines NLU, dialogue management and generation, supports multi‑turn context via Memory Networks, uses bidirectional RNN and CNN‑based intent classifiers, and powers smart speakers, TVs and customer‑service bots by leveraging Tencent’s rich content ecosystem.

Conversational AIDialogue SystemsNLP
0 likes · 11 min read
Tencent Xiaowei Conversational AI Platform: Architecture, Models, and Applications
Ctrip Technology
Ctrip Technology
Dec 19, 2018 · Artificial Intelligence

Design and Implementation of an AI-Powered Medical Dialogue Assistant

This article describes the challenges and solutions encountered while developing an AI-driven medical dialogue assistant, covering data acquisition, preprocessing, model selection such as DCNN and Bi‑LSTM‑CRF, question generation, and system architecture, with insights applicable to similar healthcare chatbot projects.

AIBiLSTMDCNN
0 likes · 13 min read
Design and Implementation of an AI-Powered Medical Dialogue Assistant
DataFunTalk
DataFunTalk
Dec 11, 2018 · Artificial Intelligence

Multi-Task Learning in Natural Language Processing

An in‑depth overview of multi‑task learning for natural language processing is presented, covering deep learning foundations, challenges, various multi‑task learning paradigms (hard, soft, shared‑private, function‑level, hierarchical, and search‑based sharing), benchmark platforms, and future research directions, illustrated with numerous diagrams.

Deep LearningNLP
0 likes · 15 min read
Multi-Task Learning in Natural Language Processing
Beike Product & Technology
Beike Product & Technology
Dec 6, 2018 · Artificial Intelligence

Designing and Deploying a Real‑Estate Dialogue System: Architecture, Challenges, and Practices

The talk outlines how Beike built a real‑estate conversational AI platform, covering the market need for dialogue systems, the five technical challenges, data‑driven intent and slot extraction, model choices such as FastText and Bi‑LSTM‑CRF, a three‑layer system architecture, multi‑intent handling, and future directions like 4D viewing and an internal AI dialogue platform.

BILSTM-CRFKnowledge GraphNLP
0 likes · 26 min read
Designing and Deploying a Real‑Estate Dialogue System: Architecture, Challenges, and Practices
Meituan Technology Team
Meituan Technology Team
Nov 22, 2018 · Artificial Intelligence

Meituan Brain: Large‑Scale Knowledge Graph Construction and Applications

Meituan Brain builds a massive multi‑modal knowledge graph of billions of entities and triples across food, entertainment, and travel, using advanced extraction, validation, fusion, and reasoning techniques to empower search, recommendation, merchant tools, and fraud detection while addressing scalability and schema‑evolution challenges.

AIKnowledge GraphMeituan
0 likes · 28 min read
Meituan Brain: Large‑Scale Knowledge Graph Construction and Applications
iQIYI Technical Product Team
iQIYI Technical Product Team
Nov 2, 2018 · Artificial Intelligence

iQIYI Tech Salon Session 3 (Beijing): AI Technology Practices and Applications

The third iQIYI Tech Salon in Beijing, held despite strong winds, showcased five expert talks on AI‑driven video library management, NLP for entertainment content, AI‑based video encoding, traffic anti‑fraud systems, and short‑video personalized recommendation, illustrating AI’s impact on content creation, quality, security, and user experience.

AINLPRecommendation Systems
0 likes · 5 min read
iQIYI Tech Salon Session 3 (Beijing): AI Technology Practices and Applications
Meituan Technology Team
Meituan Technology Team
Nov 1, 2018 · Artificial Intelligence

Knowledge Graphs and Their Applications in Meituan's AI Platform

Meituan’s AI platform leverages a massive “Meituan Brain” knowledge graph—combining a common‑sense graph and an encyclopedia‑scale graph of billions of reviews, millions of dishes and merchant tags—integrated with deep‑learning models to enable interpretable, cross‑scenario search, recommendation, sentiment analysis and AI‑driven dining assistance.

NLPText Understandingartificial intelligence
0 likes · 21 min read
Knowledge Graphs and Their Applications in Meituan's AI Platform
DataFunTalk
DataFunTalk
Nov 1, 2018 · Artificial Intelligence

Intelligent Customer Service at Meituan: NLP Techniques, System Architecture, and Real‑World Deployment

The article presents a comprehensive overview of Meituan's intelligent customer service system, detailing its evolution, the roles of QABot, TaskBot and ChatBot, the underlying NLP and knowledge‑graph technologies, model implementations such as DSSM and Seq2seq, and the impressive operational results achieved in food‑delivery and ride‑hailing services.

Deep LearningMeituanNLP
0 likes · 14 min read
Intelligent Customer Service at Meituan: NLP Techniques, System Architecture, and Real‑World Deployment
DataFunTalk
DataFunTalk
Sep 21, 2018 · Artificial Intelligence

Construction of a Second‑Hand E‑commerce Knowledge Graph and Its Application in Pricing Models

This article explains how a knowledge graph for second‑hand e‑commerce is built—from data extraction and entity, attribute, and relation mining to ontology construction, entity alignment, and graph integration—and describes how the resulting graph supports personalized recommendation, search optimization, and statistical or regression‑based pricing models.

Knowledge GraphNLPentity extraction
0 likes · 15 min read
Construction of a Second‑Hand E‑commerce Knowledge Graph and Its Application in Pricing Models
ITPUB
ITPUB
Sep 4, 2018 · Fundamentals

Essential Python Libraries Every Data Scientist Should Know

This article surveys the most useful open‑source Python packages for data science, covering core numerical libraries, visualization tools, machine‑learning frameworks, natural‑language‑processing kits, and data‑mining utilities, while showing GitHub contribution metrics and Google‑trend popularity.

Data ScienceNLPPython
0 likes · 12 min read
Essential Python Libraries Every Data Scientist Should Know
DataFunTalk
DataFunTalk
Sep 2, 2018 · Artificial Intelligence

From Zero to One: Building and Deploying Knowledge Graphs at Beike Real Estate

This article details the evolution, architecture, and practical applications of knowledge graphs at Beike Real Estate, covering their historical background, five‑view advantages, data pipelines, ontology construction, intelligent search, recommendation, and chatbot integration, while also discussing challenges and future directions.

Intelligent AssistantKnowledge GraphNLP
0 likes · 13 min read
From Zero to One: Building and Deploying Knowledge Graphs at Beike Real Estate
DataFunTalk
DataFunTalk
Aug 17, 2018 · Artificial Intelligence

Technical Evolution and Architecture of Shenma Search Engine

The article outlines Shenma Search's development history, its AI‑driven relevance and ranking technologies, the underlying system architecture based on Zookeeper and YARN, and discusses challenges in query understanding, machine‑learning ranking, and deep‑learning solutions for large‑scale search.

AINLPmachine learning
0 likes · 17 min read
Technical Evolution and Architecture of Shenma Search Engine
AntTech
AntTech
Aug 1, 2018 · Artificial Intelligence

Highlights and Paper Summaries from ACL 2018 Conference

An extensive overview of ACL 2018, featuring acceptance statistics, award-winning papers, tutorial insights, and concise summaries of notable research across machine translation, semantic parsing, question answering, domain adaptation, text classification, summarization, dialogue systems, generation, and related tools.

ACL 2018Dialogue SystemsNLP
0 likes · 12 min read
Highlights and Paper Summaries from ACL 2018 Conference
DataFunTalk
DataFunTalk
Jul 30, 2018 · Artificial Intelligence

Enhancing Automated Process Services with Multi‑Turn Dialogue: Insights from Chatopera’s NLP Solutions

The article presents a technical overview of Chatopera’s multi‑turn dialogue platform, covering language model fundamentals, Chinese segmentation, word embeddings, information retrieval, and open‑source tools, while illustrating how these AI techniques enable low‑cost, scalable enterprise chatbot solutions.

NLPinformation retrievalmulti-turn dialogue
0 likes · 11 min read
Enhancing Automated Process Services with Multi‑Turn Dialogue: Insights from Chatopera’s NLP Solutions
Didi Tech
Didi Tech
Jul 27, 2018 · Artificial Intelligence

AI Applications in Mapping and Computer Vision at Didi Tech Seminar

The Didi Tech Seminar showcased how AI—through DeepMotion’s mapping solutions, Sogou’s NLP‑enhanced search, and advanced computer‑vision techniques—augments Didi’s core map engine for ETA estimation, route planning, and dynamic pickup/drop‑off updates, while addressing integration challenges between AI models and traditional mapping algorithms.

AIMappingNLP
0 likes · 6 min read
AI Applications in Mapping and Computer Vision at Didi Tech Seminar
DataFunTalk
DataFunTalk
Jul 2, 2018 · Artificial Intelligence

Overview of Sogou Information Feed Recommendation Algorithms

This article summarizes Sogou's information‑feed recommendation system, covering the architecture from data collection and NLP processing to recall, ranking, and feedback, and detailing the classification, tagging, keyword extraction, and various recall and ranking models such as FastText, TextCNN, collaborative filtering, and wide‑and‑deep learning.

NLPSogouinformation feed
0 likes · 14 min read
Overview of Sogou Information Feed Recommendation Algorithms
Tencent Cloud Developer
Tencent Cloud Developer
Mar 29, 2018 · Artificial Intelligence

How AI Powers a Smart Ops Bot for Seamless Dev‑Ops Collaboration

This article explains the motivation behind the growing gap between developers and operations, introduces Tencent Cloud's AI‑driven intelligent operations robot, outlines its core features, typical use cases, and dives into the retrieval‑based dialogue system and matching models that enable natural‑language interactions.

AI OpsChatbotDevOps
0 likes · 13 min read
How AI Powers a Smart Ops Bot for Seamless Dev‑Ops Collaboration
Baobao Algorithm Notes
Baobao Algorithm Notes
Mar 25, 2018 · Artificial Intelligence

How to Crush the Kaggle Toxic Comment Challenge: Data Prep, Models, and Ensemble Secrets

This article breaks down the Kaggle toxic comment classification competition, detailing thorough data cleaning, advanced word‑vector techniques, pseudo‑labeling, BPE tokenization, diverse neural models and ensemble strategies, and shares practical insights and pitfalls from the author's nine‑month competition journey.

BPEKaggleNLP
0 likes · 9 min read
How to Crush the Kaggle Toxic Comment Challenge: Data Prep, Models, and Ensemble Secrets
MaGe Linux Operations
MaGe Linux Operations
Mar 22, 2018 · Artificial Intelligence

Mapping Character Relationships in 'Heavenly Sword and Dragon Slaying' with Jieba, Word2Vec & NetworkX

This article demonstrates how to combine Jieba segmentation, Word2Vec embeddings, and NetworkX graph visualization to extract and analyze character relationships from the Chinese novel "Heavenly Sword and Dragon Slaying," detailing data preparation, model training, entity matrix construction, and network graph generation.

Graph VisualizationNLPPython
0 likes · 10 min read
Mapping Character Relationships in 'Heavenly Sword and Dragon Slaying' with Jieba, Word2Vec & NetworkX
Tencent Cloud Developer
Tencent Cloud Developer
Mar 21, 2018 · Artificial Intelligence

Abusive Comment Detection Using TextCNN: A Strategy + Algorithm Approach

The article proposes a hybrid approach that first filters blacklist words and then classifies suspicious comments with a character-level TextCNN, achieving around 89% precision and 87% recall, demonstrating that simple convolutional networks outperform keyword filters and RNNs for short, noisy abusive Chinese text.

Abusive Comment DetectionDeep LearningNLP
0 likes · 10 min read
Abusive Comment Detection Using TextCNN: A Strategy + Algorithm Approach
Meituan Technology Team
Meituan Technology Team
Mar 8, 2018 · Artificial Intelligence

Interview with Ren Dengjun: From NLP Research to Advertising Tech Leadership at Meituan

Ren Dengjun, a Meituan Dianping tech leader, recounts her shift from NLP research at Northeastern to industry roles at Toshiba, Alibaba, and Youyi, now heading Meituan’s location‑based advertising platform, while sharing insights on leadership, gender bias, and balancing technical and managerial challenges.

MeituanNLPRen Dengjun
0 likes · 17 min read
Interview with Ren Dengjun: From NLP Research to Advertising Tech Leadership at Meituan
Qunar Tech Salon
Qunar Tech Salon
Jan 10, 2018 · Artificial Intelligence

An Introductory Overview of Natural Language Processing

Natural Language Processing, a branch of AI, is traced from its Turing origins through early rule‑based methods, statistical and deep‑learning paradigms, covering lexical analysis, syntax, semantics, knowledge graphs, and current applications, highlighting historical shifts, challenges, and future research directions.

Knowledge GraphsLanguage ProcessingNLP
0 likes · 22 min read
An Introductory Overview of Natural Language Processing
iQIYI Technical Product Team
iQIYI Technical Product Team
Dec 15, 2017 · Artificial Intelligence

Sentiment Classification of iQIYI User Comments: Model Selection, Feature Engineering, and Online Deployment

The team built a lightweight three‑class sentiment classifier for iQIYI user comments using a linear‑kernel SVM with high‑dimensional bag‑of‑words features and an expanded ~100k word lexicon, achieving over 96% accuracy across domains, and deployed it as a Spring Boot PMML service with zero‑downtime refresh, while planning GBDT‑enhanced features and word‑embedding optimizations.

DeploymentNLPSentiment Analysis
0 likes · 13 min read
Sentiment Classification of iQIYI User Comments: Model Selection, Feature Engineering, and Online Deployment
Alibaba Cloud Developer
Alibaba Cloud Developer
Nov 25, 2017 · Artificial Intelligence

How Alibaba’s NLP Team Dominated Global Entity Extraction and Chinese Grammar Competitions

Alibaba’s iDST NLP team, led by Dr. Si Luo, clinched the top spot in both the KBP2017 English entity discovery challenge and the 2017 Chinese Grammatical Error Diagnosis contest, showcasing cutting‑edge deep‑learning techniques, massive multilingual processing capacity, and innovative transfer‑learning methods.

AI competitionsAlibabaDeep Learning
0 likes · 9 min read
How Alibaba’s NLP Team Dominated Global Entity Extraction and Chinese Grammar Competitions
MaGe Linux Operations
MaGe Linux Operations
Nov 22, 2017 · Artificial Intelligence

Top 15 Python Libraries Every Data Scientist Should Master in 2017

This article surveys the most essential Python packages for data science in 2017, covering core scientific computing, data manipulation, visualization, machine learning, deep learning, natural language processing, and web scraping, and explains why each library remains indispensable for modern analysts.

Data ScienceNLPPython
0 likes · 13 min read
Top 15 Python Libraries Every Data Scientist Should Master in 2017
Efficient Ops
Efficient Ops
Nov 21, 2017 · Artificial Intelligence

How AI Transforms Efficient Operations: Insights and Practices

This article summarizes Zhao Jiunchun's conference talk on leveraging AI and machine learning—covering pattern extraction, supervised learning, loss functions, classification algorithms, NLP, and real-world Tencent ZhiYun case studies—to enhance operational efficiency, monitoring, and intelligent automation.

AINLP
0 likes · 6 min read
How AI Transforms Efficient Operations: Insights and Practices
MaGe Linux Operations
MaGe Linux Operations
Oct 19, 2017 · Artificial Intelligence

Master Python NLP with NLTK: From Installation to Advanced Tokenization

This tutorial introduces natural language processing (NLP) with Python, covering the basics of NLP, popular libraries, NLTK installation, text tokenization, stop‑word removal, stemming, lemmatization, synonym/antonym handling, and multilingual support, all illustrated with clear code examples and visual guides.

LemmatizationNLPNLTK
0 likes · 10 min read
Master Python NLP with NLTK: From Installation to Advanced Tokenization
iQIYI Technical Product Team
iQIYI Technical Product Team
Sep 22, 2017 · Artificial Intelligence

iQIYI NLP Team: Research Topics, Progress, and Applications in Video Services

The iQIYI NLP team applies lexical analysis, knowledge‑graph construction, tag recommendation, query understanding, voice‑assistant semantics, sentiment mining, and box‑office/view‑count prediction—leveraging weakly labeled data, CRF/CNN‑CRF models and deep learning—to enhance video comprehension, recommendation, search and commercial services across the platform.

NLPRecommendation SystemsSpeech Assistant
0 likes · 13 min read
iQIYI NLP Team: Research Topics, Progress, and Applications in Video Services
Qunar Tech Salon
Qunar Tech Salon
Aug 21, 2017 · Artificial Intelligence

Tourism Comment Text Mining and Recommendation System Using NLP and Big Data

This article presents a comprehensive NLP‑driven text‑mining workflow for tourism comment data, covering data cleaning, word2vec training, keyword extraction, sentiment analysis, ranking, and a lightweight architecture that enables fast, accurate recommendation of scenic spots for users.

NLPSentiment Analysismachine learning
0 likes · 5 min read
Tourism Comment Text Mining and Recommendation System Using NLP and Big Data
Qunar Tech Salon
Qunar Tech Salon
Jul 3, 2017 · Artificial Intelligence

Interview with Dr. Lv Zhengdong on Neural‑Symbolic Systems and the Future of Natural Language Understanding

Dr. Lv Zhengdong discusses the challenges of true language understanding, the integration of symbolic reasoning with neural networks, recent advances in neural‑symbolic models, and the practical prospects of NLP in domains such as law and finance, emphasizing the need for hybrid approaches.

AI researchNLPinterview
0 likes · 16 min read
Interview with Dr. Lv Zhengdong on Neural‑Symbolic Systems and the Future of Natural Language Understanding
MaGe Linux Operations
MaGe Linux Operations
Jun 27, 2017 · Artificial Intelligence

Top Python Libraries Every Data Scientist Should Master

This article reviews the most essential Python libraries for data science—including NumPy, SciPy, Pandas, Matplotlib, Seaborn, Bokeh, Plotly, Scikit‑Learn, TensorFlow, Keras, NLTK, Gensim, Scrapy and Statsmodels—highlighting their core features, GitHub activity and typical use cases.

NLPdata-sciencelibraries
0 likes · 12 min read
Top Python Libraries Every Data Scientist Should Master