Artificial Intelligence 27 min read

Deep Learning for Text Matching and Ranking at Meituan

Meituan leverages deep‑learning models such as Word2Vec, DSSM, and LSTM‑based encoders within its ClickNet framework to compute text similarity and rank results, integrating rich business features like user location and merchant rating, thereby surpassing traditional TF‑IDF, BM25, and XGBoost approaches and boosting click‑through rates and revenue.

Meituan Technology Team

Jun 21, 2018

Deep Learning for Text Matching and Ranking at Meituan

Meituan has deployed AI technologies across its services, including search, recommendation, advertising, risk control, scheduling, speech recognition, robotics, and delivery.

To illustrate the construction of its AI “brain”, a series of articles titled “AI in Meituan” are released, and a book “Meituan Machine Learning Practice” is forthcoming.

Background

Recent advances in deep learning have driven research in text processing. Text tasks can be divided into four dimensions: word‑level, sentence‑level, document‑level, and system‑level applications.

Word‑level: sequence labeling with CRF, Bi‑LSTM+CRF, etc.

Sentence‑level: parsers, Shift‑Reduce, Seq2Seq generation.

Document‑level: sentiment analysis with CNN, reading comprehension with RNN.

System‑level: information retrieval, machine translation, dialogue systems.

Deep‑learning‑based Text Matching

Text matching computes similarity between two pieces of text and is used in search, advertising, suggestion, deduplication, etc.

Traditional vector‑space models represent documents as high‑dimensional TF‑IDF vectors and use similarity measures such as Cosine, BM25, etc.

Matrix‑factorization (LSA) reduces dimensionality and captures latent semantics, but struggles with polysemy.

Probabilistic topic models (pLSA, LDA) introduce latent topics to improve semantic representation, with LDA employing Dirichlet priors and inference via Variational EM or Gibbs sampling.

Since 2013, deep learning models such as Word2Vec (CBOW, Skip‑Gram), DSSM, CDSSM, and LSTM‑DSSM have become dominant for semantic matching. Meituan’s ClickNet framework combines DNN/CNN/RNN encoders with business‑specific features to predict click similarity.

Deep‑learning‑based Ranking Models

Ranking selects a small subset of candidates for display in search, recommendation, or advertising. Three classic paradigms are Pointwise, Pairwise, and Listwise.

Pairwise RankNet (2005) uses a neural network to predict pairwise preferences; later extensions include LambdaRank.

Wide&Deep (Google) jointly learns from sparse “wide” features and dense “deep” embeddings.

YouTube DNN ranking model feeds heterogeneous features (user, video, context) through embeddings and fully‑connected layers.

Meituan adapts these architectures (ClickNet‑v1, ClickNet‑v2) to its O2O scenario, emphasizing business features such as user location, merchant rating, and click‑weighting.

Experimental results show ClickNet outperforms linear models and XGBoost on classification tasks (e.g., Higgs dataset) and improves CTR and revenue in production.

Conclusion

Deep learning’s strong fitting ability and reduced reliance on handcrafted features have made it pervasive in text‑related tasks at Meituan, from semantic matching to ranking, with ongoing research in sentiment analysis, dialogue, summarization, and keyword generation.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

AI deep learning semantic search text matching Ranking Models

Written by

Meituan Technology Team

Over 10,000 engineers powering China’s leading lifestyle services e‑commerce platform. Supporting hundreds of millions of consumers, millions of merchants across 2,000+ industries. This is the public channel for the tech teams behind Meituan, Dianping, Meituan Waimai, Meituan Select, and related services.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.