Deep Learning for Text Matching and Ranking at Meituan
Meituan leverages deep‑learning models such as Word2Vec, DSSM, and LSTM‑based encoders within its ClickNet framework to compute text similarity and rank results, integrating rich business features like user location and merchant rating, thereby surpassing traditional TF‑IDF, BM25, and XGBoost approaches and boosting click‑through rates and revenue.
Meituan has deployed AI technologies across its services, including search, recommendation, advertising, risk control, scheduling, speech recognition, robotics, and delivery.
To illustrate the construction of its AI “brain”, a series of articles titled “AI in Meituan” are released, and a book “Meituan Machine Learning Practice” is forthcoming.
Background
Recent advances in deep learning have driven research in text processing. Text tasks can be divided into four dimensions: word‑level, sentence‑level, document‑level, and system‑level applications.
Word‑level: sequence labeling with CRF, Bi‑LSTM+CRF, etc.
Sentence‑level: parsers, Shift‑Reduce, Seq2Seq generation.
Document‑level: sentiment analysis with CNN, reading comprehension with RNN.
System‑level: information retrieval, machine translation, dialogue systems.
Deep‑learning‑based Text Matching
Text matching computes similarity between two pieces of text and is used in search, advertising, suggestion, deduplication, etc.
Traditional vector‑space models represent documents as high‑dimensional TF‑IDF vectors and use similarity measures such as Cosine, BM25, etc.
Matrix‑factorization (LSA) reduces dimensionality and captures latent semantics, but struggles with polysemy.
Probabilistic topic models (pLSA, LDA) introduce latent topics to improve semantic representation, with LDA employing Dirichlet priors and inference via Variational EM or Gibbs sampling.
Since 2013, deep learning models such as Word2Vec (CBOW, Skip‑Gram), DSSM, CDSSM, and LSTM‑DSSM have become dominant for semantic matching. Meituan’s ClickNet framework combines DNN/CNN/RNN encoders with business‑specific features to predict click similarity.
Deep‑learning‑based Ranking Models
Ranking selects a small subset of candidates for display in search, recommendation, or advertising. Three classic paradigms are Pointwise, Pairwise, and Listwise.
Pairwise RankNet (2005) uses a neural network to predict pairwise preferences; later extensions include LambdaRank.
Wide&Deep (Google) jointly learns from sparse “wide” features and dense “deep” embeddings.
YouTube DNN ranking model feeds heterogeneous features (user, video, context) through embeddings and fully‑connected layers.
Meituan adapts these architectures (ClickNet‑v1, ClickNet‑v2) to its O2O scenario, emphasizing business features such as user location, merchant rating, and click‑weighting.
Experimental results show ClickNet outperforms linear models and XGBoost on classification tasks (e.g., Higgs dataset) and improves CTR and revenue in production.
Conclusion
Deep learning’s strong fitting ability and reduced reliance on handcrafted features have made it pervasive in text‑related tasks at Meituan, from semantic matching to ranking, with ongoing research in sentiment analysis, dialogue, summarization, and keyword generation.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
Meituan Technology Team
Over 10,000 engineers powering China’s leading lifestyle services e‑commerce platform. Supporting hundreds of millions of consumers, millions of merchants across 2,000+ industries. This is the public channel for the tech teams behind Meituan, Dianping, Meituan Waimai, Meituan Select, and related services.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
