Improving Search Relevance in PointCheck

The article details Meituan‑Dianping's search relevance pipeline, describing how multi‑similarity matrix structures, multi‑stage domain‑adaptive training, POI field summarization, and online inference optimizations together improve a BERT‑based relevance model's offline metrics and reduce the BadCase rate in production.

Meituan Technology Team
Meituan Technology Team
Meituan Technology Team
Improving Search Relevance in PointCheck

Background

Search relevance measures the match between a user query and a merchant (POI) and is critical for the Dianping app. Queries can involve merchant names, dishes, addresses, categories, or complex combinations, while POIs contain many fields (name, address, tags, dishes, etc.), leading to three main relevance problems: textual mismatch, semantic drift, and category mismatch.

Existing Relevance Techniques

Early methods relied on term‑based matching (TF‑IDF, BM25) which are fast but lack semantic understanding. Traditional semantic models include latent space approaches such as Partial Least Squares [1] and translation‑based models [2]. Deep semantic matching now uses representation‑based (e.g., DSSM [3], NRM [4], SentenceBERT [5]) and interaction‑based (e.g., ESIM [6], CEDR [8]) architectures, often built on pretrained language models.

Dianping Search Relevance Design

To address the limitations of pure representation models, Dianping adopts an interaction‑based approach built on the MT‑BERT pretrained model [11]. Three challenges are tackled:

POI Input Construction : Because POIs lack a single high‑information field like a web title, Dianping extracts a POI matching‑field summary that selects the most query‑relevant textual fields in real time (see Fig. 4).

Model Adaptation : A two‑stage training pipeline is used. Stage 1 performs continual domain‑adaptive pre‑training on click and negative‑sample data; Stage 2 fine‑tunes on manually annotated relevance data, enriched with hard‑case mining and contrastive sample generation (Fig. 5‑6).

Online Performance : Since a 12‑layer BERT model is computationally heavy, Dianping introduces caching of high‑frequency query scores, TF‑Serving deployment with the ART framework (based on Faster‑Transformer [15]), a pre‑ranking “golden rule” layer, and parallel execution with the core ranking module (Fig. 8‑9).

POI Matching‑Field Summary

The pipeline computes similarity features between the query and each POI field, selects the most informative fields, and concatenates them with the merchant name and category to form the POI side input.

Two‑Stage Training

Stage 1 uses click data to capture search‑specific semantics (e.g., “开心烧烤” vs. “高兴烧烤” are different brands). Positive samples are queries with click‑through rate above a threshold; negatives are selected via a Skip‑Above strategy and further filtered by rule‑based noise reduction. Stage 2 adds manually labeled data, focusing on hard cases such as cross‑dish matches and edge‑sample mining, and generates contrastive pairs to improve synonym generalization.

Multi‑Similarity Matrix Deep Interaction Model

Inspired by CEDR [8] and MatchPyramid [13], the model splits the encoded query and POI vectors, computes four similarity matrices (Indicator, Dot‑product, Cosine, Euclidean), fuses them, and feeds the result to a scoring layer. The Indicator matrix explicitly encodes token‑level matches (e.g., exact, split, or partial matches) to help the model distinguish high‑textual‑overlap but unrelated cases.

Practical Results

Offline Evaluation

Benchmarking on a manually labeled set shows that the base method (BERT with query + POI summary) improves negative‑example F1 by 1.84 %. Adding two‑stage training raises F1 by an additional 10.35 %, and the multi‑similarity matrix model adds another 11.14 % gain, achieving AUC = 0.96 and overall F1 = 0.97 (Table 1).

Online Impact

After deployment, the monthly BadCase rate on the first‑page results dropped by 2.9 percentage points, and NDCG increased by 2 pp. Example cases (Fig. 10) illustrate how the model correctly promotes “佩姐名品” while demoting unrelated merchants, and distinguishes “榴莲蛋糕” from a non‑relevant “榴莲千层” restaurant.

Conclusion and Outlook

The system combines POI field summarization, domain‑adaptive two‑stage training, and a multi‑similarity deep interaction architecture to achieve strong relevance performance while meeting real‑time inference constraints. Future work includes incorporating richer entity‑type priors, external knowledge, finer‑grained ranking tiers, and extending relevance modeling to non‑merchant search modules.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

BERTsearch relevanceMeituanpretrained language modelmulti-stage trainingmulti-similarity matrixonline inference optimization
Meituan Technology Team
Written by

Meituan Technology Team

Over 10,000 engineers powering China’s leading lifestyle services e‑commerce platform. Supporting hundreds of millions of consumers, millions of merchants across 2,000+ industries. This is the public channel for the tech teams behind Meituan, Dianping, Meituan Waimai, Meituan Select, and related services.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.