How Suning Leverages Query Logs to Auto‑Discover Product Synonyms

Suning’s search team automatically extracts domain‑specific synonym pairs from massive query‑click logs using candidate extraction, multi‑feature similarity calculations, and Apriori pattern mining, dramatically improving e‑commerce search recall and user experience.

Suning Technology
Suning Technology
Suning Technology
How Suning Leverages Query Logs to Auto‑Discover Product Synonyms

In the e‑commerce domain, users express the same product with varied terms, leading to recall gaps in search. Suning’s search R&D team mined massive product and query‑click logs to automatically discover domain‑specific synonyms, improving retrieval and data management.

Data Source and Candidate Extraction

The system relies on query‑click logs, which capture diverse user‑entered keywords. After cleaning noise, candidate terms of length 2‑4 characters are extracted from queries and filtered by frequency thresholds.

Computing Synonym Relations

Three similarity features are used:

Similarity of clicked product vectors.

Similarity of surrounding character (context) vectors.

Similarity of occurrence patterns.

Each feature yields a cosine‑based similarity score; the final score is a weighted sum.

Click‑Vector Similarity

Product click vectors are built from user clicks and compared using cosine distance.

Context‑Vector Similarity

For each candidate term, a character count vector is created from the five characters before and after each occurrence in queries; cosine similarity measures contextual closeness.

Pattern‑Based Similarity

Query terms are tokenized, sorted, and grouped by product ID and term count. Apriori frequent itemset mining extracts K‑1 itemsets; patterns are matched to find interchangeable terms, incrementing similarity counts.

Post‑Processing

After filtering by similarity thresholds, further cleaning removes incomplete terms and overlapping pairs to reduce noise.

Through this multi‑feature approach combining cosine similarity and Apriori mining on large‑scale query and click data, Suning generated tens of thousands of high‑quality synonym pairs, far exceeding manual knowledge and significantly enhancing search experience.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

e-commercedata miningSearch Optimizationsimilarity algorithmsquery logssynonym mining
Suning Technology
Written by

Suning Technology

Official Suning Technology account. Explains cutting-edge retail technology and shares Suning's tech practices.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.