Tagged articles

search relevance

21 articles · Page 1 of 1

Jan 26, 2026 · Artificial Intelligence

How We Scaled a 3.5B MoE LLM for Real‑Time Search Relevance

This article details the engineering challenges and solutions for deploying a 3.5 billion‑parameter MoE LLM in Taobao's search relevance pipeline, covering large‑batch scheduling, dynamic load balancing, intra‑batch KV‑Cache reuse, and MoE kernel tuning to meet sub‑second latency requirements.

Inference OptimizationKV cacheLLM

0 likes · 15 min read

How We Scaled a 3.5B MoE LLM for Real‑Time Search Relevance

Alimama Tech

Dec 3, 2025 · Artificial Intelligence

How LORE Transforms E‑Commerce Search Relevance with Generative AI

The article details the development and deployment of LORE, a large generative model that reshapes e‑commerce search relevance by combining knowledge injection, chain‑of‑thought reasoning, and multimodal alignment, achieving simultaneous improvements in user experience and revenue metrics.

Chain-of-ThoughtMultimodale-commerce

0 likes · 15 min read

How LORE Transforms E‑Commerce Search Relevance with Generative AI

DeWu Technology

Nov 3, 2025 · Artificial Intelligence

How Large Language Models Boost Search Relevance: A Real‑World Case Study

This article explains how a leading e‑commerce platform leveraged large language models to overcome traditional search relevance challenges, detailing the iterative workflow, model distillation, performance gains, deployment results, and future directions for smarter, more accurate product search.

AIe-commercelarge language models

0 likes · 10 min read

How Large Language Models Boost Search Relevance: A Real‑World Case Study

Xianyu Technology

Feb 22, 2023 · Artificial Intelligence

Integrating Retrieval and Generation Tasks for Deep Semantic Matching in Xianyu Search

The paper introduces SimBert, a later‑fusion model that jointly trains a dual‑tower retrieval component and an auxiliary generation task on the item tower, using a two‑stage pre‑training and fine‑tuning pipeline, which yields a 3.6% relevance boost and reduces bad‑case rates in Xianyu search.

BERTmulti-task trainingretrieval-generation

0 likes · 8 min read

Integrating Retrieval and Generation Tasks for Deep Semantic Matching in Xianyu Search

DataFunSummit

Feb 3, 2023 · Artificial Intelligence

Interactive BERT for Relevance in Health E‑commerce Search

This article presents an in‑depth exploration of an interactive BERT‑based relevance model for health e‑commerce search, detailing the business context, query and product feature extraction, domain‑specific sample generation, model architecture enhancements, offline and online performance gains, and practical deployment through knowledge distillation.

AIBERTSemantic Modeling

0 likes · 14 min read

Interactive BERT for Relevance in Health E‑commerce Search

DataFunTalk

Jan 18, 2023 · Artificial Intelligence

Search Relevance System Architecture and Practices in QQ Browser

This article presents the QQ Browser search relevance team's experience integrating QQ Browser and Sogou search systems, detailing business overview, relevance system evolution, algorithm architecture, evaluation metrics, deep semantic matching, relevance calibration, and model distillation techniques to improve search relevance performance.

Evaluation MetricsInformation Retrievalmodel distillation

0 likes · 31 min read

Search Relevance System Architecture and Practices in QQ Browser

DataFunTalk

Jan 11, 2023 · Artificial Intelligence

Exploring Interactive BERT for Relevance in Health E‑commerce Search

This article presents a comprehensive overview of Alibaba Health's interactive BERT approach for improving relevance in health e‑commerce search, covering business background, model design, domain‑specific data construction, knowledge‑distilled twin‑tower deployment, experimental results, and a detailed Q&A session.

AIBERTSemantic Modeling

0 likes · 14 min read

Exploring Interactive BERT for Relevance in Health E‑commerce Search

Tencent Cloud Developer

Jan 9, 2023 · Artificial Intelligence

Search Relevance Architecture and Practices in QQ Browser

The QQ Browser search relevance team describes a unified, billion‑scale architecture that combines a main and vertical subsystem, a pyramid‑shaped ranking pipeline (recall, coarse, fine), a dedicated GPU‑accelerated relevance service, and hybrid semantic‑matching models (dual‑tower, BERT, matrix fusion) evaluated with offline and online metrics to deliver accurate, fresh, and authoritative results for diverse content and long‑tail queries.

Evaluation MetricsInformation Retrievaldeep learning

0 likes · 28 min read

Search Relevance Architecture and Practices in QQ Browser

Meituan Technology Team

Jul 6, 2022 · Artificial Intelligence

Improving Search Relevance in PointCheck

The article details Meituan‑Dianping's search relevance pipeline, describing how multi‑similarity matrix structures, multi‑stage domain‑adaptive training, POI field summarization, and online inference optimizations together improve a BERT‑based relevance model's offline metrics and reduce the BadCase rate in production.

BERTMeituanmulti-similarity matrix

0 likes · 31 min read

Improving Search Relevance in PointCheck

Hulu Beijing

May 18, 2022 · Artificial Intelligence

How Hulu Optimizes Video Search for TV Remotes and Short Queries

This article examines Hulu's video search engine, highlighting challenges such as ensuring relevance beyond text matching, handling ultra‑short queries on TV remotes, addressing content gaps, and integrating AI‑driven query understanding, retrieval, and ranking to improve user experience.

HuluInformation Retrievalmedia streaming

0 likes · 7 min read

How Hulu Optimizes Video Search for TV Remotes and Short Queries

HaoDF Tech Team

Sep 15, 2021 · Artificial Intelligence

Optimizing Question‑Answer Search Similarity in Haodf Online: A Semantic Similarity Model Case Study

This article describes how Haodf Online improved its medical question‑answer search by analyzing search challenges, adopting semantic similarity models based on pre‑trained language embeddings, designing contrastive training tasks, and evaluating the resulting increase in click‑through rate and user engagement.

Model Optimizationmedical AInatural language processing

0 likes · 12 min read

Optimizing Question‑Answer Search Similarity in Haodf Online: A Semantic Similarity Model Case Study

Xianyu Technology

Jul 1, 2021 · Artificial Intelligence

Improving Search Relevance in Xianyu: System Design and Model Implementation

The paper describes Xianyu’s new relevance‑matching pipeline—integrating basic, text‑matching, semantic (BERT‑based dual‑tower), multimodal, and click‑graph features and fusing them with a GBDT model—which boosts search DCG@10 by 6.5 %, query satisfaction by 24 % and click interaction by over 20 % while outlining future enhancements for finer attribute matching and richer structured data.

MultimodalRankinge-commerce

0 likes · 13 min read

Improving Search Relevance in Xianyu: System Design and Model Implementation

Baidu Geek Talk

Jan 27, 2021 · Cloud Computing

Elastic Nearline Computing Architecture for Leveraging Idle Resources in Baidu's PaaS Platform

Baidu’s elastic nearline computing architecture inserts an asynchronous, resource‑adaptive layer between online and offline processing, dynamically harvesting idle CPU, GPU and Kunlun XPU capacity to pre‑compute complex recommendation and search policies, enabling peak‑shifting, valley‑filling, higher timeliness and significant business growth at low cost.

Cloud ComputingPaaS resource schedulingPeak Shaving

0 likes · 18 min read

Elastic Nearline Computing Architecture for Leveraging Idle Resources in Baidu's PaaS Platform

DataFunTalk

Jan 15, 2021 · Artificial Intelligence

Zhihu Search Text Relevance Evolution and BERT Knowledge Distillation Practices

This talk by Zhihu search algorithm engineer Shen Zhan details the evolution of text relevance models from TF‑IDF/BM25 to deep semantic matching and BERT, explains the challenges of deploying BERT at scale, and describes practical knowledge‑distillation techniques that improve both online latency and offline storage while maintaining search quality.

BERTSemantic Retrievalknowledge distillation

0 likes · 14 min read

Zhihu Search Text Relevance Evolution and BERT Knowledge Distillation Practices

58UXD

Sep 10, 2020 · Product Management

How a Fruit Store Story Reveals the Secrets of Search Recall and Precision

Using a fruit shop analogy, the article explains recall and precision metrics, illustrates their impact on recruitment search, and presents a matrix of design patterns—including cross‑database search, preset search sets, and matching labels—to boost both recall and accuracy while maintaining user experience.

Precisiondesign patternsrecall

0 likes · 14 min read

How a Fruit Store Story Reveals the Secrets of Search Recall and Precision

DataFunTalk

May 16, 2020 · Artificial Intelligence

Exploring Search Matching Models and Their Applications in DiDi Food

This article introduces the background of search relevance, reviews three common matching model types—representation‑based, interaction‑based, and hybrid—describes their architectures such as DSSM, CDSSM, DRMM and DUET, and presents experimental results of these models on DiDi Food’s search system.

DiDi FoodInformation RetrievalRanking

0 likes · 15 min read

Exploring Search Matching Models and Their Applications in DiDi Food

Didi Tech

May 15, 2020 · Artificial Intelligence

Search Matching Models and Applications in DiDi Food

The article outlines DiDi Food’s search relevance challenge, defines semantic matching versus traditional keyword methods, describes the recall‑ranking pipeline, and reviews three families of deep matching models—representation‑based (e.g., DSSM), interaction‑based (e.g., DRMM) and hybrid (e.g., DUET)—including experimental results and a recruitment notice.

DiDi FoodInformation Retrievaldeep matching

0 likes · 16 min read

Search Matching Models and Applications in DiDi Food

Alibaba Cloud Developer

Mar 31, 2020 · Artificial Intelligence

How Alibaba’s AliCoCo Knowledge Graph Revolutionizes E‑Commerce Search & Recommendation

Alibaba’s AliCoCo, a large‑scale e‑commerce cognitive concept net, models user needs as graph nodes, linking concepts, primitives, taxonomy and items, and leverages advanced NLP, BiLSTM‑CRF, projection learning and knowledge‑enhanced models to boost search relevance, recommendation diversity, and overall user experience.

Knowledge Graphe-commercenatural language processing

0 likes · 25 min read

How Alibaba’s AliCoCo Knowledge Graph Revolutionizes E‑Commerce Search & Recommendation

Amap Tech

Aug 27, 2019 · Artificial Intelligence

POI Category Tagging: Multi‑Label Classification, Feature Engineering and Model Design

The system tackles POI category tagging as a multi‑label classification problem by engineering textual and non‑textual features, mining click‑log and external samples through active learning, and deploying hierarchical and per‑tag deep textCNN models with feature fusion, achieving over 5 % accuracy gain, ten‑fold speedup, and markedly higher precision and coverage that boost map‑search relevance.

POI taggingTextCNNfeature engineering

0 likes · 19 min read

POI Category Tagging: Multi‑Label Classification, Feature Engineering and Model Design

Ctrip Technology

Jun 29, 2017 · Backend Development

Understanding Elasticsearch Scoring: Lucene Scoring Functions, Query Boosting, and Function Score Queries

This article explains how Elasticsearch computes relevance scores using Lucene's practical scoring formula, term frequency, inverse document frequency, field-length norms, and query normalization, and demonstrates query-time boosting, constant_score, function_score, decay functions, and script_score with practical DSL examples.

ElasticsearchLuceneQuery Boosting

0 likes · 14 min read

Understanding Elasticsearch Scoring: Lucene Scoring Functions, Query Boosting, and Function Score Queries

Suning Technology

Apr 18, 2017 · Artificial Intelligence

How Deep Learning Is Revolutionizing E‑Commerce Search and Chatbots

At the 2017 QCon Beijing conference, Suning’s Silicon Valley Research Institute director Jim demonstrated how deep‑learning techniques can transform e‑commerce by vectorizing product data for smarter search relevance and by combining AI models with limited labeled data to build conversational chat‑bot platforms that understand user intent.

AIChatbotdeep learning

0 likes · 5 min read