Tagged articles
21 articles
Page 1 of 1
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 26, 2026 · Artificial Intelligence

How We Scaled a 3.5B MoE LLM for Real‑Time Search Relevance

This article details the engineering challenges and solutions for deploying a 3.5 billion‑parameter MoE LLM in Taobao's search relevance pipeline, covering large‑batch scheduling, dynamic load balancing, intra‑batch KV‑Cache reuse, and MoE kernel tuning to meet sub‑second latency requirements.

Inference OptimizationKV cacheLLM
0 likes · 15 min read
How We Scaled a 3.5B MoE LLM for Real‑Time Search Relevance
Alimama Tech
Alimama Tech
Dec 3, 2025 · Artificial Intelligence

How LORE Transforms E‑Commerce Search Relevance with Generative AI

The article details the development and deployment of LORE, a large generative model that reshapes e‑commerce search relevance by combining knowledge injection, chain‑of‑thought reasoning, and multimodal alignment, achieving simultaneous improvements in user experience and revenue metrics.

Model AlignmentSystem Architecturechain-of-thought
0 likes · 15 min read
How LORE Transforms E‑Commerce Search Relevance with Generative AI
DeWu Technology
DeWu Technology
Nov 3, 2025 · Artificial Intelligence

How Large Language Models Boost Search Relevance: A Real‑World Case Study

This article explains how a leading e‑commerce platform leveraged large language models to overcome traditional search relevance challenges, detailing the iterative workflow, model distillation, performance gains, deployment results, and future directions for smarter, more accurate product search.

AIe‑commercelarge language models
0 likes · 10 min read
How Large Language Models Boost Search Relevance: A Real‑World Case Study
Xianyu Technology
Xianyu Technology
Feb 22, 2023 · Artificial Intelligence

Integrating Retrieval and Generation Tasks for Deep Semantic Matching in Xianyu Search

The paper introduces SimBert, a later‑fusion model that jointly trains a dual‑tower retrieval component and an auxiliary generation task on the item tower, using a two‑stage pre‑training and fine‑tuning pipeline, which yields a 3.6% relevance boost and reduces bad‑case rates in Xianyu search.

BERTmulti-task trainingretrieval-generation
0 likes · 8 min read
Integrating Retrieval and Generation Tasks for Deep Semantic Matching in Xianyu Search
DataFunSummit
DataFunSummit
Feb 3, 2023 · Artificial Intelligence

Interactive BERT for Relevance in Health E‑commerce Search

This article presents an in‑depth exploration of an interactive BERT‑based relevance model for health e‑commerce search, detailing the business context, query and product feature extraction, domain‑specific sample generation, model architecture enhancements, offline and online performance gains, and practical deployment through knowledge distillation.

AIBERTSemantic Modeling
0 likes · 14 min read
Interactive BERT for Relevance in Health E‑commerce Search
DataFunTalk
DataFunTalk
Jan 18, 2023 · Artificial Intelligence

Search Relevance System Architecture and Practices in QQ Browser

This article presents the QQ Browser search relevance team's experience integrating QQ Browser and Sogou search systems, detailing business overview, relevance system evolution, algorithm architecture, evaluation metrics, deep semantic matching, relevance calibration, and model distillation techniques to improve search relevance performance.

Evaluation Metricsinformation retrievalmodel distillation
0 likes · 31 min read
Search Relevance System Architecture and Practices in QQ Browser
DataFunTalk
DataFunTalk
Jan 11, 2023 · Artificial Intelligence

Exploring Interactive BERT for Relevance in Health E‑commerce Search

This article presents a comprehensive overview of Alibaba Health's interactive BERT approach for improving relevance in health e‑commerce search, covering business background, model design, domain‑specific data construction, knowledge‑distilled twin‑tower deployment, experimental results, and a detailed Q&A session.

AIBERTSemantic Modeling
0 likes · 14 min read
Exploring Interactive BERT for Relevance in Health E‑commerce Search
Tencent Cloud Developer
Tencent Cloud Developer
Jan 9, 2023 · Artificial Intelligence

Search Relevance Architecture and Practices in QQ Browser

The QQ Browser search relevance team describes a unified, billion‑scale architecture that combines a main and vertical subsystem, a pyramid‑shaped ranking pipeline (recall, coarse, fine), a dedicated GPU‑accelerated relevance service, and hybrid semantic‑matching models (dual‑tower, BERT, matrix fusion) evaluated with offline and online metrics to deliver accurate, fresh, and authoritative results for diverse content and long‑tail queries.

Deep LearningEvaluation MetricsSystem Architecture
0 likes · 28 min read
Search Relevance Architecture and Practices in QQ Browser
Meituan Technology Team
Meituan Technology Team
Jul 6, 2022 · Artificial Intelligence

Improving Search Relevance in PointCheck

The article details Meituan‑Dianping's search relevance pipeline, describing how multi‑similarity matrix structures, multi‑stage domain‑adaptive training, POI field summarization, and online inference optimizations together improve a BERT‑based relevance model's offline metrics and reduce the BadCase rate in production.

BERTMeituanmulti-similarity matrix
0 likes · 31 min read
Improving Search Relevance in PointCheck
Hulu Beijing
Hulu Beijing
May 18, 2022 · Artificial Intelligence

How Hulu Optimizes Video Search for TV Remotes and Short Queries

This article examines Hulu's video search engine, highlighting challenges such as ensuring relevance beyond text matching, handling ultra‑short queries on TV remotes, addressing content gaps, and integrating AI‑driven query understanding, retrieval, and ranking to improve user experience.

HuluQuery Understandinginformation retrieval
0 likes · 7 min read
How Hulu Optimizes Video Search for TV Remotes and Short Queries
HaoDF Tech Team
HaoDF Tech Team
Sep 15, 2021 · Artificial Intelligence

Optimizing Question‑Answer Search Similarity in Haodf Online: A Semantic Similarity Model Case Study

This article describes how Haodf Online improved its medical question‑answer search by analyzing search challenges, adopting semantic similarity models based on pre‑trained language embeddings, designing contrastive training tasks, and evaluating the resulting increase in click‑through rate and user engagement.

Model Optimizationmedical-ainatural language processing
0 likes · 12 min read
Optimizing Question‑Answer Search Similarity in Haodf Online: A Semantic Similarity Model Case Study
Xianyu Technology
Xianyu Technology
Jul 1, 2021 · Artificial Intelligence

Improving Search Relevance in Xianyu: System Design and Model Implementation

The paper describes Xianyu’s new relevance‑matching pipeline—integrating basic, text‑matching, semantic (BERT‑based dual‑tower), multimodal, and click‑graph features and fusing them with a GBDT model—which boosts search DCG@10 by 6.5 %, query satisfaction by 24 % and click interaction by over 20 % while outlining future enhancements for finer attribute matching and richer structured data.

e‑commercefeature engineeringmachine learning
0 likes · 13 min read
Improving Search Relevance in Xianyu: System Design and Model Implementation
Baidu Geek Talk
Baidu Geek Talk
Jan 27, 2021 · Cloud Computing

Elastic Nearline Computing Architecture for Leveraging Idle Resources in Baidu's PaaS Platform

Baidu’s elastic nearline computing architecture inserts an asynchronous, resource‑adaptive layer between online and offline processing, dynamically harvesting idle CPU, GPU and Kunlun XPU capacity to pre‑compute complex recommendation and search policies, enabling peak‑shifting, valley‑filling, higher timeliness and significant business growth at low cost.

PaaS resource schedulingPeak Shavingcloud computing
0 likes · 18 min read
Elastic Nearline Computing Architecture for Leveraging Idle Resources in Baidu's PaaS Platform
DataFunTalk
DataFunTalk
Jan 15, 2021 · Artificial Intelligence

Zhihu Search Text Relevance Evolution and BERT Knowledge Distillation Practices

This talk by Zhihu search algorithm engineer Shen Zhan details the evolution of text relevance models from TF‑IDF/BM25 to deep semantic matching and BERT, explains the challenges of deploying BERT at scale, and describes practical knowledge‑distillation techniques that improve both online latency and offline storage while maintaining search quality.

BERTknowledge distillationmachine learning
0 likes · 14 min read
Zhihu Search Text Relevance Evolution and BERT Knowledge Distillation Practices
58UXD
58UXD
Sep 10, 2020 · Product Management

How a Fruit Store Story Reveals the Secrets of Search Recall and Precision

Using a fruit shop analogy, the article explains recall and precision metrics, illustrates their impact on recruitment search, and presents a matrix of design patterns—including cross‑database search, preset search sets, and matching labels—to boost both recall and accuracy while maintaining user experience.

Design PatternsUser experienceprecision
0 likes · 14 min read
How a Fruit Store Story Reveals the Secrets of Search Recall and Precision
DataFunTalk
DataFunTalk
May 16, 2020 · Artificial Intelligence

Exploring Search Matching Models and Their Applications in DiDi Food

This article introduces the background of search relevance, reviews three common matching model types—representation‑based, interaction‑based, and hybrid—describes their architectures such as DSSM, CDSSM, DRMM and DUET, and presents experimental results of these models on DiDi Food’s search system.

DiDi FoodNeural Networksdeep matching
0 likes · 15 min read
Exploring Search Matching Models and Their Applications in DiDi Food
Didi Tech
Didi Tech
May 15, 2020 · Artificial Intelligence

Search Matching Models and Applications in DiDi Food

The article outlines DiDi Food’s search relevance challenge, defines semantic matching versus traditional keyword methods, describes the recall‑ranking pipeline, and reviews three families of deep matching models—representation‑based (e.g., DSSM), interaction‑based (e.g., DRMM) and hybrid (e.g., DUET)—including experimental results and a recruitment notice.

DiDi Fooddeep matchinginformation retrieval
0 likes · 16 min read
Search Matching Models and Applications in DiDi Food
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 31, 2020 · Artificial Intelligence

How Alibaba’s AliCoCo Knowledge Graph Revolutionizes E‑Commerce Search & Recommendation

Alibaba’s AliCoCo, a large‑scale e‑commerce cognitive concept net, models user needs as graph nodes, linking concepts, primitives, taxonomy and items, and leverages advanced NLP, BiLSTM‑CRF, projection learning and knowledge‑enhanced models to boost search relevance, recommendation diversity, and overall user experience.

Knowledge Graphe‑commercenatural language processing
0 likes · 25 min read
How Alibaba’s AliCoCo Knowledge Graph Revolutionizes E‑Commerce Search & Recommendation
Amap Tech
Amap Tech
Aug 27, 2019 · Artificial Intelligence

POI Category Tagging: Multi‑Label Classification, Feature Engineering and Model Design

The system tackles POI category tagging as a multi‑label classification problem by engineering textual and non‑textual features, mining click‑log and external samples through active learning, and deploying hierarchical and per‑tag deep textCNN models with feature fusion, achieving over 5 % accuracy gain, ten‑fold speedup, and markedly higher precision and coverage that boost map‑search relevance.

POI taggingTextCNNfeature engineering
0 likes · 19 min read
POI Category Tagging: Multi‑Label Classification, Feature Engineering and Model Design
Ctrip Technology
Ctrip Technology
Jun 29, 2017 · Backend Development

Understanding Elasticsearch Scoring: Lucene Scoring Functions, Query Boosting, and Function Score Queries

This article explains how Elasticsearch computes relevance scores using Lucene's practical scoring formula, term frequency, inverse document frequency, field-length norms, and query normalization, and demonstrates query-time boosting, constant_score, function_score, decay functions, and script_score with practical DSL examples.

ElasticsearchQuery BoostingScoring
0 likes · 14 min read
Understanding Elasticsearch Scoring: Lucene Scoring Functions, Query Boosting, and Function Score Queries
Suning Technology
Suning Technology
Apr 18, 2017 · Artificial Intelligence

How Deep Learning Is Revolutionizing E‑Commerce Search and Chatbots

At the 2017 QCon Beijing conference, Suning’s Silicon Valley Research Institute director Jim demonstrated how deep‑learning techniques can transform e‑commerce by vectorizing product data for smarter search relevance and by combining AI models with limited labeled data to build conversational chat‑bot platforms that understand user intent.

AIChatbotDeep Learning
0 likes · 5 min read
How Deep Learning Is Revolutionizing E‑Commerce Search and Chatbots