Tag: search architecture


Baidu Tech Salon
Jun 17, 2025 · Operations

How Baidu Scaled Its Vertical Search: Elastic Scheduling and Data Management Secrets

This article explains how Baidu's vertical search platform tackled massive data growth and scaling challenges by redesigning its data management system, introducing elastic scheduling, decoupling ETCD access, implementing auto‑scaling, and advancing shard expansion to improve performance, stability, and cost efficiency.

ETCD · Sharding · auto scaling
18 min read
Bilibili Tech
Jan 7, 2025 · Cloud Native

Design and Implementation of Bilibili's Large-Scale Recall System

Bilibili’s large‑scale recall system separates online processing into a two‑tier merge service and an index service, supports multi‑channel text, item‑to‑item and vector indexes with real‑time updates, uses horizontal sharding, robust CI/CD, monitoring and degradation mechanisms, and is being extended toward model‑based recall and greater automation.

Bilibili · Vector Search · cloud native
16 min read
Baidu Geek Talk
Dec 16, 2024 · Artificial Intelligence

AIAPI: Baidu's AI-Native Retrieval System for Large Language Model Applications

AIAPI, Baidu’s AI‑native retrieval platform for large language models, tackles hallucination, slow domain updates, and output opacity by delivering authoritative, timely, full‑content data. Its dual‑channel architecture combines traditional search and RAG and employs reusable ranking, graph‑enhanced data layers, dynamic caching that cuts storage by 70%, and QueryPlan‑based QoS, achieving markedly higher retrieval quality and a 34% speed gain with Wenxin 4.0.

AI-Native Systems · AIAPI · Cost Optimization
12 min read
Rare Earth Juejin Tech Community
Jul 18, 2024 · Backend Development

Implementing Full‑Text Document Search with Elasticsearch and Milvus

This article describes how to combine Elasticsearch’s keyword matching with Milvus’s vector‑based semantic search to build a scalable document search service, covering data preprocessing, architecture, query handling, custom scoring, DSL configuration, and result merging.

Elasticsearch · Full-Text Search · Milvus
12 min read
Baidu Geek Talk
Jan 8, 2024 · Backend Development

Exgraph: A Graph Execution Engine for Task Orchestration

Exgraph, Baidu Search’s graph execution engine, uses a human‑readable description language and a robust execution core with dependency injection, object pooling, and interruption handling to orchestrate complex, parallel or conditional tasks, improving code readability and unifying diverse execution scenarios in search architecture.

DAG · Go development · Object Pooling
10 min read
Baidu Geek Talk
Nov 9, 2023 · Artificial Intelligence

Deep Learning Model Architecture Evolution in Baidu Search

The article chronicles how Baidu Search’s Model Architecture Group evolved deep‑learning‑driven search, detailing the shift from inverted‑index to semantic vector indexing, the use of transformer‑based models for text and image queries, large‑scale offline/online pipelines, and extensive GPU‑centric optimizations such as pruning, quantization, and distillation, all aimed at delivering precise, cost‑effective results to hundreds of millions of users.

ERNIE · GPU inference · Model Distillation
14 min read
Architect
Apr 25, 2022 · Cloud Native

Designing a Cloud‑Native Intelligent Data Architecture for Baidu Search Platform

This article presents a cloud‑native redesign of Baidu's search middle‑platform that introduces intelligent data management, elastic scaling, on‑demand resource allocation, precise fan‑out, and localized computation to address efficiency, cost, stability, and performance challenges of large‑scale search workloads.

cloud native · data management · elastic scaling
14 min read
Baidu Geek Talk
Dec 15, 2021 · Cloud Native

Cloud-Native Intelligent Data Management Architecture for Baidu Search Platform

A cloud‑native redesign of Baidu's search middle platform introduces partition, shard, replica, and addressing controllers that enable elastic scaling, on‑demand resource allocation, precise fan‑out, and localized computation, reducing capacity adjustment time from weeks to hours, cutting costs by 30‑80%, raising availability above 99.9%, and halving query latency.

Big Data · cloud-native · data management
17 min read
Xianyu Technology
Mar 18, 2021 · Backend Development

Multi-Engine Concurrent Search Architecture for Idlefish

Idlefish’s new multi‑engine concurrent search architecture replaces the tightly‑coupled single‑engine pipeline with deep engine isolation, asynchronous multi‑engine recall, and unified result merging. It cuts dump build time from 14 h to 5 h and shrinks memory use dramatically while changing latency by only ~15 ms, and it boosts exposure by 50% and orders by 33%.

Big Data · Java · Lua
10 min read
iQIYI Technical Product Team
Jun 19, 2020 · Artificial Intelligence

Emoji Search at iQIYI Douya: From ElasticSearch to Lucene and Semantic Retrieval

iQIYI Douya’s emoji search evolved from Elasticsearch to a pure Lucene implementation and added semantic vector retrieval, enabling fast, scalable, and more accurate text‑based search of AI‑generated images for small‑to‑medium businesses by combining custom tokenization, dense embeddings, and hybrid ranking.

Elasticsearch · Image Search · Lucene
14 min read
Architecture Digest
May 8, 2020 · Big Data

Elasticsearch Adoption Cases in Chinese Companies: JD.com, Ctrip, Qunar, 58.com, Didi and More

This article surveys how major Chinese internet companies such as JD.com, Ctrip, Qunar, 58.com and Didi have adopted Elasticsearch and the Elastic Stack for high‑volume order queries, log analysis, real‑time monitoring, security analytics, and large‑scale distributed search, describing their architecture evolution, shard strategies, and operational practices.

Big Data · Elasticsearch · Log Analytics
16 min read
Fangduoduo Tech
May 25, 2019 · Backend Development

How Fangdd Scales Real‑Estate Search with Elasticsearch: Architecture & Lessons

This article explains how Fangdd leverages Elasticsearch to boost search performance across consumer, broker, and internal products, detailing a platformized architecture that separates indexing and querying, addresses operational challenges, and outlines design patterns for index management and incremental updates.

Elasticsearch · Indexing · backend development
12 min read
Youzan Coder
Aug 31, 2018 · Big Data

Evolution of Youzan Search Platform Architecture: From 1.0 to 4.0

The Youzan Search Platform evolved from a simple Elasticsearch cluster in 2015 to a modular, message‑driven architecture with proxy validation, caching, and management tools, and now plans a cloud‑native, Kubernetes‑based 4.0 version that automates data sync, isolates workloads, and scales elastically to support billions of records.

Big Data · Elasticsearch · Proxy
14 min read
Architecture Digest
Apr 2, 2017 · Artificial Intelligence

Mogujie's Search System Architecture and Online Request Flow

This article introduces Mogujie's end‑to‑end search system architecture, detailing its online and offline components such as Topn, ABTest, QR, fine‑ranking, search engine, UPS, and feature platforms, and then walks through a real‑world online request example to illustrate how queries are processed, rewritten, personalized, and finally ranked.

Mogujie · Ranking · online flow
11 min read