Baidu Tech Salon
Nov 10, 2023 · Artificial Intelligence
Baidu Search Deep Learning Model Architecture and Optimization Practices
Baidu's Search Architecture team describes how its deep-learning models have evolved to return direct answers using semantic embeddings. The talk covers the large-scale online inference pipeline, which rewrites queries, ranks relevance, and classifies types, and outlines the optimizations used to achieve high-throughput, low-latency search: data I/O, CPU/GPU balancing, pruning, quantization, and distillation.
Baidu · GPU optimization · Inference System