Tagged articles
13 articles
Page 1 of 1
TAL Education Technology
TAL Education Technology
Jul 31, 2025 · Databases

How Milvus Powers Billion-Scale Vector Search for AI at TAL Education

This article explains how TAL Education leverages the open‑source Milvus vector database—covering its architecture, features, cloud‑native deployment, monitoring, and real‑world AI applications such as intelligent grading and multimodal search—to handle billions of vectors with millisecond‑level similarity retrieval.

AICloud NativeEducation Technology
0 likes · 14 min read
How Milvus Powers Billion-Scale Vector Search for AI at TAL Education
Alimama Tech
Alimama Tech
Dec 14, 2023 · Artificial Intelligence

AI-Driven Content Risk Control: System Evolution and Optimization at Alibaba

Alibaba Mom’s AI‑driven content risk platform has evolved from simple rule‑matching to a data‑centric, serverless architecture that integrates large‑model acceleration, decision‑tree compilation, high‑throughput vector retrieval and elastic word‑matching, delivering sub‑100 ms text and sub‑1 s image moderation while remaining stable during peak promotional traffic.

AIDevOpscontent moderation
0 likes · 25 min read
AI-Driven Content Risk Control: System Evolution and Optimization at Alibaba
Baidu Geek Talk
Baidu Geek Talk
Oct 31, 2023 · Artificial Intelligence

Interview on Baidu's Open‑Source Large‑Scale Vector Search Engine Puck

Baidu has open‑sourced its high‑performance, trillion‑scale vector search engine Puck—originally built for ultra‑large image‑search workloads, winner of multiple BIGANN categories, now supporting diverse embeddings alongside the medium‑size Tinker algorithm—to accelerate community innovation, improve code quality, and broaden AI retrieval applications across search, recommendation and cloud services.

AIANNBaidu
0 likes · 12 min read
Interview on Baidu's Open‑Source Large‑Scale Vector Search Engine Puck
High Availability Architecture
High Availability Architecture
Apr 27, 2023 · Artificial Intelligence

Design and Optimization of Bilibili's Large‑Scale Video Duplicate Detection System

This article describes the design, algorithmic improvements, and engineering performance optimizations of Bilibili's massive video duplicate detection (collision) system, covering challenges of low‑edit‑degree reposts, two‑stage retrieval, self‑supervised feature extraction, GPU‑accelerated preprocessing, and the resulting gains in accuracy and throughput.

BilibiliDeep Learningfeature extraction
0 likes · 17 min read
Design and Optimization of Bilibili's Large‑Scale Video Duplicate Detection System
Bilibili Tech
Bilibili Tech
Apr 21, 2023 · Artificial Intelligence

Design and Optimization of Bilibili's Large-Scale Video Duplicate Detection System

Bilibili built a massive video‑duplicate detection platform that trains a self‑supervised ResNet‑50 feature extractor, removes black borders, and uses a two‑stage ANN‑plus‑segment‑level matching pipeline accelerated by custom GPU decoding and inference, boosting duplicate rejection 7.5×, recall 3.75×, and cutting manual misses from 65 to 5 per day.

Deep LearningGPU Accelerationfeature extraction
0 likes · 19 min read
Design and Optimization of Bilibili's Large-Scale Video Duplicate Detection System
Alimama Tech
Alimama Tech
Mar 23, 2022 · Artificial Intelligence

Advancements in Keyword Recall for Search Advertising: From Binary Retrieval to Hierarchical Bidding Graph

The paper reports a year‑long evolution of Alibaba’s search‑advertising keyword recall, replacing the traditional two‑stage rewrite‑and‑score pipeline with a low‑storage binary retrieval model and then a joint recall framework built on a hierarchical bidding graph, delivering near‑full‑precision recall, 16× memory savings, and quota‑free global ranking.

AIbinary retrievalhierarchical bidding graph
0 likes · 22 min read
Advancements in Keyword Recall for Search Advertising: From Binary Retrieval to Hierarchical Bidding Graph
Baidu Geek Talk
Baidu Geek Talk
Feb 14, 2022 · Artificial Intelligence

How Baidu’s PUCK Dominated the First BigANN Vector Search Competition

The inaugural BigANN competition, organized by NeurIPS, showcased large‑scale ANN research, and Baidu's self‑developed PUCK algorithm secured top scores across all four tracks by leveraging multi‑layer quantization, two‑level inverted indexing, and extensive system‑level optimizations.

ANNBigANNPUCK
0 likes · 8 min read
How Baidu’s PUCK Dominated the First BigANN Vector Search Competition
Alimama Tech
Alimama Tech
Dec 8, 2021 · Artificial Intelligence

Dual Vector Foil (DVF): Decoupled Index and Model Retrieval for Large-Scale Recall

The Dual Vector Foil (DVF) system decouples index construction from model training by building a post‑training HNSW graph, enabling any complex model to score candidates, which yields a 5.7 % recall boost, cuts latency from ~40 ms to 6.5 ms, and raises QPS over tenfold while simplifying maintenance.

dual vector foilindexinglarge-scale retrieval
0 likes · 27 min read
Dual Vector Foil (DVF): Decoupled Index and Model Retrieval for Large-Scale Recall
Architecture Digest
Architecture Digest
Dec 8, 2021 · Cloud Native

Implementing Compute-Storage Separation for Large-Scale Retrieval Systems Using Fluid

This article describes the challenges of operating massive, TB‑scale retrieval clusters at Zuoyebang, and presents a Fluid‑based compute‑storage separation architecture that improves data distribution, update efficiency, scalability, and stability, enabling containerized search services to be managed like regular stateless workloads.

Compute-Storage SeparationData OrchestrationFluid
0 likes · 13 min read
Implementing Compute-Storage Separation for Large-Scale Retrieval Systems Using Fluid
58 Tech
58 Tech
Mar 3, 2021 · Artificial Intelligence

Design and Implementation of a Faiss‑Based Vector Search Platform

The article describes the design, architecture, and key components of a vector search platform built on Faiss that supports full‑index construction, incremental and distributed indexing, online retrieval, city‑level search, and vector update/delete operations to meet large‑scale AI application needs.

AIKubernetesdistributed indexing
0 likes · 10 min read
Design and Implementation of a Faiss‑Based Vector Search Platform
DataFunTalk
DataFunTalk
Dec 9, 2020 · Artificial Intelligence

WeChat Identify: From Object Detection to Large‑Scale Image Search – Technical Overview

This article details the evolution of WeChat’s Identify product, explaining its end‑to‑end image recognition pipeline—including object detection, multi‑label classification, mobile‑side detection, large‑scale retrieval, unsupervised clustering, and system architecture—while showcasing various application scenarios such as product, plant, and landmark recognition.

Computer VisionMobile AIWeChat
0 likes · 12 min read
WeChat Identify: From Object Detection to Large‑Scale Image Search – Technical Overview
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 10, 2019 · Artificial Intelligence

How Joint Optimization of Tree-Based Indexes Boosts Large-Scale Recommendation Accuracy

This article introduces JTM, a joint optimization framework that simultaneously learns deep scoring models and tree-structured indexes, addressing the limitations of traditional recommendation pipelines and demonstrating significant precision and recall gains on large-scale datasets such as Amazon Books and UserBehavior.

Deep LearningRecommendation Systemsjoint optimization
0 likes · 20 min read
How Joint Optimization of Tree-Based Indexes Boosts Large-Scale Recommendation Accuracy
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 22, 2019 · Artificial Intelligence

How Alibaba Engineers Shrank Image Features from 120KB to 13KB and Cut Euclidean Distance to 9µs

This article details how Alibaba's engineers reduced per‑image feature size from 120 KB to an average of 13 KB and accelerated Euclidean distance calculations from 50 µs to 9 µs through bitmap, offset, and duplicate‑value compression techniques, enabling efficient large‑scale image retrieval.

Performance Optimizationeuclidean distancefeature vectors
0 likes · 15 min read
How Alibaba Engineers Shrank Image Features from 120KB to 13KB and Cut Euclidean Distance to 9µs