Tagged articles
2 articles
Page 1 of 1
DeWu Technology
DeWu Technology
Jul 16, 2025 · Artificial Intelligence

How We Built a Scalable Offline‑Online Sequence Modeling System for Community Search

This article details the design of a community‑search pipeline that leverages long‑term user interaction sequences for CTR/CVR prediction, describes the global, online and offline architectures, enumerates the major performance and consistency challenges encountered, and presents the practical optimizations and future directions adopted to achieve reliable, high‑throughput sequence modeling.

AI OptimizationData ConsistencySequence Modeling
0 likes · 12 min read
How We Built a Scalable Offline‑Online Sequence Modeling System for Community Search
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Nov 7, 2024 · Artificial Intelligence

RTAMS-GANNS: A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighbor Search

RTAMS‑GANNS, the award‑winning real‑time adaptive multi‑stream GPU system for online approximate nearest neighbor search, eliminates costly memory allocations and serial execution by using a dynamic memory‑block insertion algorithm and separate CUDA streams, cutting latency by 40‑80% and reliably serving over 100 million daily users in production.

GPUPerformance EvaluationVector Insertion
0 likes · 19 min read
RTAMS-GANNS: A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighbor Search