Backend Development 15 min read

Design and Optimization of Baidu Short Video Push System

This article presents a comprehensive overview of Baidu's short‑video Push system, detailing its architecture, core modules, data flows, and successive optimizations such as personalized send‑time estimation, user‑group management, frequency‑control redesign, and protobuf compression, illustrating how these improvements enhance scalability, reliability, and resource efficiency.

Architect
Architect
Architect
Design and Optimization of Baidu Short Video Push System

The article introduces the short‑video Push system used by multiple Baidu apps (e.g., Haokan Video, Live, Du Xiao Shi) and explains its purpose: delivering personalized, operational, and real‑time notifications to billions of active users to boost engagement and retention.

Background : In an era of information overload, Push technology shifts from pull‑based retrieval to proactive delivery, making timely, user‑specific notifications essential for mobile applications.

Key Terminology : definitions of Message Push, Personalized Push, Operational Push, and Real‑time Push are provided to clarify the different push scenarios.

System Overview : The architecture consists of a material center, user center, personalized recall service, realtime‑API, frequency‑control (UFC) service, pre‑processing service, sending service, receipt service, and a control center (PCC). Data flows from client SDKs through user and material centers, into recommendation engines, task creation, pre‑processing, and finally delivery via cloud Push middle‑platform to vendor agents or long‑link services.

Core Modules are described in detail, highlighting their responsibilities and interactions.

Data Flow : User behavior logs are collected, transformed into tables, used by strategies to generate material and user sets, which are then recalled, packaged into tasks, sent to the Push middle‑platform, and finally delivered to devices. Receipt data is fed back for monitoring and model training.

Iterative Optimizations :

1) Personalized first/last push time estimation based on user activity windows to improve click‑through rates.

2) User‑group service redesign: replacing physical‑machine‑based user packages with bitmap‑based user groups managed via the AMIS platform, enabling logical expressions, efficient bitmap operations, and reduced memory usage.

3) Frequency‑control service overhaul: moving from static hash‑mod allocation to a dynamically scalable consistent‑hash architecture, adding whitelist handling, and supporting fine‑grained time‑slot controls.

4) Data compression using Google Protocol Buffers, which reduces payload size to about 25% of JSON and speeds up serialization, saving ~75% Redis memory for frequency‑control data.

Conclusion : Push remains a low‑cost, high‑impact channel for mobile product operations. Effective system design—covering architecture, personalization, frequency control, and efficient serialization—directly influences user experience, engagement, and operational scalability.

Backendpersonalizationdistributed architectureProtobufpush systemfrequency control
Architect
Written by

Architect

Professional architect sharing high‑quality architecture insights. Topics include high‑availability, high‑performance, high‑stability architectures, big data, machine learning, Java, system and distributed architecture, AI, and practical large‑scale architecture case studies. Open to ideas‑driven architects who enjoy sharing and learning.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.