Smart Era Software Development
Nov 3, 2023 · Operations
Inside Bilibili’s Kafka: Challenges, Guardian Federation, and Future Automation
The article details how Bilibili operates more than 1,000 Kafka nodes across 20+ clusters, outlines the scalability and stability challenges the team faced, and explains the design and implementation of its self‑built Guardian federation controller, partition‑level throttling, automatic balancing, multi‑tenant isolation, tiered storage, auditing, and automated ops workflows.
Automation · Cluster Governance · Multi-tenant Isolation
21 min read
