Vivo's Evolution of Large‑Scale Distributed Messaging Middleware Architecture and Practices
This technical presentation details Vivo's end‑to‑end big‑data architecture, the evolution from Kafka to Pulsar for massive message processing, deployment strategies, high‑availability mechanisms, observability practices, and future plans for cloud‑native, containerized messaging middleware.
Introduction: The talk covers Vivo’s next‑generation data architecture spanning data collection, ingestion, massive computation and distributed storage, focusing on the evolution of its large‑scale distributed messaging middleware.
Business background: Vivo’s data pipeline starts from end‑users, passes through a unified data integration layer, then ETL processing, feeding real‑time and offline warehouses, with Kafka/Pulsar providing high‑throughput, low‑latency, reliable services.
Unified data ingestion evolution: Describes minute‑level failover for HDFS, dual‑link disaster recovery, and the shift from Kafka to the next‑generation Pulsar (Palsar) handling ~2 trillion messages daily.
Kafka deployment analysis: Compares massive single‑cluster versus many small‑cluster deployments, discusses resource isolation, dynamic throttling, traffic and resource balancing, and identifies limitations such as low utilization and slow recovery.
Pulsar core upgrade: Highlights Pulsar’s cloud‑native design with storage‑compute separation, seamless scaling, second‑level failover, intelligent dynamic throttling, and improved client memory isolation.
High‑availability and observability: Details pre‑, during‑, and post‑incident practices, monitoring layers (infrastructure, host, service, user), alert precision, and the observability platform for latency, traffic and GC monitoring.
Future roadmap: Plans for Pulsar’s tiered storage, function computing, SQL integration, full‑link monitoring, and containerized deployment to reduce operational overhead.
DataFunTalk
Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.