Bilibili Tech
Jul 5, 2022 · Big Data
Multi‑Datacenter Architecture for Offline Big Data Processing at Bilibili
To overcome rapid data growth and on‑premise capacity limits, Bilibili adopted a scale‑out, unit‑based multi‑datacenter architecture that isolates failures, intelligently places jobs, replicates data via an enhanced DistCp service, routes reads with an IP‑aware HDFS router, and throttles cross‑site traffic, enabling stable offline big‑data processing of hundreds of petabytes while preserving throughput.
Big DataHDFSYARN
0 likes · 28 min read