Mixed Workload Co-location Practices in Bilibili's Kubernetes Cloud Platform
Bilibili’s Kubernetes cloud platform boosts server utilization by co‑locating latency‑sensitive online services with batch‑oriented offline jobs on the same nodes, using custom schedulers, extended resources, dynamic CPU/memory isolation, and a management console, achieving average CPU usage around 35 % and significant cost savings.