Cloud Native 4 min read

How Dragonfly Accelerates Image Distribution and Scales Kubernetes Batch Processing

At KubeCon+CloudNativeCon 2023 in Amsterdam, Volcano Engine and ByteDance presented two technical sessions covering Dragonfly's P2P image distribution best practices and large‑scale Kubernetes batch processing strategies, offering deep insights and real‑world implementations for cloud‑native developers.

ByteDance Cloud Native
ByteDance Cloud Native
ByteDance Cloud Native
How Dragonfly Accelerates Image Distribution and Scales Kubernetes Batch Processing

From April 18‑21, 2023, CNCF held KubeCon+CloudNativeCon in Amsterdam, gathering leading open‑source communities and cloud‑native experts.

Volcano Engine’s cloud‑native team and ByteDance’s orchestration team presented two technical sessions.

Best Practices for Accelerated Image Distribution Using Dragonfly

Speakers: Wenbo Qi (Ant Financial/Dragonfly community) and Yingyang Huang (Volcano Engine/Dragonfly community).

Dragonfly is a P2P‑based image and file distribution system. The talk covered Dragonfly & Nydus architecture, design, how Dragonfly speeds model distribution in machine‑learning inference engines, best practices for image acceleration on Volcano Engine, download‑time metrics, and integration with Harbor, Nydus and other ecosystem components.

Kubernetes Batch Processing at Scale – A Scheduling Perspective

Speakers: Lim Haw Jia and Fan Deliang (ByteDance).

The session examined challenges of running massive offline workloads on native Kubernetes—diverse pod types, unified scheduling constraints, and scalability. It shared ByteDance’s experience handling nearly a million daily offline tasks, reasons and benefits of Kubernetes‑hosted batch processing, techniques such as gang scheduling and DRF, parallelizing compute‑intensive scheduling components, and the resulting improvements in resource utilization and cost.

Related open‑source project: https://github.com/kubewharf

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Batch ProcessingDragonflyImage Distribution
ByteDance Cloud Native
Written by

ByteDance Cloud Native

Sharing ByteDance's cloud-native technologies, technical practices, and developer events.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.