Tag

mGPU

0 views collected around this technical thread.

ByteDance SYS Tech
ByteDance SYS Tech
Aug 12, 2024 · Cloud Native

How mGPU Enables Efficient GPU Sharing for AI Workloads

This article explains the mGPU solution that virtualizes NVIDIA GPUs for containers, detailing its driver architecture, compute and memory isolation mechanisms, performance benchmarks on ResNet‑50 inference, and how it boosts GPU utilization by over 50% for AI and high‑performance computing tasks.

AI accelerationCloud NativeContainer Orchestration
0 likes · 10 min read
How mGPU Enables Efficient GPU Sharing for AI Workloads
ByteDance Cloud Native
ByteDance Cloud Native
Aug 9, 2023 · Cloud Native

How Volcano Engine’s New GPU Sharing Scheduler Boosts AI Workloads by 500%

This article explains Volcano Engine's next‑generation GPU sharing scheduling technology, detailing the two‑layer scheduler, card‑level bin‑pack/spread strategies, system architecture, API definitions, and optimization algorithms that together increase GPU deployment density over 500% and improve utilization by more than 50% for AI workloads.

Cloud NativeGPU schedulingKubernetes
0 likes · 13 min read
How Volcano Engine’s New GPU Sharing Scheduler Boosts AI Workloads by 500%