Tagged articles

GPU Sharing

3 articles · Page 1 of 1

Mar 13, 2026 · Cloud Native

Boosting Autonomous Driving Data Pipelines with Koordinator’s ElasticQuota and GPU Sharing

This article details how a leading autonomous‑driving company tackled multi‑tenant resource contention, low GPU utilization, and distributed task dead‑locks on a heterogeneous Kubernetes cluster by adopting Koordinator’s ElasticQuota, Reservation, Gang and Device‑Share features, achieving higher allocation rates, better fairness, and significantly improved GPU efficiency.

ElasticQuotaGPU SharingKoordinator

0 likes · 20 min read

Boosting Autonomous Driving Data Pipelines with Koordinator’s ElasticQuota and GPU Sharing

ByteDance Cloud Native

Aug 12, 2024 · Cloud Native

How mGPU Enables Efficient GPU Sharing for AI Workloads in Cloud‑Native Environments

The article explains the mGPU solution from Volcano Engine, detailing its kernel‑level GPU virtualization, container runtime hooks, and scheduling mechanisms that allow multiple containers to share a single NVIDIA GPU with isolated compute and memory resources, achieving near‑lossless performance and up to 50% higher utilization for AI tasks.

AI workloadsGPU Sharingcontainer runtime

0 likes · 9 min read

How mGPU Enables Efficient GPU Sharing for AI Workloads in Cloud‑Native Environments

ByteDance SYS Tech

Aug 12, 2024 · Cloud Native

How mGPU Enables Efficient GPU Sharing for AI Workloads

This article explains the mGPU solution that virtualizes NVIDIA GPUs for containers, detailing its driver architecture, compute and memory isolation mechanisms, performance benchmarks on ResNet‑50 inference, and how it boosts GPU utilization by over 50% for AI and high‑performance computing tasks.

AI accelerationCloud NativeContainer Orchestration

0 likes · 10 min read

How mGPU Enables Efficient GPU Sharing for AI Workloads