Tag

GPU virtualization

0 views collected around this technical thread.

360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Mar 19, 2025 · Artificial Intelligence

How 360 Cloud Platform Implements GPU Passthrough and Docker+MIG for AI Workloads

This article details 360 Cloud Platform's practical implementation of GPU passthrough and Docker‑MIG solutions, covering the underlying principles, host and OpenStack configuration steps, verification methods, and future directions for full GPU virtualization.

DockerGPU passthroughGPU virtualization
0 likes · 13 min read
How 360 Cloud Platform Implements GPU Passthrough and Docker+MIG for AI Workloads
Baidu Geek Talk
Baidu Geek Talk
Aug 2, 2023 · Cloud Native

Baidu Intelligent Cloud GPU Container Virtualization 2.0: Advancements and Full-Scenario Practices

Baidu Intelligent Cloud’s GPU Container Virtualization 2.0 combines user‑mode and kernel‑mode isolation in a dual‑engine design that unifies scheduling of AI compute, rendering and encoding, supports mixed deployment and multi‑scheduler integration, and boosts GPU utilization across inference, offline tasks, autonomous‑driving simulation, and cloud‑gaming workloads.

AI workloadsContainer OrchestrationGPU virtualization
0 likes · 14 min read
Baidu Intelligent Cloud GPU Container Virtualization 2.0: Advancements and Full-Scenario Practices
DataFunSummit
DataFunSummit
Jul 1, 2023 · Artificial Intelligence

Alibaba Cloud Native Deep Learning Platform PAI‑DLC: Architecture, Features, and Future Outlook

This article introduces Alibaba Cloud's PAI‑DLC, a cloud‑native deep learning platform that integrates machine‑learning capabilities, containerized services, AI‑aware scheduling, GPU virtualization, elastic training with EasyScale, data access, and observability, and discusses its architecture, key features, and future directions.

AI PlatformGPU virtualizationKubernetes
0 likes · 16 min read
Alibaba Cloud Native Deep Learning Platform PAI‑DLC: Architecture, Features, and Future Outlook
Baidu Geek Talk
Baidu Geek Talk
Aug 31, 2022 · Artificial Intelligence

Baidu Intelligent Cloud Launches Cloud-native AI 2.0 to Accelerate AI Engineering

Baidu Intelligent Cloud’s new Cloud‑native AI 2.0 platform tackles AI engineering bottlenecks by offering hybrid‑parallel large‑model training, flexible GPU virtualization, and an AI Accelerate Kit that boosts training efficiency over 50 % and cuts inference latency up to 63 %, raising GPU utilization from ~13 % to about 50 %.

AIAI accelerationGPU virtualization
0 likes · 15 min read
Baidu Intelligent Cloud Launches Cloud-native AI 2.0 to Accelerate AI Engineering
Baidu Geek Talk
Baidu Geek Talk
Jul 18, 2022 · Artificial Intelligence

GPU Container Virtualization for AI Heterogeneous Computing: Architecture and Best Practices

The article surveys GPU container virtualization for AI heterogeneous computing, detailing utilization challenges, historical architectures, various virtualization methods, Baidu's dual-engine user- and kernel-space design with isolation and scheduling features, performance benefits, best‑practice scenarios, and deployment guidance, concluding with a technical Q&A.

AI computingContainerizationGPU virtualization
0 likes · 30 min read
GPU Container Virtualization for AI Heterogeneous Computing: Architecture and Best Practices
DataFunSummit
DataFunSummit
Jun 30, 2022 · Artificial Intelligence

MLOps Practices on the Beike Inference Platform: Architecture, Evolution, and Future Plans

This article presents a comprehensive overview of Beike's machine learning platform and its inference service, detailing the platform's architecture, GPU virtualization, cloud‑native migration, MLOps implementation, and future roadmap to achieve cost‑effective, automated AI model deployment at scale.

AIGPU virtualizationInference Platform
0 likes · 13 min read
MLOps Practices on the Beike Inference Platform: Architecture, Evolution, and Future Plans
DataFunTalk
DataFunTalk
Jun 13, 2021 · Artificial Intelligence

GPU Virtual Sharing for AI Inference Services on Kubernetes

The article presents a GPU virtual‑sharing solution for AI inference workloads that isolates memory and compute resources via CUDA API interception, integrates with Kubernetes using the open‑source aliyun‑gpushare scheduler, and demonstrates doubled GPU utilization and minimal performance loss across multiple tests.

CUDAGPU virtualizationKubernetes
0 likes · 16 min read
GPU Virtual Sharing for AI Inference Services on Kubernetes
iQIYI Technical Product Team
iQIYI Technical Product Team
May 28, 2021 · Artificial Intelligence

iQIYI GPU Virtual Sharing for AI Inference: Architecture, Isolation, and Scheduling

iQIYI created a custom GPU‑virtual‑sharing system that intercepts CUDA calls to enforce per‑container memory limits, rewrites kernel launches for compute isolation, and integrates with a Kubernetes scheduler extender, allowing multiple AI inference containers to share a single V100 with minimal overhead and more than doubling overall GPU utilization.

AI inferenceCUDAGPU virtualization
0 likes · 16 min read
iQIYI GPU Virtual Sharing for AI Inference: Architecture, Isolation, and Scheduling
58 Tech
58 Tech
Oct 28, 2020 · Artificial Intelligence

Optimizing Resource Utilization of 58.com Deep Learning Platform: Practices and Techniques

This article details how 58.com’s end‑to‑end deep‑learning platform was optimized for higher CPU and GPU inference performance using Intel MKL, OpenVINO, mixed TensorFlow deployment, GPU virtualization, and a Prometheus‑Grafana monitoring system, achieving a 37% reduction in GPU usage and a 146% increase in average GPU utilization.

GPU virtualizationIntel MKLKubernetes
0 likes · 12 min read
Optimizing Resource Utilization of 58.com Deep Learning Platform: Practices and Techniques
Architects' Tech Alliance
Architects' Tech Alliance
Mar 5, 2019 · Cloud Computing

Comprehensive Overview of Server Virtualization Technologies

This article provides an in‑depth technical overview of server virtualization, covering its historical evolution, CPU, memory, I/O and GPU virtualization techniques, hardware‑assisted extensions such as VT‑x/VT‑d/VT‑c, and the classification of virtualization architectures for modern cloud environments.

CPU virtualizationGPU virtualizationI/O virtualization
0 likes · 11 min read
Comprehensive Overview of Server Virtualization Technologies
Architects' Tech Alliance
Architects' Tech Alliance
Aug 9, 2017 · Fundamentals

Understanding NVIDIA GRID vGPU Virtualization and Its Allocation Modes

This article explains NVIDIA GRID vGPU virtualization, detailing how GPUs are partitioned by memory size, the supported hypervisors, the operation of virtual GPU resources, differences between full‑allocation vGPU and GPU pass‑through, licensing requirements, and performance considerations for cloud and data‑center environments.

GPU virtualizationNvidiagrid
0 likes · 10 min read
Understanding NVIDIA GRID vGPU Virtualization and Its Allocation Modes
Architects' Tech Alliance
Architects' Tech Alliance
Sep 26, 2016 · Cloud Computing

Comprehensive Overview of Server Virtualization Technologies

This article provides a detailed technical overview of server virtualization, covering its historical roots, CPU, memory, I/O and GPU virtualization techniques, hardware-assisted extensions, and various hypervisor architectures, highlighting why virtualization remains essential in modern cloud computing environments.

CPU virtualizationGPU virtualizationI/O virtualization
0 likes · 12 min read
Comprehensive Overview of Server Virtualization Technologies