Tagged articles
2 articles
Architects' Tech Alliance
Feb 3, 2019 · Fundamentals

Understanding GPUDirect RDMA: Principles, Implementation, and Performance

This article explains the background of GPU communication, introduces DMA and RDMA fundamentals, describes how GPUDirect RDMA enables direct GPU-to-GPU memory access across machines, and presents performance results showing reduced latency and increased bandwidth for distributed deep‑learning training.

Deep Learning · GPU communication · GPUDirect
7 min read
Architects' Tech Alliance
Feb 1, 2019 · Industry Insights

How GPUDirect P2P Boosts Multi‑GPU Performance and What Limits It in Virtualized Environments

This article explains the background of GPU communication, details NVIDIA's GPUDirect and its Peer‑to‑Peer features, discusses virtualization challenges, and presents performance measurements on an Alibaba Cloud GN5 instance showing latency reduction and near‑linear scaling for deep‑learning workloads.

Deep Learning · GPU communication · GPUDirect
6 min read