Tagged articles

GPUDirect

2 articles

Feb 3, 2019 · Fundamentals

Understanding GPUDirect RDMA: Principles, Implementation, and Performance

This article explains the background of GPU communication, introduces DMA and RDMA fundamentals, describes how GPUDirect RDMA enables direct GPU-to-GPU memory access across machines, and presents performance results showing reduced latency and increased bandwidth for distributed deep‑learning training.

GPU communication · GPUDirect · InfiniBand

7 min read

Architects' Tech Alliance

Feb 1, 2019 · Industry Insights

How GPUDirect P2P Boosts Multi‑GPU Performance and What Limits It in Virtualized Environments

This article explains the background of GPU communication, details NVIDIA's GPUDirect and its Peer‑to‑Peer features, discusses virtualization challenges, and presents performance measurements on an Alibaba Cloud GN5 instance showing latency reduction and near‑linear scaling for deep‑learning workloads.

GPU communication · GPUDirect · NVLink

6 min read
