Tag

cluster networking

0 views collected around this technical thread.

Architects' Tech Alliance
Architects' Tech Alliance
Jul 7, 2024 · Operations

Designing High‑Performance Cluster Networks for AI Large Models: InfiniBand vs RoCE

The article analyzes the networking challenges of AI super‑large models, comparing InfiniBand and RoCE technologies, and presents design guidelines for ultra‑scale, high‑bandwidth, low‑latency, and highly stable cluster interconnects to maximize GPU utilization and overall training efficiency.

GPU interconnectHigh Performance ComputingInfiniBand
0 likes · 14 min read
Designing High‑Performance Cluster Networks for AI Large Models: InfiniBand vs RoCE
Architects' Tech Alliance
Architects' Tech Alliance
Jun 17, 2019 · Fundamentals

Overview of IBM GPFS Architecture, Components, and Building‑Block Design

This article provides a comprehensive technical overview of IBM GPFS (General Parallel File System), detailing its core components, cluster management roles, networking models, and best‑practice building‑block configurations for high‑performance computing environments.

Building BlockGPFSHPC
0 likes · 14 min read
Overview of IBM GPFS Architecture, Components, and Building‑Block Design