Tagged articles
3 articles
Page 1 of 1
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jun 19, 2023 · Cloud Computing

Predictable Network: Alibaba Cloud’s Ethernet Edge for Faster AI Training

This article examines the challenges of scaling AI model training beyond single-chip limits, introduces Alibaba Cloud’s Predictable Network architecture—including high‑performance Ethernet, dual‑uplink, and adaptive routing—and compares its performance, scalability, and reliability against InfiniBand, showing how Ethernet can meet AI workloads with minimal loss.

AI trainingEthernet vs InfiniBandHigh‑Performance Networking
0 likes · 27 min read
Predictable Network: Alibaba Cloud’s Ethernet Edge for Faster AI Training
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 16, 2023 · Cloud Computing

Predictable Network and High‑Performance Network Architecture for Large‑Scale AI Training

The article examines how Alibaba Cloud’s Predictable Network, InfiniBand versus Ethernet trade‑offs, and the HPN high‑performance network design together address the extreme bandwidth, latency, scalability and reliability requirements of modern large‑model AI training workloads in cloud data centers.

AI trainingHigh‑performance computingInfiniBand
0 likes · 24 min read
Predictable Network and High‑Performance Network Architecture for Large‑Scale AI Training
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Nov 16, 2022 · Cloud Computing

Inside Alibaba Cloud’s HAIL Network: Architecture, Innovations, and Future Trends

This article explores Alibaba Cloud’s HAIL data‑center network architecture, its evolution from early enterprise‑grade designs to fully self‑developed hardware, key technical features such as single‑chip design and automated operations, and the emerging trends toward higher throughput, ultra‑low latency, pooling, and predictable networking.

Alibaba CloudData Center NetworkHAIL
0 likes · 13 min read
Inside Alibaba Cloud’s HAIL Network: Architecture, Innovations, and Future Trends