Tagged articles
8 articles
Page 1 of 1
Open Source Linux
Open Source Linux
Jul 11, 2024 · Operations

Why Traditional ECMP Fails for AI Workloads and How Modern Load‑Balancing Solves It

The article examines the rapid growth of AI‑driven compute demand, explains why conventional ECMP load balancing struggles with uneven, high‑bandwidth flows in data‑center networks, and compares advanced strategies such as Fat‑Tree design, VoQ, flow‑based, packet‑based, flowlet, and cell‑based approaches, including vendor implementations.

AI workloadsData Center NetworkECMP
0 likes · 13 min read
Why Traditional ECMP Fails for AI Workloads and How Modern Load‑Balancing Solves It
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Nov 16, 2022 · Cloud Computing

Inside Alibaba Cloud’s HAIL Network: Architecture, Innovations, and Future Trends

This article explores Alibaba Cloud’s HAIL data‑center network architecture, its evolution from early enterprise‑grade designs to fully self‑developed hardware, key technical features such as single‑chip design and automated operations, and the emerging trends toward higher throughput, ultra‑low latency, pooling, and predictable networking.

Alibaba CloudData Center NetworkHAIL
0 likes · 13 min read
Inside Alibaba Cloud’s HAIL Network: Architecture, Innovations, and Future Trends
IT Architects Alliance
IT Architects Alliance
May 23, 2022 · Industry Insights

Why RDMA Is Replacing TCP/IP for AI and High‑Performance Storage

The article analyzes how the AI boom and high‑performance SSD storage demand sub‑microsecond latency, exposing TCP/IP’s inherent context‑switch and CPU overhead, and explains why RDMA’s kernel‑bypass, zero‑copy design and 1 µs latency make it the preferred network stack for modern data‑center workloads despite challenges in Ethernet deployment.

AI computingData Center NetworkLow latency
0 likes · 11 min read
Why RDMA Is Replacing TCP/IP for AI and High‑Performance Storage
Architects' Tech Alliance
Architects' Tech Alliance
Mar 2, 2022 · Cloud Computing

Bus-Level Data Center Network Technology: RDMA Acceleration and Ultra-Low Latency Innovations

The article examines bus‑level data center network technologies, detailing how RDMA and ultra‑low‑latency forwarding mechanisms reduce end‑to‑end delays, enable high‑performance computing and AI workloads, and drive the evolution toward hyper‑converged, cloud‑native infrastructures.

Data Center NetworkHigh‑performance computingRDMA
0 likes · 14 min read
Bus-Level Data Center Network Technology: RDMA Acceleration and Ultra-Low Latency Innovations
Architects' Tech Alliance
Architects' Tech Alliance
Sep 7, 2021 · Fundamentals

Understanding Fat-Tree (CLOS) Network Architecture for Data Centers

The article explains the Fat-Tree (CLOS) network topology introduced in 2008, describing its non‑convergent bandwidth design, three‑layer structure, practical benefits, common configurations, and limitations, while also providing references and visual illustrations of the architecture.

CLOSData Center NetworkFat-Tree
0 likes · 7 min read
Understanding Fat-Tree (CLOS) Network Architecture for Data Centers
Architects Research Society
Architects Research Society
Jun 3, 2020 · Fundamentals

Typical Two‑Layer Spine‑Leaf Topology and Its Scalability

The article explains the classic two‑layer spine‑leaf (Clos) architecture, describing how each leaf switch connects to every spine switch, how oversubscription is handled by adding spines or leaves, and why the design offers predictable latency and easy scalability for data‑center networks.

Data Center NetworkScalabilitynon-blocking architecture
0 likes · 4 min read
Typical Two‑Layer Spine‑Leaf Topology and Its Scalability
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 25, 2018 · Cloud Computing

Alibaba's High‑Performance Intelligent Data Center Network: Evolution, Programmable Forwarding, RDMA, Automation, and the Luoshen Cloud Network Engine

The article reviews Alibaba's large‑scale data‑center network advancements, covering its high‑performance evolution, programmable forwarding planes, massive RDMA deployment, automated control systems, AI‑driven self‑healing, and the Luoshen cloud network engine that underpins Alibaba Cloud services.

AlibabaData Center NetworkProgrammable Forwarding
0 likes · 10 min read
Alibaba's High‑Performance Intelligent Data Center Network: Evolution, Programmable Forwarding, RDMA, Automation, and the Luoshen Cloud Network Engine