AI Cyberspace
AI Cyberspace
Feb 22, 2025 · Cloud Computing

Why RoCEv2 Needs a Lossless Network and How to Achieve It

RoCE, originally built for InfiniBand, was adapted to Ethernet as RoCEv2, which uses IP/UDP headers to enable L3 routing but is highly sensitive to packet loss, requiring a lossless network and employing technologies such as PFC, ECN, DCQCN, and multi‑path transmission to maintain high RDMA performance.

Congestion ControlDCQCNECN
0 likes · 17 min read
Why RoCEv2 Needs a Lossless Network and How to Achieve It
Open Source Linux
Open Source Linux
Jun 5, 2024 · Operations

Unraveling Data Center Congestion: Incast, ECN, and PFC Explained

This article examines why data‑center networks experience congestion, detailing many‑to‑one and all‑to‑all traffic patterns, the role of incast, and how mechanisms such as ECN and PFC can be tuned to achieve loss‑free, low‑latency communication.

CLOSData Center NetworkingECN
0 likes · 10 min read
Unraveling Data Center Congestion: Incast, ECN, and PFC Explained
Architects' Tech Alliance
Architects' Tech Alliance
Jun 1, 2024 · Industry Insights

Why Do Data Center Networks Congest? Unpacking Many‑to‑One and All‑to‑All Incast Scenarios

The article analyzes how CLOS spine‑leaf data‑center networks encounter congestion under many‑to‑one and all‑to‑all traffic patterns, explains the limitations of simply enlarging buffers, and details how ECN and PFC mechanisms can be tuned to achieve loss‑less, low‑latency operation.

CLOSCongestion ControlData Center Networking
0 likes · 12 min read
Why Do Data Center Networks Congest? Unpacking Many‑to‑One and All‑to‑All Incast Scenarios
Linux Code Review Hub
Linux Code Review Hub
Mar 6, 2024 · Operations

Advanced Congestion Management Techniques for Lossless Ethernet Storage Networks

The article examines high‑level strategies for preventing and recovering from congestion in lossless Ethernet storage networks, including disconnecting faulty devices, early frame dropping, traffic isolation, endpoint notifications, rate limiting, pause‑timeout, PFC watchdog mechanisms, detailed Cisco configuration commands, and the benefits and limitations of each approach.

Cisco NexusCongestion ManagementECN
0 likes · 33 min read
Advanced Congestion Management Techniques for Lossless Ethernet Storage Networks