AI Cyberspace
Feb 22, 2025 · Cloud Computing

Why RoCEv2 Needs a Lossless Network and How to Achieve It

RDMA, originally built for InfiniBand, was adapted to Ethernet as RoCE. RoCEv2 adds IP/UDP headers to enable L3 routing but is highly sensitive to packet loss, so it requires a lossless network and relies on technologies such as PFC, ECN, DCQCN, and multi‑path transmission to maintain high RDMA performance.

Congestion Control · DCQCN · ECN
17 min read
Open Source Linux
Jun 5, 2024 · Operations

Unraveling Data Center Congestion: Incast, ECN, and PFC Explained

This article examines why data‑center networks experience congestion, detailing many‑to‑one and all‑to‑all traffic patterns, the role of incast, and how mechanisms such as ECN and PFC can be tuned to achieve loss‑free, low‑latency communication.

CLOS · Data Center Networking · ECN
10 min read
Architects' Tech Alliance
Jun 1, 2024 · Industry Insights

Why Do Data Center Networks Congest? Unpacking Many‑to‑One and All‑to‑All Incast Scenarios

The article analyzes how CLOS spine‑leaf data‑center networks become congested under many‑to‑one and all‑to‑all traffic patterns, explains why simply enlarging buffers does not solve the problem, and details how ECN and PFC can be tuned to achieve lossless, low‑latency operation.

CLOS · Congestion Control · Data Center Networking
12 min read
Linux Code Review Hub
Mar 6, 2024 · Operations

Advanced Congestion Management Techniques for Lossless Ethernet Storage Networks

The article examines strategies for preventing and recovering from congestion in lossless Ethernet storage networks: disconnecting faulty devices, early frame dropping, traffic isolation, endpoint notifications, rate limiting, pause timeout, and PFC watchdog. It includes detailed Cisco configuration commands and weighs the benefits and limitations of each approach.

Cisco Nexus · Congestion Management · ECN
33 min read
Linux Code Review Hub
Mar 4, 2024 · Operations

How to Troubleshoot Congestion in Lossless Ethernet Storage Networks – Part 5

This article presents a step‑by‑step methodology for detecting, diagnosing, and resolving congestion in lossless Ethernet storage networks. It covers severity levels, spine‑leaf troubleshooting workflows, remote monitoring, and comparative analysis of pause‑frame metrics, along with real‑world case studies that illustrate the impact of over‑utilization and mixed traffic on network performance.

Congestion Management · Ethernet · FC/FCoE
28 min read
Linux Code Review Hub
Feb 27, 2024 · Fundamentals

How to Compute Headroom for Lossless Ethernet Links (Part 2)

This article analytically derives the headroom size for lossless Ethernet links that use IEEE 802.1Qbb Priority Flow Control, detailing worst‑case delays, interface and cable latency, and buffer cell sizing, with Cisco Nexus configuration examples and a comparison with Fibre Channel B2B primitives.

Buffers · Cisco · DataCenter
22 min read
Linux Code Review Hub
Feb 26, 2024 · Fundamentals

Understanding Ethernet Flow Control and Congestion Management (Part 1)

This article explains the Ethernet flow‑control mechanisms LLFC and PFC, how pause frames work and how their quanta are calculated, the role of the pause and resume thresholds (XOFF/XON), the headroom and footroom concepts, and buffer‑queue management. It also provides Cisco Nexus configuration examples for lossless storage networks.

Cisco Nexus · Congestion Management · Data Center Networking
19 min read