Tagged articles
174 articles
Alibaba Cloud Infrastructure
Jun 7, 2022 · Industry Insights

Why Alibaba Cloud Ranks Among the Top 10 Global Network Research Institutions

The 2022 AI‑2000 ranking highlights Alibaba Cloud as one of the ten most influential network research institutions worldwide, detailing its extensive publication record, breakthrough low‑latency RDMA technologies, NFC distance expansion, and the XLINK QUIC protocol that collectively reshape data‑center and wide‑area networking.

AI 2000 · Alibaba Cloud · Data Center Networking
0 likes · 4 min read
Tencent Cloud Developer
Jun 6, 2022 · Cloud Computing

High‑Performance Network Solutions: RDMA, RoCE, iWARP and io_uring – Principles, Implementation and Benchmark Analysis

The article reviews high‑performance networking options—RDMA (including RoCE v2 and iWARP) and Linux’s io_uring—explaining their principles, hardware requirements, and benchmark results, and concludes that while RDMA delivers ultra‑low latency for specialized workloads, io_uring offers modest network benefits, leaving TCP as the default for most services.

Benchmark · High‑Performance Networking · RDMA
0 likes · 10 min read
Architects' Tech Alliance
Jun 4, 2022 · Operations

Comprehensive Survey of Large‑Scale RDMA Technologies and Practices

This article provides a detailed overview of large‑scale RDMA technology, covering basic concepts, major protocols, network‑level techniques such as congestion control, lossless‑to‑lossy evolution and multipath, virtualization, communication libraries for AI training and storage, performance tuning, monitoring, and real‑world deployment experiences.

AI · RDMA · Virtualization
0 likes · 16 min read
Alibaba Cloud Developer
May 31, 2022 · Backend Development

How RDMA‑Powered SMC‑R Transforms TCP Performance in Data Centers

This article explains why traditional Linux kernel TCP stacks struggle with high‑performance demands, introduces shared‑memory IPC and RDMA concepts, describes the SMC‑R hybrid protocol that transparently replaces TCP sockets, and outlines practical acceleration methods and community contributions.

Kernel Network Stack · RDMA · SMC-R
0 likes · 7 min read
IT Architects Alliance
May 23, 2022 · Industry Insights

Why RDMA Is Replacing TCP/IP for AI and High‑Performance Storage

The article analyzes how the AI boom and high‑performance SSD storage demand sub‑microsecond latency, exposing TCP/IP’s inherent context‑switch and CPU overhead, and explains why RDMA’s kernel‑bypass, zero‑copy design and 1 µs latency make it the preferred network stack for modern data‑center workloads despite challenges in Ethernet deployment.

AI computing · Data Center Network · Low latency
0 likes · 11 min read
Architects' Tech Alliance
May 19, 2022 · Fundamentals

An Introduction to RDMA: Concepts, Advantages, Protocols, and Programming Basics

This article explains the fundamentals of Remote Direct Memory Access (RDMA), comparing it with traditional networking, outlining its core advantages, suitable use cases, the three main RDMA protocols (InfiniBand, RoCE, iWARP), deployment requirements, communication flow, and essential programming concepts.

High‑Performance Networking · Low latency · RDMA
0 likes · 9 min read
Architects' Tech Alliance
May 14, 2022 · Fundamentals

High‑Performance Computing Network Solutions: RoCE v2, RDMA, and InfiniBand Overview

The article explains how high‑performance computing (HPC) networks overcome TCP/IP limitations by using RDMA‑based technologies such as RoCE v1/v2 and InfiniBand, detailing their architectures, advantages, vendor implementations, and cost‑effective migration to Ethernet‑based solutions for GPU‑driven workloads.

HPC · HighPerformanceComputing · InfiniBand
0 likes · 7 min read
Architects' Tech Alliance
Mar 14, 2022 · Fundamentals

An Introduction to RDMA: Principles, Protocols, Advantages, and Programming Basics

This article provides a comprehensive overview of Remote Direct Memory Access (RDMA), covering its definition, how it differs from traditional networking, core advantages such as zero‑copy and CPU offload, typical use cases, the three main RDMA protocols, deployment requirements, and essential programming concepts and terminology.

CPU Offload · Data center · High‑performance computing
0 likes · 9 min read
Architects' Tech Alliance
Mar 13, 2022 · Industry Insights

Why RDMA Is Replacing TCP/IP in AI-Driven Data Centers

The article analyzes how the AI era’s demand for ultra‑low latency and high throughput exposes fundamental limits of the traditional TCP/IP stack, and explains why RDMA’s kernel‑bypass, zero‑copy design, and emerging congestion‑control algorithms are becoming the preferred network fabric for modern data‑center workloads.

AI Fabric · Data center · Low latency
0 likes · 12 min read
Architects' Tech Alliance
Mar 4, 2022 · Operations

What Is InfiniBand RDMA and How to Configure It on RHEL 8?

This guide explains the fundamentals of InfiniBand and RDMA, details the InfiniBand Verbs API, outlines the steps required for kernel data handling, and provides practical configuration instructions for RoCE, IPoIB, and the subnet manager on Red Hat Enterprise Linux 8.

IPoIB · InfiniBand · Network Configuration
0 likes · 11 min read
Architects' Tech Alliance
Mar 2, 2022 · Cloud Computing

Bus-Level Data Center Network Technology: RDMA Acceleration and Ultra-Low Latency Innovations

The article examines bus‑level data center network technologies, detailing how RDMA and ultra‑low‑latency forwarding mechanisms reduce end‑to‑end delays, enable high‑performance computing and AI workloads, and drive the evolution toward hyper‑converged, cloud‑native infrastructures.

Data Center Network · High‑performance computing · RDMA
0 likes · 14 min read
IT Architects Alliance
Jan 20, 2022 · Industry Insights

Why Hyper‑Converged Data Center Networks Are the Future of AI‑Driven Infrastructure

The article analyzes how hyper‑converged data‑center networking unifies compute, storage, and HPC on a lossless Ethernet fabric, addresses AI‑era performance bottlenecks, compares RDMA over Ethernet with InfiniBand, and outlines the core metrics, value, and key technologies that enable zero‑loss, low‑latency, high‑throughput operation.

AI · Data center · Hyper-Converged Network
0 likes · 13 min read
Alibaba Cloud Developer
Jan 6, 2022 · Big Data

Inside Alibaba Cloud’s MRACC Engine: How It Won the TPCx‑BB Benchmark

Alibaba Cloud’s self‑developed MRACC (Apsara Compute MapReduce Accelerator) leveraged hardware‑software integration, Spark and Hadoop optimizations, and eRDMA networking to achieve the top TPCx‑BB SF3000 performance, delivering up to 2‑3× faster SQL queries and 30% faster Spark shuffle, with significant cost efficiency gains.

Benchmark · Big Data · RDMA
0 likes · 9 min read
Open Source Linux
Nov 22, 2021 · Cloud Computing

How Alibaba’s SMC‑R Brings Zero‑Modification RDMA to Cloud Applications

This article explains the background, architecture, and performance of Alibaba Cloud's SMC‑R technology, which enables transparent, zero‑modification use of RDMA over standard socket interfaces, offering higher throughput and lower latency for cloud workloads while providing automatic fallback to TCP.

Alibaba Cloud · Elastic RDMA · RDMA
0 likes · 11 min read
IT Architects Alliance
Sep 13, 2021 · Industry Insights

Why Hyper‑Converged Data Center Networks Are the Future of AI‑Driven Infrastructure

The article analyzes how AI‑driven workloads, exploding storage and compute capabilities, and distributed architectures expose the limits of traditional three‑network data‑center designs, and explains why a lossless, hyper‑converged Ethernet network with zero‑loss, low‑latency, high‑throughput characteristics is becoming essential.

AI · Data center · Hyper-Converged Network
0 likes · 12 min read
Architects' Tech Alliance
Sep 9, 2021 · Fundamentals

Understanding DMA and RDMA: Principles, Advantages, and Protocols

This article explains the concepts of Direct Memory Access (DMA) and Remote Direct Memory Access (RDMA), compares traditional data transfer with DMA-enabled paths, outlines RDMA's advantages such as zero-copy and kernel bypass, and reviews the main RDMA protocols, standards bodies, and hardware ecosystem.

DMA · High-Performance Computing · Kernel Bypass
0 likes · 14 min read
Architects' Tech Alliance
Aug 6, 2021 · Big Data

Performance Optimization Techniques for the Ceph Distributed Storage System

This article reviews Ceph's architecture, enumerates common benchmarking tools, analyzes its advantages and challenges, and presents a comprehensive set of performance‑optimization methods covering storage‑engine tuning, network communication, data placement, configuration parameters, hardware‑specific adaptations, and future research directions.

Ceph · NVMe · Performance Optimization
0 likes · 20 min read
Architects' Tech Alliance
Jul 3, 2021 · Cloud Computing

Performance Optimization Techniques for the Ceph Distributed Storage System

This article reviews Ceph's architecture, benchmarks, monitoring methods, and a wide range of performance‑optimizing strategies—including storage‑engine tweaks, network‑communication improvements, data‑placement algorithms, configuration tuning, and hardware‑specific adaptations—while also outlining future research directions.

Ceph · NVMe · RDMA
0 likes · 18 min read
Architects' Tech Alliance
Apr 28, 2021 · Industry Insights

Why InfiniBand Is Outpacing Ethernet in High‑Performance Computing

The article provides a comprehensive technical overview of InfiniBand, covering its history, standards, architecture layers, packet format, performance advantages, and a detailed comparison with Ethernet, highlighting why it has become the preferred high‑speed interconnect for HPC workloads.

Data Transfer · High‑performance computing · InfiniBand
0 likes · 15 min read
Architects' Tech Alliance
Mar 7, 2021 · Fundamentals

Understanding RDMA: InfiniBand, iWARP, and RoCE Technologies and Their Differences

This article explains Remote Direct Memory Access (RDMA), its origins in InfiniBand, and the Ethernet‑based variants iWARP and RoCE (including RoCEv1 and RoCEv2), and compares their architectures, performance characteristics, and deployment requirements for high‑performance computing and data‑center networks.

High‑Performance Networking · InfiniBand · RDMA
0 likes · 11 min read
Architects' Tech Alliance
Jan 10, 2021 · Industry Insights

Why RoCE Is Revolutionizing Data Center Networking: A Deep Dive into RDMA over Ethernet

This article explains the fundamentals of RDMA and RoCE, compares RoCE v1 and v2, outlines deployment steps, highlights performance benefits such as low CPU usage and zero‑copy, and answers common questions about its differences from iWARP and InfiniBand, helping data‑center engineers evaluate the technology.

Data Center Networking · High Bandwidth · Low latency
0 likes · 8 min read
Tencent Cloud Developer
Sep 17, 2020 · Cloud Computing

Evolution and Performance Optimization of Tencent Cloud Block Storage (CBS)

Tencent Cloud Block Storage (CBS) has evolved through three generations—apllo, atlas, and HiSTOR—adopting a client‑direct, distributed architecture, SPDK, RDMA and user‑space TCP to cut latency to sub‑microseconds while delivering exabyte‑scale throughput, high IOPS, and reliable multi‑copy replication for cloud VM workloads.

Distributed Systems · RDMA · SPDK
0 likes · 24 min read
Tencent Cloud Developer
Aug 25, 2020 · Cloud Computing

Introducing Tencent Cloud CBS Enhanced and Ultra‑Fast SSD Cloud Disks: Architecture and Performance Optimizations

Tencent Cloud’s new CBS 3.0‑based Enhanced and Ultra‑Fast SSD cloud disks cut latency by over 50 % to sub‑100 µs, boost IOPS up to 1.1 million and throughput to 4 GB/s, and achieve these gains through SPDK‑driven virtualization, RDMA‑based data paths, a user‑space ZTCP stack, zero‑copy memory handling and dedicated hardware acceleration, targeting latency‑sensitive and IO‑intensive workloads such as large databases, video processing and AI inference.

RDMA · SSD · Tencent Cloud
0 likes · 8 min read
DataFunTalk
May 8, 2020 · Artificial Intelligence

Distributed Machine Learning Framework GDBT for High‑Dimensional Real‑Time Recommendation Systems

The article explains how 4Paradigm's distributed machine learning framework GDBT tackles the massive data, high‑dimensional features, and real‑time requirements of modern recommendation systems by leveraging heterogeneous computing, parameter servers, RDMA networking, and optimized workloads.

GDBT · Parameter Server · RDMA
0 likes · 18 min read
Alibaba Cloud Developer
Apr 28, 2020 · Artificial Intelligence

How Alibaba Cloud Powers AI with Cutting‑Edge Heterogeneous Compute

This article explains how Alibaba Cloud builds a high‑performance AI infrastructure by combining advanced hardware such as Shenlong servers, GPUs, FPGAs, NPUs, and custom interconnects like RDMA, together with virtualization, FPGA‑as‑a‑Service, AIACC, and resource‑pooling technologies to deliver scalable, cost‑effective AI services.

AI hardware · Alibaba Cloud · FPGA as a Service
0 likes · 20 min read
Architects' Tech Alliance
Apr 3, 2020 · Industry Insights

Why InfiniBand Beats TCP/IP: Deep Dive into Architecture and Socket Direct

This article explains how InfiniBand’s RDMA‑based architecture, layered protocol stack, and Mellanox Socket Direct technology deliver far higher bandwidth, lower latency, and better CPU efficiency than traditional TCP/IP networks, and it presents performance test results that show up to an 80% latency reduction.

Fabric · High‑performance computing · InfiniBand
0 likes · 11 min read
UCloud Tech
Jan 16, 2020 · Operations

How to Build a Low‑Latency, Lossless RoCE Network for High‑Performance Data Centers

This article explains how to design a low‑overhead, high‑performance lossless RoCE network for data centers, covering RDMA basics, mainstream network options, QoS, lossless and congestion‑control designs, buffer management, deadlock analysis, and practical tuning to achieve sub‑100 µs latency and near‑full bandwidth utilization.

Data Center Networking · Lossless Ethernet · QoS
0 likes · 21 min read
Architects' Tech Alliance
Jul 18, 2019 · Fundamentals

Overview of OpenFabrics Enterprise Distribution (OFED) and InfiniBand Software Architecture

This article provides a comprehensive overview of the OpenFabrics Enterprise Distribution (OFED) and the InfiniBand software architecture, covering its history, components, middleware, protocol stack, and how it enables high‑performance, low‑latency networking for IP, storage, and compute applications.

High-Performance Computing · InfiniBand · Linux
0 likes · 11 min read
Architects' Tech Alliance
Jul 17, 2019 · Fundamentals

Understanding NVMe over Fabrics: Protocols, RDMA, and Fabric Options

This article explains the NVMe over Fabrics architecture, compares various fabric transports such as FC, InfiniBand, RoCE v2, iWARP and TCP, and details how RDMA‑based technologies like zero‑copy, kernel bypass and CPU‑free transfers give NVMe‑oF its performance advantages while also covering protocol differences, FC‑NVMe, and the emergence of NVMe/TCP.

Fabrics · NVMe · Networking
0 likes · 11 min read
Architects' Tech Alliance
Jun 13, 2019 · Fundamentals

Understanding OpenFabrics Enterprise Distribution (OFED) and the InfiniBand Software Architecture

This article explains the OpenFabrics Enterprise Distribution (OFED) ecosystem, its history, the InfiniBand hardware and software stack, key protocols such as IPoIB, SDP and iSER, and how these technologies enable high‑performance, low‑latency networking across Linux, Windows and virtualized environments.

High-Performance Computing · InfiniBand · Linux
0 likes · 12 min read
Alibaba Cloud Developer
Jun 12, 2019 · Artificial Intelligence

How Alibaba’s PAISoar Accelerates Deep Learning: 101× Speedup on 128 GPUs

Alibaba engineers detail the PAISoar distributed training framework, showing how RDMA‑optimized hardware, Ring AllReduce algorithms, and user‑friendly APIs boost deep‑learning models—like the GreenNet CNN—to 101‑fold speedups on 128 GPUs, dramatically reducing training time from days to under a day.

AI Infrastructure · Deep Learning · Distributed Training
0 likes · 17 min read
Architects' Tech Alliance
Jun 9, 2019 · Fundamentals

Detailed Overview of NVMe Architecture and NVMe over Fabrics

This article provides a comprehensive technical overview of NVMe architecture, the NVMe‑over‑Fabric extensions—including InfiniBand, RoCE, iWARP, Fibre Channel, and TCP—explaining their RDMA‑based advantages, protocol differences, and practical considerations for data‑center storage deployments.

Fibre Channel · InfiniBand · NVMe
0 likes · 12 min read
Architects' Tech Alliance
Apr 8, 2019 · Fundamentals

Understanding RDMA: Principles, Advantages, and Implementation Details

This article explains how RDMA (Remote Direct Memory Access) technology, originating from InfiniBand and extended to Ethernet (RoCE) and TCP/IP (iWARP), provides ultra‑low latency, high throughput, and minimal CPU usage for high‑performance computing and big‑data applications by bypassing traditional OS and protocol stack processing.

High‑Performance Networking · Low latency · RDMA
0 likes · 8 min read
Architects' Tech Alliance
Apr 7, 2019 · Fundamentals

Understanding NVMe/TCP and Its Role in Modern Data Center Storage

The article explains the evolution of NVMe‑oF, compares RDMA, FC and TCP transports, highlights the advantages and challenges of NVMe/TCP in modern data‑center and cloud storage, and discusses Lightbits' LightOS and accelerator card as a cost‑effective solution for high‑performance distributed storage.

Data center · Lightbits · NVMe
0 likes · 10 min read
Architects' Tech Alliance
Feb 14, 2019 · Fundamentals

Understanding RDMA (Remote Direct Memory Access): Background, Related Work, and Technical Details

This article provides a comprehensive overview of Remote Direct Memory Access (RDMA), covering its background, limitations of traditional TCP/IP, related technologies such as TOE, U-Net, VIA, and detailed explanations of RDMA concepts, hardware implementations, verbs, and communication workflows.

RDMA · Remote Direct Memory Access · network architecture
0 likes · 17 min read
Architects' Tech Alliance
Feb 3, 2019 · Fundamentals

Understanding GPUDirect RDMA: Principles, Implementation, and Performance

This article explains the background of GPU communication, introduces DMA and RDMA fundamentals, describes how GPUDirect RDMA enables direct GPU-to-GPU memory access across machines, and presents performance results showing reduced latency and increased bandwidth for distributed deep‑learning training.

Deep Learning · GPU communication · GPUDirect
0 likes · 7 min read
Architects' Tech Alliance
Jan 13, 2019 · Fundamentals

Overview of InfiniBand Technology and Its Protocol Stack

This article provides a comprehensive overview of InfiniBand technology, covering its open‑standard architecture, history, OFED software stack, protocol layers, performance advantages over traditional storage networks, and its primary use cases in high‑performance computing and data‑center environments.

High-Performance Computing · InfiniBand · Networking
0 likes · 11 min read
Architects' Tech Alliance
Jan 10, 2019 · Fundamentals

Understanding RDMA: Principles, Advantages, and Implementation Details

This article explains the challenges of high‑performance computing and big‑data workloads on traditional TCP/IP stacks, introduces RDMA technology, its variants (InfiniBand, RoCE, iWARP), key protocols, hardware components, and how it achieves ultra‑low latency and high throughput with minimal CPU involvement.

InfiniBand · Network Protocols · RDMA
0 likes · 13 min read
Architects' Tech Alliance
Jan 3, 2019 · Industry Insights

How NVMe over Fabrics Is Transforming Modern Storage Networks

This article examines the evolution from legacy SCSI and SAS storage protocols to NVMe and NVMe over Fabrics, explaining the performance bottlenecks of traditional storage, the technical advantages of NVMe, deployment options, vendor implementations, and future trends shaping data‑center storage architectures.

Data center · NVMe · RDMA
0 likes · 11 min read
Alibaba Cloud Developer
Dec 7, 2018 · Databases

How Alibaba Achieved Extreme Database Elasticity with Hybrid Cloud, Containers, and Storage‑Compute Separation

This article explains how Alibaba transformed its database infrastructure through hybrid‑cloud high‑performance ECS, container‑based multi‑instance deployment, and a user‑space storage‑compute separation architecture with RDMA, dramatically improving resource utilization, scaling speed, and cost efficiency for massive traffic spikes.

RDMA · Storage Compute Separation · cloud-native
0 likes · 15 min read
Architects' Tech Alliance
Dec 4, 2018 · Fundamentals

Understanding RDMA High‑Performance Networks: Principles, Benefits, and Applications in Machine Learning

The article explains the background, architecture, and performance advantages of RDMA high‑performance networking, compares it with traditional TCP/IP, describes its deployment at Baidu for machine‑learning workloads, and outlines future use cases such as storage acceleration, GPU communication, and core services.

High‑Performance Networking · RDMA · RoCE
0 likes · 12 min read
Alibaba Cloud Developer
Nov 26, 2018 · Databases

How Alibaba’s DBFS Achieved Storage‑Compute Separation for Massive 11.11 Sales

This article details Alibaba's journey from the 2017 pilot of storage‑compute separation to the 2018 large‑scale deployment of the DBFS user‑space file system, highlighting innovations such as zero‑copy I/O, RDMA integration, adaptive page cache, asynchronous I/O, atomic writes, online resize, and hardware‑software co‑design that enabled elastic, high‑performance database operations during the Double‑11 shopping festival.

DBFS · Database Performance · RDMA
0 likes · 15 min read
Architects' Tech Alliance
Nov 25, 2018 · Industry Insights

Why RDMA Makes NVMe‑over‑Fabric Faster: A Deep Dive into Fabrics, FC, InfiniBand, RoCE and TCP

The article examines how NVMe‑over‑Fabric extends NVMe beyond PCIe using various fabrics—FC, InfiniBand, RoCE v2, iWARP and TCP—highlighting RDMA’s zero‑copy, kernel‑bypass and CPU‑free advantages, and comparing protocol differences, performance trade‑offs, and the evolution toward NVMe/TCP.

Fibre Channel · InfiniBand · NVMe
0 likes · 13 min read
Alibaba Cloud Infrastructure
Nov 21, 2018 · Cloud Computing

Alibaba Data Center Network Architecture HAIL 5.1: High Availability, De‑stacking, and Low‑Latency RDMA Design

The article describes Alibaba's HAIL 5.1 data‑center network architecture introduced for the 2018 Double‑11 event, detailing its high‑availability de‑stacking design, low‑latency RDMA deployment, and future HAIL 2.0 evolution to support larger‑scale, intelligent, and high‑performance cloud networking.

Data center · Low latency · RDMA
0 likes · 9 min read
UCloud Tech
Nov 8, 2018 · Cloud Computing

Unlocking 13× IOPS: Inside UCloud’s High‑Performance SSD Cloud Disk Architecture

UCloud’s latest SSD cloud disk redesign dramatically improves performance—raising IOPS by 13‑fold, cutting latency tenfold, and expanding capacity—through a two‑layer IO path, 1 MB metadata shards, multithreaded models, overload protection, online migration, and upcoming RDMA/SPDK‑based ultra‑high‑performance storage solutions.

IO performance · RDMA · SPDK
0 likes · 13 min read
Architects' Tech Alliance
Oct 31, 2018 · Fundamentals

Understanding InfiniBand: Architecture, Protocols, and Performance

InfiniBand is a high‑performance network protocol that uses credit‑based flow control and switched fabric architecture to provide low latency, high bandwidth, and reliable data transfer, offering advantages over TCP/IP such as reduced packet loss, efficient RDMA, and support for various upper‑layer protocols.

High‑performance computing · InfiniBand · RDMA
0 likes · 10 min read
Architects' Tech Alliance
Oct 28, 2018 · Fundamentals

Understanding OpenFabrics Enterprise Distribution (OFED) and InfiniBand Software Architecture

This article provides a comprehensive overview of OpenFabrics Enterprise Distribution (OFED), its history, component stack, and the layered InfiniBand software architecture, explaining how various protocols such as IPoIB, SDP, and iSER enable high‑performance, low‑latency networking for Linux and Windows applications.

High-Performance Computing · InfiniBand · Linux
0 likes · 8 min read
Alibaba Cloud Infrastructure
Sep 25, 2018 · Cloud Computing

Alibaba's High‑Performance Intelligent Data Center Network: Evolution, Programmable Forwarding, RDMA, Automation, and the Luoshen Cloud Network Engine

The article reviews Alibaba's large‑scale data‑center network advancements, covering its high‑performance evolution, programmable forwarding planes, massive RDMA deployment, automated control systems, AI‑driven self‑healing, and the Luoshen cloud network engine that underpins Alibaba Cloud services.

Alibaba · Data Center Network · Programmable Forwarding
0 likes · 10 min read
Tencent Cloud Developer
Sep 11, 2018 · Databases

CynosDB Architecture and Core Mechanisms: A Comprehensive Technical Overview

CynosDB is Tencent Cloud’s high‑performance, highly‑available NewSQL distributed database that uses a shared‑storage architecture with primary and replica instances, log‑structured storage, RDMA/SPDK transmission, Raft consensus, SSI MVCC, two‑phase locking, and state‑machine replication to provide elastic scaling, fast recovery, and cost‑effective enterprise data management.

CynosDB · MVCC · NewSQL
0 likes · 17 min read
Alibaba Cloud Developer
Aug 23, 2018 · Operations

Inside Alibaba’s Vision for the Future of Network Technology at SIGCOMM 2018

Alibaba showcased its cutting‑edge network research at SIGCOMM 2018, highlighting programmable hardware, integrated software‑hardware designs, AI‑driven operations, the Hyper‑scale Edge Network vision, hierarchical network lifecycle management, large‑scale RDMA deployment, and advanced network visualization, positioning itself alongside leading global tech firms.

Alibaba · Networking · RDMA
0 likes · 8 min read
Alibaba Cloud Infrastructure
Aug 22, 2018 · Fundamentals

Alibaba’s Contributions and Vision at SIGCOMM 2018: Future Network Technologies

At SIGCOMM 2018 in Hungary, Alibaba showcased its pioneering network research, sharing visions on programmable hardware, integrated software‑hardware design, intelligent operations, the Hyper‑scale Edge Network, RDMA deployment, and network visualization, while presenting several influential papers and collaborating with leading academia.

Alibaba · Edge Computing · Future Networks
0 likes · 8 min read
Didi Tech
Jun 8, 2018 · Artificial Intelligence

DiDi PS: High-Performance RDMA-Based Parameter Server for Distributed Deep Learning

DiDi PS is a custom RDMA‑based parameter server that uses a ring topology and optimized ibverbs communication to dramatically accelerate distributed deep‑learning training, consistently outperforming OpenMPI, NCCL2, TensorFlow’s built‑in RDMA, and Horovod while providing more stable and scalable synchronization for massive data workloads.

Allreduce · Distributed Training · Parameter Server
0 likes · 10 min read
Architects' Tech Alliance
Apr 22, 2018 · Fundamentals

An Overview of Remote Direct Memory Access (RDMA): Principles, Comparisons, and Implementations

This article provides a comprehensive overview of Remote Direct Memory Access (RDMA), detailing its underlying principles, performance advantages over traditional TCP/IP, various protocol families such as InfiniBand, RoCE, and iWARP, and their respective hardware and software requirements.

High‑performance computing · InfiniBand · Low latency
0 likes · 9 min read
Alibaba Cloud Developer
Sep 30, 2017 · Databases

How PolarDB Redefines Cloud‑Native Relational Databases

This article traces the evolution of relational databases, explains the rise of cloud‑native computing, and details how Alibaba Cloud’s PolarDB combines storage‑compute separation, RDMA networking, shared‑disk architecture, and advanced replication techniques to deliver high‑performance, scalable, and cost‑effective database services.

Distributed Systems · Parallel Raft · RDMA
0 likes · 23 min read
JD Tech
Sep 26, 2017 · Cloud Computing

Impact of RDMA Technology on High‑Performance Data Centers and Its Adoption at JD.com

The article explains how RDMA (Remote Direct Memory Access) reduces CPU involvement, lowers latency, and increases bandwidth in data‑center networks, describes JD.com’s practical deployments across AI, big‑data, storage, and HPC workloads, and highlights industry trends toward broader RDMA adoption.

Data center · Distributed Systems · High‑performance computing
0 likes · 6 min read
Architects' Tech Alliance
Jul 9, 2017 · Fundamentals

An Introduction to RDMA: Principles, Operation, and Integration with TCP/Ethernet

This article explains the growing need for more efficient data‑center networking, introduces Remote Direct Memory Access (RDMA) technology, describes its working principles, operation types, and how it can be layered over TCP/Ethernet to reduce latency and CPU overhead in high‑performance environments.

Data center · High‑performance computing · RDMA
0 likes · 14 min read
Architects' Tech Alliance
Jun 5, 2017 · Fundamentals

Overview of InfiniBand Technology: Development, Advantages, Architecture, Protocol Layers, and Applications

This article provides a comprehensive overview of InfiniBand technology, covering its history, performance advantages over traditional interconnects, architectural concepts, layered protocol specifications, and typical use cases in high‑performance computing and data‑center environments.

Data center · High-Performance Computing · InfiniBand
0 likes · 14 min read