Cloud Computing 12 min read

Unlocking 90 W/s Red Packet Peaks: Alibaba’s High‑Performance Local Storage Blueprint

This article explains how Alibaba Cloud designed a high‑performance local storage solution—leveraging Virtio‑blk, SPDK, and NVMe SSDs—to meet the massive peak pressure, ultra‑low latency, and high IOPS demands of the Alipay red‑packet business, and how the technology is now available in the public cloud.

21CTO
21CTO
21CTO
Unlocking 90 W/s Red Packet Peaks: Alibaba’s High‑Performance Local Storage Blueprint

Red Packet Business Characteristics

Alipay’s red‑packet service saw record participation in 2017, with a peak opening rate of 90 W/s on Chinese New Year’s Eve. The system’s architecture includes modular deployment, unified access, gateways, DAO, databases, and both online and offline data processing.

Red Packet Business Characteristics
Red Packet Business Characteristics

The database layer in the transaction chain has three key traits: massive peak pressure (up to millions of transactions per second during the draw), stringent latency requirements (sub‑100 µs), and built‑in disaster recovery via primary‑secondary replication.

Existing Block Storage Products

Alibaba offers three block‑storage options—SSD cloud disks, high‑efficiency cloud disks, and standard cloud disks. All provide 99.9999999 % reliability, but their IOPS and latency fall short of the red‑packet demands (target IOPS ≥ 200 k, latency < 100 µs).

Existing Block Storage Products
Existing Block Storage Products

High‑Performance Local Storage

To meet these requirements, a high‑performance local storage solution was designed, targeting ultra‑high IOPS and ultra‑low latency.

Key Components

Virtio‑blk : a semi‑virtualized block device interface that enables high‑speed data exchange between the VM and the physical host.

Virtio‑blk Architecture
Virtio‑blk Architecture

SPDK (Storage Performance Development Kit): a user‑space toolkit for building high‑performance, lock‑free storage applications.

SPDK Overview
SPDK Overview

NVMe SSD : PCI‑e SSDs using the NVMe protocol, offering high bandwidth and low latency.

NVMe SSD Advantages
NVMe SSD Advantages

Optimized Data Path

By retaining the standard file system and POSIX API, and streamlining the stack to only the block device layer, SPDK driver, and NVMe SSD, the data path is reduced from seven layers to four, dramatically cutting latency.

Optimized Data Path
Optimized Data Path

Performance Evaluation

Fio benchmarks on CentOS 7 show the high‑performance local disk achieving ~70 µs read latency and ~30 µs write latency, compared to ~130 µs and ~60 µs respectively for a generic virtualized setup.

Latency Distribution
Latency Distribution

Database tests reveal a jump from 14 242 TPS (RT 8.21 ms) to 26 969 TPS (RT 1.7 ms) when using the new storage.

Database Performance Comparison
Database Performance Comparison

Public Cloud Availability

Following its success in the Double‑11 and Chinese New Year red‑packet events, the solution—NVMe SSD + SPDK—was launched to Alibaba Cloud’s public services in February, becoming the world’s first cloud offering with this architecture.

Public Cloud Deployment
Public Cloud Deployment

Current specifications: 3 TB capacity, 500 k IOPS, 50 µs latency, 4 GB/s bandwidth.

Local SSD 2.0
Local SSD 2.0

Conclusion

Combining SPDK with NVMe SSD technology brings local disks close to physical‑machine performance, ensuring stable operation even under extreme peak loads such as Double‑11 and New Year red‑packet activities.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

high performancecloud storageAlibaba CloudNVMeSPDKred packet
21CTO
Written by

21CTO

21CTO (21CTO.com) offers developers community, training, and services, making it your go‑to learning and service platform.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.