Databases 13 min read

JD’s Color Gateway: Tens of Millions QPS with Cloud‑Native Data Warehouse

During the 2022 China Internet Industry Application Salon, JD Cloud’s product manager explained how the Color gateway, an API gateway handling billions of daily requests, overcomes stability, high‑availability, reliability, and performance challenges during peak sales by adopting a cloud‑native ClickHouse data warehouse that boosts processing speed, reduces costs, and provides real‑time analytics.

JD Cloud Developers
JD Cloud Developers
JD Cloud Developers
JD’s Color Gateway: Tens of Millions QPS with Cloud‑Native Data Warehouse

What is the Color Gateway?

Before introducing the Color gateway, we first explain what an API gateway is. An API gateway receives all client requests, routes them to backend services, provides aggregation and protocol conversion, and can aggregate results from multiple services, reducing integration cost, improving efficiency, and ensuring security.

The JD Retail Color gateway, as an API gateway service, handles hundreds of billions of daily traffic and calls, and is a crucial technical component for promotional events.

JD Retail Color Gateway
JD Retail Color Gateway

Challenges During Major Promotions

The Color gateway faces four main challenges during large‑scale sales events:

Stability: must ensure smooth operation under high‑concurrency traffic from core services.

High availability: thousands of backend interfaces must operate independently so that a failure in one does not affect others.

Reliability: security measures such as anti‑scraping and flow control are needed to prevent overload.

Performance: real‑time monitoring of gateway performance and request interception is essential for decision‑making.

Challenges for Color Gateway during promotions
Challenges for Color Gateway during promotions

Architecture Selection and Solution

1. Original Architecture – HBase Pain Points

The monitoring service, core of the gateway, relied on HBase for log analysis, but HBase exhibited several shortcomings:

Inability to meet multi‑dimensional aggregation query performance requirements.

Lack of time‑series aggregation capability (requiring manual development).

Hotspot and memory consumption issues.

Limited data analysis performance, unable to meet response time expectations.

To address these issues, the architecture was upgraded.

Original architecture with HBase pain points
Original architecture with HBase pain points

2. Upgrade – Cloud‑Native Data Warehouse Solution

JD Cloud developed a cloud‑native data warehouse based on ClickHouse, separating storage and compute.

The upgraded architecture divides traffic governance into two services: full‑data (offline, written to reports and monitoring logs) and real‑time data (streamed via Kafka to Flink, then to the cloud‑native warehouse for real‑time analysis).

Upgraded cloud‑native data warehouse architecture
Upgraded cloud‑native data warehouse architecture

Compared with HBase, the cloud‑native warehouse improves processing performance by 60% (from 1.92 × 10⁸ rows/min to 3.1 × 10⁸ rows/min) and enables second‑level scaling of compute resources, dramatically reducing storage costs.

3. Problems with Traditional Data Warehouses

Traditional warehouses suffer from technical bottlenecks, complex architecture, high query latency, difficulty achieving high concurrency and availability, and high operational costs.

4. Required OLAP Engine for Explosive Business Growth

To meet massive data demands, an OLAP engine with the following characteristics is needed:

Separation of storage and compute to cut storage costs by over 50% and allow rapid elastic scaling.

High availability and ultra‑high performance, delivering sub‑second response even under massive load.

Out‑of‑the‑box usability: one‑click cluster creation, visual monitoring, and immediate production readiness.

Desired OLAP engine features
Desired OLAP engine features

5. Advantages of the Cloud‑Native ClickHouse Warehouse

Key benefits include:

Very low cost: storage cost reduced by more than 80% per PB.

Extreme elasticity: compute nodes can be scaled up or down within seconds.

Simplicity: one‑click provisioning of PB‑scale instances with visual monitoring and log analysis.

ClickHouse advantages
ClickHouse advantages

6. ClickHouse Architecture

The system adopts a storage‑compute separation at the database management layer, with a cloud‑control platform providing user‑facing operations such as purchase, configuration, and instance management.

ClickHouse product architecture
ClickHouse product architecture

7. Compute and Storage Node Advantages

Compute nodes are stateless, focusing solely on dynamic resource allocation, improving utilization. Storage nodes are also stateless, organized into groups that handle concurrent requests and support multi‑level caching, enabling fast data access and easy scaling when I/O becomes a bottleneck.

Compute node advantages
Compute node advantages
Storage node advantages
Storage node advantages

8. Second‑Level Elasticity

Users do not need to manage data distribution; they only need to consider compute capacity, while the system automatically handles storage scaling.

Second‑level elasticity
Second‑level elasticity

9. Cost Reduction

ClickHouse’s multi‑replica shared storage cuts storage cost by 50%, distributed storage pricing is about 60% of traditional cloud disks, and billing is based on actual data volume (50‑80% of raw size), yielding overall savings of 50‑80%.

Cost savings with storage‑compute separation
Cost savings with storage‑compute separation

10. Supporting JD Business

The cloud‑native warehouse now powers dozens of core JD services, including real‑time dashboards, promotional product selection platforms, precise marketing, advertising, log and traffic analysis, and more. Its elastic architecture allows dynamic scaling during peak and off‑peak periods, letting users focus on business value.

JD core services supported by ClickHouse
JD core services supported by ClickHouse
Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

performanceCloud NativeCost Optimizationdata-warehouseOLAP
JD Cloud Developers
Written by

JD Cloud Developers

JD Cloud Developers (Developer of JD Technology) is a JD Technology Group platform offering technical sharing and communication for AI, cloud computing, IoT and related developers. It publishes JD product technical information, industry content, and tech event news. Embrace technology and partner with developers to envision the future.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.