Big Data 16 min read

Evolution and Architecture of JD.com Self‑Operated Rebate Platform

The article details the development, challenges, and redesign of JD.com’s self‑operated rebate system, describing its early monolithic architecture, data‑intensive processing pipeline, migration to a modular, high‑availability platform built on Spark, Hive, and Elasticsearch, and the resulting performance and operational improvements.

JD Retail Technology
JD Retail Technology
JD Retail Technology
Evolution and Architecture of JD.com Self‑Operated Rebate Platform

Since its pilot launch in 2014, JD.com’s self‑operated rebate system has served multiple business groups, providing rule management, data aggregation, rebate calculation, settlement, risk monitoring, and analytics across a massive volume of order and policy data.

The original system used a "chimney" monolithic architecture that suffered from weak extensibility, poor stability, lengthy calculation workflows, and high maintenance costs, as illustrated by tangled database table dependencies and complex SQL joins.

To address these issues, JD.com built a new platform (the "Jinfan" platform) starting in 2020, adopting a six‑layer architecture: source tables, extraction layer (offline SparkSQL and real‑time BinLog sync), storage layer (JED relational DB, Elasticsearch, Hive, HBase), computation layer (batch Spark jobs replacing MapReduce), service layer (rule integration and query services), and application layer (dashboards and reports).

Key architectural improvements include database read‑write separation, asynchronous processing, caching, multi‑datacenter deployment for high availability, template‑based extensibility, dynamic field configuration for rebate rules, and a Spark‑based rebate calculation engine that reduces ETL and computation latency.

After deployment, the new platform supported five complex business scenarios within three months, cutting development effort and delivery time by about 50%, and demonstrated significant performance gains in query latency and throughput.

Future work focuses on adding real‑time streaming capabilities, enhancing full‑link data quality monitoring, and further optimizing rebate calculation performance to meet growing business demands.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Big Datahigh availabilityETLSparkrebate system
JD Retail Technology
Written by

JD Retail Technology

Official platform of JD Retail Technology, delivering insightful R&D news and a deep look into the lives and work of technologists.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.