Big Data 13 min read

JD Data‑Driven Business Development: Building a Business Metric Data System and Marketplace Governance

The article outlines JD's data‑driven business development strategy, describing the current challenges of its business data marketplace, the governance framework—including layered architecture, standardization, ClickHouse dictionary refresh, and optimization measures—and the resulting performance improvements and future outlook.

DataFunSummit
DataFunSummit
DataFunSummit
JD Data‑Driven Business Development: Building a Business Metric Data System and Marketplace Governance

JD presents its data‑driven business development initiative, focusing on constructing a comprehensive business metric data system and governing the data marketplace.

The current marketplace suffers from siloed development, heavy cross‑layer dependencies, low data sharing, and a lack of unified data standards, leading to issues described as "不可知、不可取、不可用、不可控" (unknown, undesirable, unusable, uncontrollable).

To address these problems, JD proposes a governance model that introduces a three‑tier architecture: a traditional data warehouse at the bottom, a foundational layer with standardized generic models in the middle, and a business metric layer on top that derives fine‑grained indicators and visual dashboards.

Standardization includes building reusable business base models, consolidating user‑wide tables, and defining a clear data‑indicator taxonomy (basic, derived, and composite metrics). A unique tool generates global dimension IDs to avoid redundant calculations when computing derived metrics.

For high‑volume SKU data, JD implements a ClickHouse dictionary‑based refresh solution that loads dimension tables into dictionaries, enabling fast, memory‑efficient queries and reducing refresh time.

Optimization measures such as dictionary sharding, field pruning, type conversion, and using array types for one‑to‑many relationships further cut memory usage and improve query performance.

After applying the governance and optimization practices, JD reports a 43% reduction in data‑warehouse read costs, a 51% decrease in application‑layer model count, a 34% storage reduction, and a three‑hour reduction in dashboard data delivery time.

The future outlook emphasizes building an agile, intelligent, and data‑driven marketplace, with ongoing exploration of data‑warehouse construction, asset tiering, and automated data intelligence.

The article concludes with a Q&A session addressing data‑backtrace costs, raw warehouse governance, and metric domain management.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Big DataMetricsClickHouseData GovernanceJD.com
DataFunSummit
Written by

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.