JD Data‑Driven Business Development: Building a Business Metric Data System and Marketplace Governance
The article outlines JD's data‑driven business development strategy, describing the current challenges of its business data marketplace, the governance framework—including layered architecture, standardization, ClickHouse dictionary refresh, and optimization measures—and the resulting performance improvements and future outlook.
JD presents its data‑driven business development initiative, focusing on constructing a comprehensive business metric data system and governing the data marketplace.
The current marketplace suffers from siloed development, heavy cross‑layer dependencies, low data sharing, and a lack of unified data standards, leading to issues described as "不可知、不可取、不可用、不可控" (unknown, undesirable, unusable, uncontrollable).
To address these problems, JD proposes a governance model that introduces a three‑tier architecture: a traditional data warehouse at the bottom, a foundational layer with standardized generic models in the middle, and a business metric layer on top that derives fine‑grained indicators and visual dashboards.
Standardization includes building reusable business base models, consolidating user‑wide tables, and defining a clear data‑indicator taxonomy (basic, derived, and composite metrics). A unique tool generates global dimension IDs to avoid redundant calculations when computing derived metrics.
For high‑volume SKU data, JD implements a ClickHouse dictionary‑based refresh solution that loads dimension tables into dictionaries, enabling fast, memory‑efficient queries and reducing refresh time.
Optimization measures such as dictionary sharding, field pruning, type conversion, and using array types for one‑to‑many relationships further cut memory usage and improve query performance.
After applying the governance and optimization practices, JD reports a 43% reduction in data‑warehouse read costs, a 51% decrease in application‑layer model count, a 34% storage reduction, and a three‑hour reduction in dashboard data delivery time.
The future outlook emphasizes building an agile, intelligent, and data‑driven marketplace, with ongoing exploration of data‑warehouse construction, asset tiering, and automated data intelligence.
The article concludes with a Q&A session addressing data‑backtrace costs, raw warehouse governance, and metric domain management.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
DataFunSummit
Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
