Tagged articles
548 articles
Page 3 of 6
Zhuanzhuan Tech
Zhuanzhuan Tech
Dec 14, 2023 · Big Data

Design and Implementation of a Data Service Platform for New Media Business

This article details the background, challenges, design principles, and implementation of a unified data service platform—including data modeling, multi-source governance, real-time processing, and a Doris-based storage solution—to support large‑scale video data for a new media operation.

Apache DorisData GovernanceData Platform
0 likes · 7 min read
Design and Implementation of a Data Service Platform for New Media Business
DataFunSummit
DataFunSummit
Dec 13, 2023 · Artificial Intelligence

Enterprise Large‑Model Deployment: Data Governance, Fine‑Tuning Strategies, and Cost Economics

The article explores how enterprises can adopt domain‑specific large language models by addressing talent and cost challenges, outlining training pipelines, data governance for unstructured data, dataset balancing, fine‑tuning techniques, and a product ecosystem that lowers deployment barriers while optimizing performance and economics.

AI deploymentData Governancecost economics
0 likes · 13 min read
Enterprise Large‑Model Deployment: Data Governance, Fine‑Tuning Strategies, and Cost Economics
Data Thinking Notes
Data Thinking Notes
Dec 5, 2023 · Big Data

How to Overcome Data Governance Challenges and Unlock Business Value

Enterprises face significant hurdles in data governance and integration, from siloed systems and unclear responsibilities to poor data quality, but by establishing clear rules, fostering user department engagement, and aligning governance with business-driven data applications, they can create a cohesive data asset management framework that drives value.

Big DataData AssetsData Governance
0 likes · 10 min read
How to Overcome Data Governance Challenges and Unlock Business Value
Big Data Technology & Architecture
Big Data Technology & Architecture
Dec 5, 2023 · Big Data

NetEase EasyData Metric Middle Platform: Architecture, Core Technologies, and Future Plans

This article details NetEase EasyData's evolution and product matrix, explains why a metric middle platform is needed, describes its core technical architecture—including a unified logical semantic model, a custom metric query language, and engine decoupling—and outlines future development directions.

AnalyticsBig DataData Governance
0 likes · 12 min read
NetEase EasyData Metric Middle Platform: Architecture, Core Technologies, and Future Plans
DataFunTalk
DataFunTalk
Nov 28, 2023 · Big Data

Xiaomi Metric System Construction and Management Best Practices

This article presents Xiaomi's comprehensive metric system framework, covering its definition, business pain points, the OSM and MECE methodologies, model design principles, data warehouse construction, metric management, and future outlook, illustrating how a unified data platform drives efficient business decision‑making.

Business IntelligenceData GovernanceData Warehouse
0 likes · 10 min read
Xiaomi Metric System Construction and Management Best Practices
Data Thinking Notes
Data Thinking Notes
Nov 26, 2023 · Fundamentals

How a Large Enterprise Overcame Master Data Chaos: A Practical Case Study

This article outlines a real‑world enterprise master data project, detailing the definition of master data, the four critical data‑quality challenges faced, the comprehensive solution framework with executive backing, and the six measurable outcomes that improved data governance, efficiency, and decision‑making across the organization.

Data GovernanceData QualityMaster Data
0 likes · 10 min read
How a Large Enterprise Overcame Master Data Chaos: A Practical Case Study
DataFunTalk
DataFunTalk
Nov 23, 2023 · Big Data

Tencent PCG Data Governance System: Architecture, Asset Scoring, and One‑Stop Governance Platform

The article presents Tencent PCG's comprehensive data governance solution, detailing the challenges of massive, heterogeneous data, the four‑chapter framework covering governance overview, meta‑warehouse construction, an open asset‑scoring system, and a one‑stop governance workbench, and explains how lineage, scoring, and rule‑engine mechanisms enable cost‑effective, continuous data governance.

Asset ScoringBig DataData Governance
0 likes · 14 min read
Tencent PCG Data Governance System: Architecture, Asset Scoring, and One‑Stop Governance Platform
DataFunTalk
DataFunTalk
Nov 20, 2023 · Big Data

Automated Data Governance and Optimization with Volcano Engine DataLeap: Challenges, Solutions, and Benefits

This article examines the challenges faced by Volcano Engine's DataLeap in computational governance, outlines automated solutions such as real‑time rule engines and monitoring, and presents concrete performance and cost benefits achieved through resource optimization across large‑scale Spark and Hadoop workloads.

AutomationBig DataData Governance
0 likes · 13 min read
Automated Data Governance and Optimization with Volcano Engine DataLeap: Challenges, Solutions, and Benefits
Data Thinking Notes
Data Thinking Notes
Nov 19, 2023 · Fundamentals

How to Build an Effective Data Asset Management Framework for Enterprises

This article explains why enterprises need a data asset framework, outlines its key components such as catalog management, policy support, and development trends, and provides a step‑by‑step guide with visual diagrams for constructing and operating a comprehensive data asset management system.

Data CatalogData GovernanceData Quality
0 likes · 5 min read
How to Build an Effective Data Asset Management Framework for Enterprises
HomeTech
HomeTech
Nov 15, 2023 · Industry Insights

How to Build Accurate Data Asset Lineage for Data Warehouse Governance

This article explains the challenges of data asset lineage in large data warehouses, presents a comprehensive approach using business‑level instrumentation, SQL interceptor plugins, and ETL script parsing to generate fine‑grained lineage graphs, and demonstrates measurable improvements in coverage and zombie‑table cleanup.

Data GovernanceData LineageData Quality
0 likes · 18 min read
How to Build Accurate Data Asset Lineage for Data Warehouse Governance
Data Thinking Notes
Data Thinking Notes
Nov 14, 2023 · Big Data

How Financial Institutions Master Data Governance for Digital Transformation

This article examines why data governance has become a critical pillar for Chinese financial institutions, outlining external regulations and internal business drivers, describing a comprehensive governance architecture, and presenting a detailed case study of a securities company's data‑asset inventory, platform implementation, and quality management.

Big DataData GovernanceData Quality
0 likes · 16 min read
How Financial Institutions Master Data Governance for Digital Transformation
Architects Research Society
Architects Research Society
Nov 14, 2023 · Fundamentals

MIKE2.0 Method: An Open‑Source Approach to Information Development

The MIKE2.0 method is an open‑source, integrated‑knowledge‑environment framework that guides enterprises through information development, data governance, and architecture via five structured phases, key structures such as SAFE, and practical task outputs, while also offering community resources and implementation guidance.

Data GovernanceInformation DevelopmentMIKE2.0
0 likes · 7 min read
MIKE2.0 Method: An Open‑Source Approach to Information Development
DataFunSummit
DataFunSummit
Nov 10, 2023 · Operations

Data Model Governance Practices at Taobao (Tao Tian Group)

This article presents a comprehensive overview of Taobao's data model governance, covering background challenges, a four‑pillar solution framework, detailed practices such as invalid table decommissioning, source‑table consolidation, data handover, public‑layer operations, incremental control, productization, and future planning to improve efficiency, cost, and quality of large‑scale data models.

Data Governancemetadatamodel governance
0 likes · 26 min read
Data Model Governance Practices at Taobao (Tao Tian Group)
Data Thinking Notes
Data Thinking Notes
Nov 9, 2023 · Big Data

How to Build a Scalable Data Governance System for Massive E‑Commerce Warehouses

This article outlines the challenges of ultra‑large e‑commerce data warehouses—such as SLA pressure, model instability, soaring resource costs, low governance efficiency, and fragmented processes—and presents a one‑stop, tiered data‑governance framework with stability, cost, and efficiency subsystems that drives distributed autonomous governance and measurable business value.

AutomationBig DataCost Optimization
0 likes · 11 min read
How to Build a Scalable Data Governance System for Massive E‑Commerce Warehouses
DataFunSummit
DataFunSummit
Nov 6, 2023 · Big Data

Building and Managing Huolala's User Event Tracking System: Architecture, Governance, and Monitoring

This article details Huolala's user event tracking (埋点) system, covering its background, challenges, the construction of a four‑module management platform, backend SDK design, monitoring and quality assurance mechanisms, and future plans for service integration, data lineage, and governance optimization.

Data Governancebackend SDKdata pipeline
0 likes · 16 min read
Building and Managing Huolala's User Event Tracking System: Architecture, Governance, and Monitoring
Data Thinking Notes
Data Thinking Notes
Nov 5, 2023 · Fundamentals

Why Poor Data Quality Costs Companies $15M Annually and How to Fix It

Low‑quality data can cost enterprises up to $15 million each year, making data quality management essential for accurate decision‑making, compliance, and operational efficiency, and this article explains its importance, evaluation dimensions, common issues, monitoring metrics, responsible roles, and a three‑phase management framework of prevention, control, and remediation.

Big DataBusiness IntelligenceData Governance
0 likes · 32 min read
Why Poor Data Quality Costs Companies $15M Annually and How to Fix It
ByteDance Data Platform
ByteDance Data Platform
Nov 1, 2023 · Big Data

How a Leading E‑Commerce Platform Solves EB‑Scale Data Governance Challenges

Facing massive data volumes and strict SLA requirements during the Double 11 shopping festival, a major e‑commerce platform built a systematic data‑governance framework that addresses quality, stability, cost, and efficiency through multi‑layered grading, digital cost models, automated tools, and full‑lifecycle management.

Big DataCost OptimizationData Governance
0 likes · 23 min read
How a Leading E‑Commerce Platform Solves EB‑Scale Data Governance Challenges
Data Thinking Notes
Data Thinking Notes
Oct 31, 2023 · Information Security

Why Data Classification & Grading Is Critical for Enterprise Security

This article explains the legal and strategic importance of data classification and grading in China, outlines the relevant regulations, describes the principles and processes for implementing classification, and offers practical steps for enterprises to secure data while meeting compliance and business needs.

Data GovernanceEnterprise Compliancedata classification
0 likes · 11 min read
Why Data Classification & Grading Is Critical for Enterprise Security
DataFunTalk
DataFunTalk
Oct 28, 2023 · Big Data

Insights from the First Data Governance Forum: Challenges, Opportunities, and the Role of Large AI Models

The first Data Governance Forum in Shanghai highlighted the intertwined challenges of data quality, compliance, and integration, emphasized the mutual reinforcement between large AI models and data governance, and presented perspectives from industry, academia, and legal experts on how to advance data as a strategic production factor.

Data GovernanceData Marketenterprise strategy
0 likes · 21 min read
Insights from the First Data Governance Forum: Challenges, Opportunities, and the Role of Large AI Models
Big Data Technology & Architecture
Big Data Technology & Architecture
Oct 23, 2023 · Big Data

Bilibili Data Quality Assurance: Architecture, Goals, Core Capabilities, and Future Outlook

This article outlines Bilibili's data quality assurance framework, detailing its evolution across four development stages, the current data platform architecture, identified pain points, four key quality objectives, core capabilities such as a quality data warehouse, comprehensive monitoring, digital optimization, fault handling, and future directions.

Big DataData GovernanceData Platform
0 likes · 22 min read
Bilibili Data Quality Assurance: Architecture, Goals, Core Capabilities, and Future Outlook
Data Thinking Notes
Data Thinking Notes
Oct 22, 2023 · Big Data

Boosting Big Data Governance Capabilities for Digital Transformation

This article outlines how enterprises can enhance their big data governance capabilities during digital transformation, covering the background and challenges of data governance, the emergence of data capability as a core competency with implementation paths, and practical suggestions for governance projects, illustrated with national-level examples.

Big DataData GovernanceDigital Transformation
0 likes · 3 min read
Boosting Big Data Governance Capabilities for Digital Transformation
DataFunSummit
DataFunSummit
Oct 20, 2023 · Big Data

Tencent OLA t‑Metric Metric Platform: Headless BI Practices and Architecture

The article introduces Tencent's OLA data‑governance platform and its t‑Metric metric middle‑platform, explains the Headless BI concept, details the configuration‑driven metric production workflow, core capabilities, architecture, unified query service, ecosystem integration, and answers audience questions about real‑time analysis, dimension handling, and trust mechanisms.

Data GovernanceDataOpsHeadless BI
0 likes · 21 min read
Tencent OLA t‑Metric Metric Platform: Headless BI Practices and Architecture
dbaplus Community
dbaplus Community
Oct 14, 2023 · Big Data

What Is a Data Warehouse? From Basics to Modern Practices

This article explains what a data warehouse is, contrasts it with traditional databases, outlines the evolution from classic to internet‑scale warehouses, details modeling approaches and layered architectures, discusses KPI dictionaries, date dimensions, naming standards, data governance, incremental loading techniques, and upstream/downstream coordination.

Big DataData GovernanceETL
0 likes · 25 min read
What Is a Data Warehouse? From Basics to Modern Practices
DataFunSummit
DataFunSummit
Oct 11, 2023 · Big Data

Tencent Oula Data Asset Suite: End‑to‑End Data Production and Governance Framework

The article presents Tencent Oula’s comprehensive data‑asset platform that integrates data collection, integration, warehouse and metric modeling, governance engines, and AI‑enabled analytics to reduce information entropy, standardize assets, and enable production‑as‑governance across the modern data stack.

AI for BIData Governancedata ops
0 likes · 21 min read
Tencent Oula Data Asset Suite: End‑to‑End Data Production and Governance Framework
DataFunTalk
DataFunTalk
Oct 7, 2023 · Big Data

Alibaba DataWorks Data Stability Governance: Challenges, Solutions, and Practices

This article presents Alibaba's experience in addressing large‑scale data stability challenges by outlining common problems, governance principles, baseline monitoring, team collaboration methods, practical implementations, and proactive measures to ensure reliable and accurate data production on the DataWorks platform.

AlibabaBig DataData Governance
0 likes · 12 min read
Alibaba DataWorks Data Stability Governance: Challenges, Solutions, and Practices
iQIYI Technical Product Team
iQIYI Technical Product Team
Sep 22, 2023 · Big Data

Data Lake: Concepts, Architecture, and Application in iQIYI's Data Platform

iQIYI’s data‑middle‑platform team built a four‑zone data lake—raw, product, work, and sensitive—integrated with unified ODS/DWD/MID layers, a metadata catalog, and self‑service tools, leveraging HDFS, Hive/Iceberg, Spark/Trino, and Flink, migrated to Apache Iceberg for real‑time freshness, and now aims to further streamline modules and adopt new technologies.

Apache IcebergData GovernanceData Lake
0 likes · 13 min read
Data Lake: Concepts, Architecture, and Application in iQIYI's Data Platform
DataFunSummit
DataFunSummit
Sep 14, 2023 · Big Data

Data Governance Practices for E‑commerce Platforms Using Volcano Engine DataLeap

The article presents Volcano Engine DataLeap's comprehensive data‑governance framework for e‑commerce platforms, covering challenges of large‑scale warehouses, a top‑level governance architecture, systematic stability, cost, and tool efficiency systems, and detailed implementation steps to achieve autonomous, distributed governance.

Data Governance
0 likes · 19 min read
Data Governance Practices for E‑commerce Platforms Using Volcano Engine DataLeap
DataFunTalk
DataFunTalk
Sep 13, 2023 · Big Data

Design and Implementation of a Lakehouse Data Platform Based on Apache Hudi at Taikang Life Insurance

This article details Taikang Life Insurance's end‑to‑end technical selection, architecture design, implementation, and custom enhancements of an Apache Hudi‑driven lakehouse platform for large‑scale health‑insurance data, covering background, component evaluation, performance benchmarking, multi‑layer architecture, and real‑world results.

Apache HudiBig DataData Governance
0 likes · 44 min read
Design and Implementation of a Lakehouse Data Platform Based on Apache Hudi at Taikang Life Insurance
DataFunTalk
DataFunTalk
Sep 12, 2023 · Big Data

Building an Intelligent Data Governance Platform at NetEase Cloud Music: Architecture, Practices, and Future Plans

This article presents a comprehensive case study of NetEase Cloud Music’s metadata‑driven intelligent governance platform, detailing its scale, construction background, modular architecture, rule‑based automation, practical deployment, and future roadmap for sustainable data ecosystem management.

AutomationBig DataData Governance
0 likes · 22 min read
Building an Intelligent Data Governance Platform at NetEase Cloud Music: Architecture, Practices, and Future Plans
DataFunTalk
DataFunTalk
Sep 10, 2023 · Big Data

Ping An Life Insurance’s Data Middle Platform Construction Practice

The presentation details Ping An Life’s four‑stage data middle‑platform initiative—defining data capability as the foundation of digital transformation, outlining the platform’s architecture and governance, showcasing business‑value applications, and discussing talent and cultural considerations—to illustrate how a large insurer builds a scalable, real‑time data ecosystem.

Big DataData GovernanceDigital Transformation
0 likes · 9 min read
Ping An Life Insurance’s Data Middle Platform Construction Practice
Data Thinking Notes
Data Thinking Notes
Sep 3, 2023 · Big Data

How to Build an Effective Data Governance Framework: Steps & Best Practices

This article outlines a comprehensive data governance framework for Chinese enterprises, covering organizational structures, data asset inventory, six‑stage methodology, and the creation of unified data standards and quality rules to support effective digital transformation and data‑driven decision making.

Big DataData GovernanceData Management
0 likes · 13 min read
How to Build an Effective Data Governance Framework: Steps & Best Practices
Data Thinking Notes
Data Thinking Notes
Aug 30, 2023 · Fundamentals

Mastering Data Governance: A Complete Guide to Metadata, Standards, Quality, and Security

Data governance encompasses a comprehensive framework—including metadata, master data, standards, quality, assets, exchange, security, and lifecycle management—to ensure data’s accuracy, consistency, and value across an organization, offering step‑by‑step guidance, best‑practice models, and visual references for effective implementation.

Data GovernanceData LifecycleData Quality
0 likes · 19 min read
Mastering Data Governance: A Complete Guide to Metadata, Standards, Quality, and Security
ByteDance Data Platform
ByteDance Data Platform
Aug 30, 2023 · Big Data

How We Cut Offline Data Warehouse SLA Delay from 13 Days to Zero with DataLeap

The article details how the "Xingfu Li" real‑estate platform tackled a 13‑day offline data‑warehouse SLA delay by adopting Volcano Engine's DataLeap suite, outlining the challenges, the three‑step governance process, and the measurable improvements achieved across task coverage, alert reduction, and data stability.

Big DataData GovernanceData Warehouse
0 likes · 10 min read
How We Cut Offline Data Warehouse SLA Delay from 13 Days to Zero with DataLeap
Data Thinking Notes
Data Thinking Notes
Aug 23, 2023 · Fundamentals

Mastering Data Metrics and Tags: Build Powerful Indicator Systems

This article provides a comprehensive guide to data metrics and tags, explaining their definitions, classifications, conversion methods, practical usage scenarios, and step‑by‑step approaches for building robust metric and tag systems that support strategic decision‑making and operational efficiency.

Business AnalyticsData GovernanceIndicator System
0 likes · 10 min read
Mastering Data Metrics and Tags: Build Powerful Indicator Systems
Big Data Technology & Architecture
Big Data Technology & Architecture
Aug 22, 2023 · Big Data

DataOps Practices and Challenges at ByteDance: From Model to Productization

The article summarizes ByteDance's DataOps journey, detailing its mid‑platform tool and Data BP model, core performance metrics, quality, hardware and human efficiency challenges, concrete DataOps implementation, productization through DataLeap, best‑practice promotion, and future outlook for data‑driven business value.

Big DataByteDanceData Governance
0 likes · 17 min read
DataOps Practices and Challenges at ByteDance: From Model to Productization
DataFunSummit
DataFunSummit
Aug 21, 2023 · Big Data

Data-Driven Data Governance and Asset Health Scoring at Xiaomi

This article presents Xiaomi's data‑driven governance framework, detailing a three‑level "rocket" plan, the asset health scoring model covering storage, compute, quality, security and compliance, productization efforts, future enhancements, and a Q&A session on implementation challenges.

Data Governanceasset healthdata security
0 likes · 13 min read
Data-Driven Data Governance and Asset Health Scoring at Xiaomi

How Lakehouse Architecture is Transforming Hadoop: A Deep Dive into Hudi, Iceberg, and Delta Lake

This article analyzes the rise of lake‑house architecture in the Hadoop ecosystem, compares the technical capabilities of Hudi, Iceberg and Delta Lake, details implementation enhancements such as MOR and multi‑writer support, showcases Flink integration, presents a real‑time marketing use case, and outlines future development directions.

Big DataData GovernanceDelta Lake
0 likes · 14 min read
How Lakehouse Architecture is Transforming Hadoop: A Deep Dive into Hudi, Iceberg, and Delta Lake
21CTO
21CTO
Aug 16, 2023 · Big Data

6 Must-Have Snowflake Tools to Supercharge Your Data Workflow

This guide reviews six popular Snowflake‑compatible tools—covering data preparation, visualization, integration/ETL, business intelligence, and governance—that can dramatically boost productivity for data professionals.

Business IntelligenceData GovernanceData visualization
0 likes · 11 min read
6 Must-Have Snowflake Tools to Supercharge Your Data Workflow
Data Thinking Notes
Data Thinking Notes
Aug 13, 2023 · Big Data

How to Successfully Deliver a Data Governance Project: Step‑by‑Step Guide

This article outlines a comprehensive methodology for delivering a data governance project, covering planning, blueprint design, implementation, and acceptance phases, with detailed guidance on team formation, stakeholder roles, requirement analysis, platform architecture, management processes, and post‑deployment operations.

Big DataData GovernanceData Platform
0 likes · 12 min read
How to Successfully Deliver a Data Governance Project: Step‑by‑Step Guide
Architects Research Society
Architects Research Society
Aug 5, 2023 · Big Data

Getting Started with Data Mesh: A Quick‑Start Guide

This guide introduces the concept of a data mesh, explains why modern data‑driven organizations need domain‑driven self‑serve design, outlines its three core principles, and provides a curated reading list to help teams transition from monolithic data lakes to distributed, observable data products.

Data GovernanceDistributed SystemsDomain‑Driven Design
0 likes · 10 min read
Getting Started with Data Mesh: A Quick‑Start Guide
Data Thinking Notes
Data Thinking Notes
Aug 2, 2023 · Fundamentals

Mastering Enterprise Data: A Practical Guide to Master Data Management

This article explains why fragmented data hampers business insight in large enterprises and provides a comprehensive overview of master data concepts, governance structures, standards, processes, and step‑by‑step implementation practices to achieve consistent, high‑quality enterprise data.

Data GovernanceData IntegrationEnterprise Data
0 likes · 18 min read
Mastering Enterprise Data: A Practical Guide to Master Data Management
Weimob Technology Center
Weimob Technology Center
Aug 1, 2023 · Big Data

How Weimeng Transformed Data Asset Governance: A Practical Blueprint for Enterprises

Facing fragmented metadata, unclear ownership, and costly data duplication, Weimeng implemented a comprehensive data asset governance framework—covering metadata standards, lineage visualization, metric normalization, and cost management—to boost data quality, security, and business value across its new‑retail platform.

Data GovernanceData Lineagedata operations
0 likes · 15 min read
How Weimeng Transformed Data Asset Governance: A Practical Blueprint for Enterprises
Data Thinking Notes
Data Thinking Notes
Jul 26, 2023 · Big Data

How to Build an Effective Data Asset Catalog for Enterprise Data Governance

This article explains what data assets are, why a data asset catalog is essential for data governance, and provides a step‑by‑step framework—including identification criteria, value dimensions, construction phases, tool support, and core functional modules—to help enterprises systematically create, manage, and leverage a data asset catalog.

Data AssetData CatalogData Governance
0 likes · 16 min read
How to Build an Effective Data Asset Catalog for Enterprise Data Governance
DataFunSummit
DataFunSummit
Jul 15, 2023 · Big Data

Intelligent and Automated Data Quality Management in Big Data Systems

This article explores the challenges of data quality in mature big‑data environments and presents intelligent, automated approaches—including assertions, automatic detection, rule recommendation, link checking, and collaborative mechanisms—to embed quality checks throughout the data pipeline, improving efficiency and reliability.

AutomationData GovernanceData Observability
0 likes · 18 min read
Intelligent and Automated Data Quality Management in Big Data Systems
Data Thinking Notes
Data Thinking Notes
Jul 12, 2023 · Fundamentals

Why Metadata Governance Is the Backbone of Modern Data Platforms

This article explains how metadata serves as essential infrastructure for data platforms, detailing Huawei's classification framework, governance challenges, management architecture, integrated modeling, data lake handling, service management, and data map construction to bridge business and IT domains.

Data GovernanceData LakeData Management
0 likes · 24 min read
Why Metadata Governance Is the Backbone of Modern Data Platforms
AntTech
AntTech
Jul 6, 2023 · Industry Insights

Unlocking AI Value: Data Quality, Privacy, and Blockchain in the Smart Era

The article examines how high‑quality data, robust privacy protection, and blockchain‑enabled trust infrastructure are essential for unlocking the value of AI models, citing market forecasts, examples from smart‑car and fintech firms, and the growing Chinese big‑data market through 2026.

AIBig DataBlockchain
0 likes · 9 min read
Unlocking AI Value: Data Quality, Privacy, and Blockchain in the Smart Era
Data Thinking Notes
Data Thinking Notes
Jul 5, 2023 · Big Data

Top 10 Big Data Trends Shaping China’s Data Industry in 2023

At the 2023 Big Data Industry Development Conference in Beijing, the China Communications Standards Association unveiled the top ten big‑data keywords, highlighting trends such as lake‑warehouse integration, data assetization, DataOps, intelligent analytics, data ethics, security, public data licensing, and cross‑border data flows.

Big DataData EthicsData Governance
0 likes · 16 min read
Top 10 Big Data Trends Shaping China’s Data Industry in 2023
Data Thinking Notes
Data Thinking Notes
Jul 2, 2023 · Big Data

Mastering Data Governance: A Comprehensive Framework for Enterprise Success

This article outlines a complete data governance framework, detailing the five managerial domains—control, process, governance, technology, and value—along with strategies for data strategy, organizational structure, policies, processes, standards, quality, security, and platform tools, and highlights AI’s pivotal role in enhancing governance efficiency.

Big DataData GovernanceData Quality
0 likes · 10 min read
Mastering Data Governance: A Comprehensive Framework for Enterprise Success
DataFunTalk
DataFunTalk
Jul 2, 2023 · Big Data

Bilibili Data Service Middle Platform: Architecture, Practices, and Future Roadmap

This article presents Bilibili's data service middle platform, detailing its background, one‑stop data service architecture, core processes, model and API construction, query mechanisms, full‑link control, cost‑reduction, high‑availability strategies, achieved results, and future roadmap.

Big DataData GovernanceData Service
0 likes · 18 min read
Bilibili Data Service Middle Platform: Architecture, Practices, and Future Roadmap
DataFunSummit
DataFunSummit
Jun 29, 2023 · Big Data

iQIYI Data Link Governance: Offline and Real‑time Pipeline Management and Exploration

This article presents iQIYI’s comprehensive data link governance practice, covering the motivations, offline and real‑time pipeline governance strategies, monitoring mechanisms, data lineage, and exploratory work such as intelligent attribution and field‑level lineage to improve data accuracy, timeliness, and reliability.

Data GovernanceData LineageiQIYI
0 likes · 11 min read
iQIYI Data Link Governance: Offline and Real‑time Pipeline Management and Exploration
DataFunTalk
DataFunTalk
Jun 14, 2023 · Big Data

Active Data Governance with Operator-Level Lineage: Practices and Exploration

This article presents Big Data company's active data governance practice using operator-level lineage, detailing the shortcomings of traditional lineage, the implementation of indicator chain governance, and the exploration of proactive model governance to achieve smarter, more precise data management.

Big DataData GovernanceData Warehouse
0 likes · 14 min read
Active Data Governance with Operator-Level Lineage: Practices and Exploration
Data Thinking Notes
Data Thinking Notes
Jun 11, 2023 · Product Management

How to Score Data Tags for Better Governance and Resource Optimization

This article explains why tag scoring is essential for data governance, outlines a five‑dimensional scoring model—including usage, attention, quality, continuous optimization, and security—and demonstrates how the scores can drive dashboards, alerts, and resource‑saving decisions.

Data GovernanceMetricsResource Optimization
0 likes · 9 min read
How to Score Data Tags for Better Governance and Resource Optimization
Huolala Safety Emergency Response Center
Huolala Safety Emergency Response Center
Jun 9, 2023 · Information Security

How Huolala Built a Robust Big Data Security Framework: Lessons & Practices

This article presents a detailed case study of Huolala's big data security architecture, covering background challenges, lifecycle‑wide protection standards, data classification, encryption, disaster recovery, governance processes, and future improvement plans to enhance data asset protection and compliance.

Data GovernanceHuolalaSecurity Architecture
0 likes · 10 min read
How Huolala Built a Robust Big Data Security Framework: Lessons & Practices
Huolala Tech
Huolala Tech
Jun 8, 2023 · Big Data

How Huolala Built a Robust Big Data Security Framework: Lessons and Practices

This article details Huolala's practical experience in constructing a comprehensive big data security system, covering data lifecycle protection, classification standards, capability development, and governance, while balancing regulatory compliance and business growth.

Big DataData Governancecloud infrastructure
0 likes · 10 min read
How Huolala Built a Robust Big Data Security Framework: Lessons and Practices
Data Thinking Notes
Data Thinking Notes
Jun 4, 2023 · Big Data

How Distributed Lakehouse Architecture Solves Data Swamp Challenges

This article examines the explosion of heterogeneous data sources, the limitations of traditional data lakes and warehouses, and proposes a distributed lakehouse architecture that integrates advanced management layers to improve data governance, reliability, and support both SQL and advanced analytics workloads.

Data GovernanceData LakeData Warehouse
0 likes · 29 min read
How Distributed Lakehouse Architecture Solves Data Swamp Challenges
Ziru Technology
Ziru Technology
Jun 2, 2023 · Information Security

Mastering Data Classification & Grading: Ziroom’s Compliance Blueprint

This article explains how Ziroom implements a comprehensive data classification and grading system to meet the 2021 Data Security Law, improve risk management, optimize security resources, and boost user trust through automated tools, multi‑level categorization, and continuous manual verification.

Data Governancecompliancedata classification
0 likes · 12 min read
Mastering Data Classification & Grading: Ziroom’s Compliance Blueprint
Data Thinking Notes
Data Thinking Notes
May 31, 2023 · Big Data

Why Data Lineage Is Essential: From Concepts to Practical Implementation

This article explains what data lineage is, its components, why it matters for data quality, security, and operational efficiency, and provides a comprehensive implementation guide covering open‑source tools, commercial platforms, custom builds, graph‑database modeling, automatic and manual lineage capture, visualization, analytics, and evaluation metrics.

Data GovernanceData LineageETL
0 likes · 18 min read
Why Data Lineage Is Essential: From Concepts to Practical Implementation
Data Thinking Notes
Data Thinking Notes
May 28, 2023 · Operations

Why Do State‑Owned Enterprises Struggle with Digital Transformation? Key Challenges and Solutions

This analysis examines why Chinese state‑owned enterprises face unclear digital‑transformation goals, weak strategic positioning, fragmented data, talent shortages, and inadequate technology ecosystems, and it outlines the root causes, typical case studies, and recommended actions to achieve effective digital change.

Data GovernanceDigital TransformationOperations
0 likes · 16 min read
Why Do State‑Owned Enterprises Struggle with Digital Transformation? Key Challenges and Solutions
Data Thinking Notes
Data Thinking Notes
May 21, 2023 · Information Security

Why Government Data Sharing Stalls and How a “Three‑Rights” Model Can Unlock It

The article analyzes why government data sharing often fails—citing legal, technical, security, and organizational hurdles—then outlines one‑to‑one and centralized sharing models, highlights four critical success factors, and proposes a “three‑rights” framework supported by blockchain to create trustworthy, sustainable inter‑departmental data exchange.

Big DataBlockchainData Governance
0 likes · 11 min read
Why Government Data Sharing Stalls and How a “Three‑Rights” Model Can Unlock It
Data Thinking Notes
Data Thinking Notes
May 17, 2023 · Big Data

Inside Wing Pay’s Scalable Big Data Platform: Architecture & Governance

This article details how Wing Pay built a comprehensive data development and governance platform, covering company background, business scenarios, goals, challenges, task development workflow, task types, SparkSQL editor features, double‑environment deployment, Airflow scheduling, DataX data bus, resource isolation, compute optimization, data quality monitoring, cloud‑native practices, future outlook, and a Q&A on data permissions and governance.

AirflowBig DataCloud Native
0 likes · 17 min read
Inside Wing Pay’s Scalable Big Data Platform: Architecture & Governance
Data Thinking Notes
Data Thinking Notes
May 14, 2023 · Big Data

Why Data Governance Matters: Boosting Data Quality and Business Value

Data governance, the overarching framework for evaluating, guiding, and supervising an organization’s data lifecycle—from collection to utilization—ensures high data quality, compliance, and security, ultimately maximizing data value and supporting AI-driven initiatives, while distinguishing itself from data management and data control through a strategic, top‑down approach.

Big DataData GovernanceData Management
0 likes · 8 min read
Why Data Governance Matters: Boosting Data Quality and Business Value
DataFunSummit
DataFunSummit
May 13, 2023 · Big Data

Expert Interview on Data Governance: Core Domains, Challenges, and Future Trends

In this interview, three data‑governance experts from Tencent, ByteDance, and Alibaba discuss the fundamental processes, core domains such as metadata, data lineage, metric systems, data quality and security, the main challenges they face, and emerging trends like DataOps, AI‑driven automation, and privacy‑preserving technologies.

Data GovernanceData LineageData Quality
0 likes · 14 min read
Expert Interview on Data Governance: Core Domains, Challenges, and Future Trends
Data Thinking Notes
Data Thinking Notes
May 7, 2023 · Big Data

How Financial Institutions Can Master Data‑Driven Transformation in 2024

This article examines two decades of data warehouse evolution in the financial sector, identifies persistent pain points such as platform lag, data quality, and low service efficiency, and proposes a cloud‑native, data‑centric framework—including a unified blueprint, three‑layer architecture, and six core capabilities—to accelerate enterprise‑wide data capability building and drive high‑quality digital growth.

Big DataCloud NativeData Governance
0 likes · 18 min read
How Financial Institutions Can Master Data‑Driven Transformation in 2024
Top Architect
Top Architect
May 4, 2023 · Big Data

Data Middle Platform: General Architecture and Core Components

The article explains the concept, benefits, and detailed modular architecture of a data middle platform, covering data storage, acquisition, processing, governance, security, and operation frameworks, and illustrates how enterprises can build and evolve such platforms to turn data into valuable services.

Big DataData ArchitectureData Governance
0 likes · 19 min read
Data Middle Platform: General Architecture and Core Components
Data Thinking Notes
Data Thinking Notes
Apr 25, 2023 · Operations

Why Data Quality Matters: A Practical Guide to Governance and Seven‑Dimensional Evaluation

This article explains why data quality is critical for businesses, outlines common data quality problems, their root causes, and presents a comprehensive governance framework—including monitoring rules, alerting, full‑link monitoring, and a seven‑dimensional evaluation model—to ensure high‑quality data delivery.

Big DataData GovernanceData Quality
0 likes · 12 min read
Why Data Quality Matters: A Practical Guide to Governance and Seven‑Dimensional Evaluation
DataFunSummit
DataFunSummit
Apr 23, 2023 · Fundamentals

Data Governance Practices and Implementation Path at Dipu Technology

This article presents Dipu Technology's comprehensive data governance methodology, covering construction paths, a typical enterprise digital platform framework, core governance components, practical case studies, and a Q&A session that together illustrate how businesses can design, implement, and sustain effective data governance across the organization.

Data CatalogData GovernanceData Management
0 likes · 19 min read
Data Governance Practices and Implementation Path at Dipu Technology
Data Thinking Notes
Data Thinking Notes
Apr 19, 2023 · Big Data

How Bilibili Transformed Big Data Governance: From Reactive Storage Management to Proactive Multi‑Dimensional Control

This article details Bilibili's evolution of big data governance, describing the early data growth challenges, the launch of the "Wanglou" project, the development of asset metadata and governance indicator frameworks, storage cost reduction strategies, scoring models, and the shift from passive, single‑point fixes to proactive, multi‑dimensional governance across the organization.

Big DataBilibiliCost Management
0 likes · 22 min read
How Bilibili Transformed Big Data Governance: From Reactive Storage Management to Proactive Multi‑Dimensional Control
DataFunSummit
DataFunSummit
Apr 19, 2023 · Fundamentals

Data Governance Construction Path and Practice by Dipu Technology

The article presents Dipu Technology's comprehensive approach to data governance, outlining construction pathways, a typical enterprise digital platform framework, core governance concepts, implementation steps, case studies, and a Q&A session that together illustrate how to design, execute, and sustain effective data governance across business domains.

Data GovernanceData ManagementEnterprise Data
0 likes · 22 min read
Data Governance Construction Path and Practice by Dipu Technology
Big Data Technology & Architecture
Big Data Technology & Architecture
Apr 17, 2023 · Big Data

Comprehensive Guide to Data Governance and Data Asset Management

This article presents a detailed roadmap for enterprise data governance, covering business digitization goals, data governance construction, typical digital platform architecture, core governance actions, implementation pathways, data asset inventory techniques, and real‑world case studies to illustrate practical execution.

Big DataData Asset ManagementData Governance
0 likes · 18 min read
Comprehensive Guide to Data Governance and Data Asset Management
Data Thinking Notes
Data Thinking Notes
Apr 16, 2023 · Big Data

Mastering Data Asset Management: From Inventory to Value Realization

This article outlines a complete data asset management lifecycle—starting with data inventory, moving through governance, classification, responsibility, permission, and security, and culminating in value realization via basic services, profiling, and algorithmic models—providing practical guidance for building a robust big‑data platform.

Big DataData GovernanceData Quality
0 likes · 10 min read
Mastering Data Asset Management: From Inventory to Value Realization
DataFunSummit
DataFunSummit
Apr 16, 2023 · Fundamentals

Why Metric Management Matters and How to Build an Effective Metric Management System

This article explains the importance of metric management for unified data language, consistent data production, and increased metric usage, and outlines a three‑part system covering business‑process metricization, standardization of naming, scope and lifecycle, and operationalization of metrics.

Business IntelligenceData Governancedata operations
0 likes · 9 min read
Why Metric Management Matters and How to Build an Effective Metric Management System
ITPUB
ITPUB
Apr 15, 2023 · Big Data

How Bilibili Turned Big Data Governance from Reactive to Proactive

This article details Bilibili's journey from a late‑started, reactive big‑data platform to a mature, proactive governance system that combines asset metadata, metric‑driven strategies, cost‑aware billing, and automated tooling to achieve massive storage savings and operational efficiency across the organization.

Big DataCost OptimizationData Governance
0 likes · 22 min read
How Bilibili Turned Big Data Governance from Reactive to Proactive
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Apr 14, 2023 · Information Security

How to Build a Secure Enterprise Data Platform: End‑to‑End Architecture and Controls

This article explains the security risks of enterprise data platforms, analyzes gaps in traditional protection methods, and presents a comprehensive three‑layer security architecture—asset, capability, and control layers—along with pre‑, during‑, and post‑process measures to ensure data safety throughout its lifecycle.

Data GovernanceEnterprise Data PlatformSecurity Architecture
0 likes · 19 min read
How to Build a Secure Enterprise Data Platform: End‑to‑End Architecture and Controls
Bilibili Tech
Bilibili Tech
Apr 11, 2023 · Big Data

Bilibili Big Data Governance: From Reactive Storage Management to Proactive Multi‑Dimensional Governance

Bilibili’s exabyte‑scale big‑data platform, after rapid growth created fragmented ownership and costly storage, launched the Wanglou project to build a metadata‑driven, indicator‑based governance framework that cut storage use by half, introduced compliance scoring and automation, and now plans to extend proactive, multi‑dimensional governance to compute, traffic and lake‑house resources.

BilibiliData GovernanceStorage Optimization
0 likes · 21 min read
Bilibili Big Data Governance: From Reactive Storage Management to Proactive Multi‑Dimensional Governance
Data Thinking Notes
Data Thinking Notes
Apr 9, 2023 · Big Data

Why Data Quality Is the Hidden Driver of Big Data Success

In the big‑data era, high‑quality data are essential for reliable analytics, and this article explains data‑quality concepts, key dimensions, analysis methods for missing values, outliers, inconsistencies and duplicates, as well as practical management practices to ensure data assets become a competitive advantage.

Big DataData GovernanceData Management
0 likes · 15 min read
Why Data Quality Is the Hidden Driver of Big Data Success
DataFunSummit
DataFunSummit
Apr 8, 2023 · Big Data

DataCake: A Multi‑Cloud Self‑Service Big Data Platform from SHAREit Group

The article introduces DataCake, a cloud‑native, multi‑cloud big data platform built by SHAREit Group that addresses massive data volume, diverse application scenarios, and governance challenges through a Data Mesh‑inspired self‑service architecture, offering unified data management, intelligent governance, and a roadmap for future enhancements.

Data GovernanceData MeshSelf-Service Platform
0 likes · 10 min read
DataCake: A Multi‑Cloud Self‑Service Big Data Platform from SHAREit Group
Data Thinking Notes
Data Thinking Notes
Apr 5, 2023 · Big Data

Mastering Data Governance: From Challenges to End‑to‑End Solutions

This article explores the key problems data governance aims to solve, outlines a comprehensive governance framework, and details practical implementation steps—including tool integration, metadata management, lake‑in and lake‑out processes, and governance policies—to achieve a closed‑loop, value‑driven data ecosystem.

Big DataData GovernanceData Lake
0 likes · 13 min read
Mastering Data Governance: From Challenges to End‑to‑End Solutions
Aikesheng Open Source Community
Aikesheng Open Source Community
Apr 3, 2023 · Databases

SQL Quality Management with Open-Source SQLE: Insights from the 2023 DAMS China Data Intelligence Management Summit

The 2023 DAMS China Data Intelligence Management Summit in Shanghai featured a technical presentation by Zhang Shenbo on an open‑source SQLE solution for SQL quality control, covering multi‑database auditing, automated review workflows, and practical tips to reduce DBA workload and cross‑department communication.

Data GovernanceData QualityDatabase Management
0 likes · 3 min read
SQL Quality Management with Open-Source SQLE: Insights from the 2023 DAMS China Data Intelligence Management Summit
Data Thinking Notes
Data Thinking Notes
Apr 2, 2023 · Fundamentals

Transforming Bank Data: A Practical Guide to Data Governance and Quality Management

This article explains how modern commercial banks can turn massive operational data into a strategic asset by building a comprehensive data governance framework that addresses data standards, quality management, metadata, master data, and security, while outlining a six‑step methodology for continuous improvement.

BankingData GovernanceData Quality
0 likes · 18 min read
Transforming Bank Data: A Practical Guide to Data Governance and Quality Management
Efficient Ops
Efficient Ops
Mar 30, 2023 · Cloud Computing

How China Merchants Bank Completed Full Cloud Migration and Cut Costs

China Merchants Bank became the first among China's top system‑important banks to fully migrate all debit, credit, corporate accounts and applications to the cloud, detailing the multi‑year effort, architectural overhaul, cost savings, and the strategic shift toward open, distributed systems.

Data Governancebanking digital transformationcloud computing
0 likes · 10 min read
How China Merchants Bank Completed Full Cloud Migration and Cut Costs
Data Thinking Notes
Data Thinking Notes
Mar 26, 2023 · Big Data

Why Data Governance Is the Key to Unlocking Your Data’s True Value

This article explains how effective data governance transforms raw data into a trusted enterprise asset, outlines common pitfalls such as backward and passive governance, and presents a structured, four‑phase approach—including organizational setup, standards, platform selection, and continuous operations—to successfully implement data governance at scale.

Big DataData GovernanceData Management
0 likes · 10 min read
Why Data Governance Is the Key to Unlocking Your Data’s True Value
Volcano Engine Developer Services
Volcano Engine Developer Services
Mar 22, 2023 · Fundamentals

How ByteDance Scales Data Governance: Challenges, Distributed Solutions, and Best Practices

This article examines ByteDance's data governance journey, outlining business, organizational, and cultural challenges, the six-stage evolution framework, real‑world case studies, and the shift from centralized to distributed autonomous governance to improve quality, security, cost, and team efficiency.

Big DataData GovernanceData Quality
0 likes · 18 min read
How ByteDance Scales Data Governance: Challenges, Distributed Solutions, and Best Practices
Data Thinking Notes
Data Thinking Notes
Mar 19, 2023 · Big Data

Why Data Quality Is the Key to Successful Big Data Initiatives

The article explains that while big data aims to boost organizational insight and innovation, its true value depends on high data quality, outlines industry standards, identifies technical, business, and management causes of poor quality, and proposes a three‑phase strategy of prevention, monitoring, and post‑improvement to ensure reliable data for decision‑making.

Big DataData GovernanceData Quality
0 likes · 21 min read
Why Data Quality Is the Key to Successful Big Data Initiatives
Airbnb Technology Team
Airbnb Technology Team
Mar 17, 2023 · Information Security

Airbnb Data Privacy and Security Engineering: Automated Data Protection Service Overview

Airbnb’s Data Protection Service unifies privacy and security metadata, offering APIs that automate annotation verification, export and IDL validation, data‑subject‑rights orchestration, and secret‑leak detection, while assigning ownership, minimizing manual effort, and ensuring global, consistent compliance across the platform.

AirbnbCI checksData Governance
0 likes · 13 min read
Airbnb Data Privacy and Security Engineering: Automated Data Protection Service Overview
Data Thinking Notes
Data Thinking Notes
Mar 14, 2023 · Fundamentals

Practical Data Governance Guide for SMEs: Strategies, Steps, and Tools

This article explains why data governance matters for small‑to‑medium enterprises, outlines its four key values, describes essential governance components, and provides a step‑by‑step framework—including timing, roles, standards, execution mechanisms, tools, and common pitfalls—to help organizations implement effective data governance.

Data GovernanceData ManagementPerformance Monitoring
0 likes · 16 min read
Practical Data Governance Guide for SMEs: Strategies, Steps, and Tools
Data Thinking Notes
Data Thinking Notes
Mar 12, 2023 · Big Data

Why Data Middle Platforms Are Evolving: New Trends in Data Governance and DataOps

The article examines how China's data middle platform concept is reshaping enterprise data strategy, highlighting a shift toward value‑driven adoption, the intertwined relationship with data governance, and emerging trends such as fine‑grained business governance, full‑link monitoring, integrated platforms, and DataOps.

Big DataData GovernanceData Middle Platform
0 likes · 9 min read
Why Data Middle Platforms Are Evolving: New Trends in Data Governance and DataOps
ITPUB
ITPUB
Mar 10, 2023 · Databases

How ICBC Secures MySQL at Scale: Insights from a Senior Database Architect

In this interview, ICBC senior manager Wei Yadong shares the bank's challenges with massive data, the five‑point criteria for database selection, the DevOps‑driven MySQL governance framework, evolving security demands, future database trends for finance, and practical advice for database professionals.

Data GovernanceDatabase ManagementDevOps
0 likes · 16 min read
How ICBC Secures MySQL at Scale: Insights from a Senior Database Architect
Data Thinking Notes
Data Thinking Notes
Mar 8, 2023 · Fundamentals

How BI Portals Transform Enterprise Data Governance for Scalable Analytics

This whitepaper explains why effective BI governance is essential for modern enterprises, outlines the key capabilities of data‑governance tools—including data quality, certification, usage statistics, classification, lineage, glossary, and lifecycle management—and shows how BI portals and data catalogs together enable scalable, user‑centric analytics.

AnalyticsBI governanceBI portal
0 likes · 12 min read
How BI Portals Transform Enterprise Data Governance for Scalable Analytics
Architects Research Society
Architects Research Society
Mar 8, 2023 · Big Data

Understanding DataOps: Principles, Benefits, and Implementation

DataOps, rooted in agile and DevOps philosophies, uses automation and collaborative practices to streamline data processing, improve quality, and align analytics with business goals, offering continuous analytics, faster insights, and breaking data silos for better decision‑making across organizations.

AutomationBig DataContinuous Analytics
0 likes · 10 min read
Understanding DataOps: Principles, Benefits, and Implementation