Tagged articles
548 articles
Page 4 of 6
DataFunTalk
DataFunTalk
Mar 8, 2023 · Artificial Intelligence

Applying AI Algorithms to Big Data Governance: Use Cases and Future Directions

This article presents Datacake's experience of integrating AI algorithms into big data governance, covering the bidirectional relationship between AI and big data, health‑score assessment of data tasks, intelligent Spark parameter tuning, SQL engine selection, and future application scenarios across the data lifecycle.

AIBig DataData Governance
0 likes · 18 min read
Applying AI Algorithms to Big Data Governance: Use Cases and Future Directions
Baidu Geek Talk
Baidu Geek Talk
Mar 6, 2023 · Big Data

Accelerating Data Production and Consumption in Baidu's Performance Platform

Baidu's Performance Platform speeds data production and consumption by adopting a unified stream‑batch architecture with TM and Spark, leveraging the Turing warehouse, introducing tiered service grading, robust governance and compliance measures, and offering self‑service analytics, cutting latency from minutes or days to milliseconds while handling billions of daily records and boosting SLA adherence, data accuracy, and user satisfaction.

Big DataData GovernanceReal-time Processing
0 likes · 12 min read
Accelerating Data Production and Consumption in Baidu's Performance Platform
Big Data Technology & Architecture
Big Data Technology & Architecture
Mar 3, 2023 · Fundamentals

Understanding Data Management Principles and Governance: Insights from DMBOK

This article explains the core principles, strategies, frameworks, and governance practices of data management based on DAMA's DMBOK, covering data lifecycle, value, leadership responsibilities, strategic planning, governance models, metrics, and implementation guidelines to help organizations derive business value from high‑quality data.

DMBOKData GovernanceData Management
0 likes · 17 min read
Understanding Data Management Principles and Governance: Insights from DMBOK
DataFunTalk
DataFunTalk
Mar 2, 2023 · Information Security

Data Security Governance Practices at Zhongyuan Bank: Framework, Management System, Technical Architecture, and Future Planning

The article details Zhongyuan Bank's comprehensive data security governance, covering the regulatory background, protection objectives, classification of data assets, organizational and procedural management mechanisms, technical safeguards across the data lifecycle, and future planning to enhance compliance and risk mitigation in the banking sector.

BankingData Governancecompliance
0 likes · 30 min read
Data Security Governance Practices at Zhongyuan Bank: Framework, Management System, Technical Architecture, and Future Planning
DataFunTalk
DataFunTalk
Feb 27, 2023 · Big Data

Comprehensive Overview of Data Middle Platform Architecture and Its Core Frameworks

This article provides a detailed overview of data middle platform concepts, describing a decoupled six‑subsystem architecture—including storage, collection, processing, governance, security, and operation frameworks—while illustrating typical enterprise implementations, industry‑specific solutions, and best‑practice considerations for building scalable, secure, and value‑driven data platforms.

Big DataData GovernanceData Integration
0 likes · 25 min read
Comprehensive Overview of Data Middle Platform Architecture and Its Core Frameworks
DataFunTalk
DataFunTalk
Feb 26, 2023 · Big Data

Design, Optimization, and Use Cases of Data Lineage in ByteDance's DataLeap Platform

This article presents an in‑depth overview of DataLeap's data lineage capabilities, covering the challenges, multi‑layer model design, implementation with Apache Atlas and JanusGraph, performance optimizations, diverse use cases across asset, development, governance and security domains, and future trends for lineage technology.

Apache AtlasBig DataData Governance
0 likes · 19 min read
Design, Optimization, and Use Cases of Data Lineage in ByteDance's DataLeap Platform
DataFunSummit
DataFunSummit
Feb 21, 2023 · Artificial Intelligence

Practices and Reflections on Building an AI Platform at Zhongyuan Bank

This article details Zhongyuan Bank's AI platform construction, covering its objectives, MLOps-driven design, core modules such as data ingestion, processing, model development, training, evaluation, deployment, monitoring, as well as resource orchestration with Kubernetes and Docker, and the accompanying ModelOps governance framework.

AIBankingData Governance
0 likes · 22 min read
Practices and Reflections on Building an AI Platform at Zhongyuan Bank
StarRocks
StarRocks
Feb 21, 2023 · Databases

How Yidian Tianxia Built a Unified Real‑Time & Offline Data Warehouse with StarRocks

Yidian Tianxia tackled massive daily data volumes and complex analytics by defining a five‑layer data‑warehouse standard, comparing ClickHouse and StarRocks performance, and implementing a unified real‑time/offline architecture with StarRocks, DataPlus, and EasyJob, achieving multi‑fold query speedups and lower operational costs.

ClickHouseData GovernanceData Warehouse
0 likes · 14 min read
How Yidian Tianxia Built a Unified Real‑Time & Offline Data Warehouse with StarRocks
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 20, 2023 · Big Data

How Alibaba’s DataWorks Transforms Data Governance for Efficiency, Security, and Cost Savings

This article explores Alibaba's DataWorks platform and its comprehensive data governance practices, covering application efficiency, security controls, cost optimization, organizational structure, and cultural initiatives that together enable scalable, secure, and cost‑effective data management across the enterprise.

Big DataCost OptimizationData Governance
0 likes · 31 min read
How Alibaba’s DataWorks Transforms Data Governance for Efficiency, Security, and Cost Savings
DataFunTalk
DataFunTalk
Feb 18, 2023 · Big Data

Xiaomi Data Governance Evolution: Cost Governance Practices for HDFS and HBase

The article outlines Xiaomi's data governance journey, focusing on storage‑service cost governance, describing the transition from simple cost‑centered governance to big‑data‑driven asset management, and detailing concrete HDFS and HBase practices that achieved significant resource and cost reductions.

Big DataData GovernanceHBase
0 likes · 15 min read
Xiaomi Data Governance Evolution: Cost Governance Practices for HDFS and HBase
DataFunSummit
DataFunSummit
Feb 17, 2023 · Big Data

Data Governance Practices and Platform Construction with Alibaba DataWorks

Alibaba’s DataWorks team shares extensive experiences in building and operating a large‑scale data platform, covering data governance across stages—from data stability and quality to security, cost control, and organizational culture—illustrating how systematic practices and tools drive efficiency, reliability, and value for enterprises.

Big DataCost OptimizationData Governance
0 likes · 55 min read
Data Governance Practices and Platform Construction with Alibaba DataWorks
Data Thinking Notes
Data Thinking Notes
Feb 14, 2023 · Big Data

How Cloud Music Turned 60k Tables into Valuable Data Assets

This article details Cloud Music's year‑long data assetization journey, covering the background, practical achievements, governance methods, and future roadmap for turning massive data warehouses into high‑value, well‑governed assets that drive cost reduction and business insight.

Big DataData GovernanceData Platform
0 likes · 10 min read
How Cloud Music Turned 60k Tables into Valuable Data Assets
Data Thinking Notes
Data Thinking Notes
Feb 9, 2023 · Fundamentals

Why Data Standards Are the Key to Unlocking Business Value

This article explains how data standards form the foundation of data governance, clarifies data assets, breaks silos, accelerates data flow, and outlines their definitions, benefits, common challenges, essential components, and best practices for effective implementation.

Data GovernanceData Managementbest practices
0 likes · 14 min read
Why Data Standards Are the Key to Unlocking Business Value
DataFunSummit
DataFunSummit
Feb 8, 2023 · Product Management

Content‑Driven Data Product Management: Challenges, Governance Frameworks, and Implementation Strategies

This article shares practical insights from a data product expert on the problems faced by content‑oriented data products, outlines a comprehensive governance methodology—including DAMA, Huawei, and Alibaba frameworks—and demonstrates how to operationalize these ideas through concrete examples such as event‑tracking and metric governance.

Big DataData GovernanceData Product Management
0 likes · 16 min read
Content‑Driven Data Product Management: Challenges, Governance Frameworks, and Implementation Strategies
Youzan Coder
Youzan Coder
Feb 7, 2023 · Big Data

Automated Offline Data Cost Optimization in Youzan's Data Platform

Youzan built an automated offline data cost‑optimization platform that gathers accurate metadata, mines unused or failing tables and tasks, and safely decommissions them through a backend‑frontend workflow with owner validation, notifications, rollback safeguards, and plans to extend lineage coverage and real‑time asset handling.

Big DataCost reductionData Governance
0 likes · 11 min read
Automated Offline Data Cost Optimization in Youzan's Data Platform
Data Thinking Notes
Data Thinking Notes
Feb 6, 2023 · Big Data

How Tencent Tackles Data Governance Challenges with the WeData Platform

This article outlines Tencent's data governance challenges, its internal three‑stage practice, detailed case studies such as Tencent News and PCG cost governance, and introduces the WeData platform's architecture and tools for standardization, quality, security, and metadata management, concluding with a Q&A session.

Big DataData GovernanceData Platform
0 likes · 17 min read
How Tencent Tackles Data Governance Challenges with the WeData Platform
Big Data Technology & Architecture
Big Data Technology & Architecture
Feb 4, 2023 · Big Data

Apache Linkis Graduates to Top-Level Project – Overview, Core Features, Roadmap, and Ecosystem

The article announces Apache Linkis’s graduation to an Apache top‑level project, explains its role as a computing middleware linking applications to engines like Spark, Hive, and Flink, details its core capabilities, roadmap, ecosystem integrations, and provides official resources for the community.

ApacheBig DataComputing Middleware
0 likes · 8 min read
Apache Linkis Graduates to Top-Level Project – Overview, Core Features, Roadmap, and Ecosystem
Data Thinking Notes
Data Thinking Notes
Feb 2, 2023 · Fundamentals

Why Metadata Management Is the Key to Unlocking Data Value

This article explains how effective metadata management provides context, improves data quality, enables data lineage tracing, supports governance, and ultimately turns raw data into valuable assets for enterprises navigating complex, evolving data environments.

Data GovernanceData LineageData Management
0 likes · 35 min read
Why Metadata Management Is the Key to Unlocking Data Value
Data Thinking Notes
Data Thinking Notes
Jan 31, 2023 · Fundamentals

Mastering Data Governance: From Metadata to ETL in One Guide

This comprehensive guide walks you through the entire data governance ecosystem, covering metadata fundamentals, classification, maturity models, data standards, modeling, integration, lifecycle management, quality assurance, security, and ETL processes, all illustrated with clear diagrams and practical steps.

Data GovernanceData IntegrationData Quality
0 likes · 13 min read
Mastering Data Governance: From Metadata to ETL in One Guide
DataFunTalk
DataFunTalk
Jan 31, 2023 · Big Data

Tencent's Data Governance Practices and Technical Implementation

This article presents Tencent's comprehensive data governance framework, covering its definition, objectives, challenges, methodology, organizational structure, metadata management, data asset lifecycle, security measures, and technical implementation details such as microservice architecture, data collection, lineage analysis, and storage solutions.

Big DataData GovernanceTencent
0 likes · 19 min read
Tencent's Data Governance Practices and Technical Implementation
DataFunTalk
DataFunTalk
Jan 30, 2023 · Big Data

Data Governance Strategies: Principles, Practices, and Real‑World Case Studies

The article explains why data governance is essential for high‑quality data in big‑data organizations, outlines narrow and broad governance scopes, presents strategic principles, and shares eight detailed case studies from leading Chinese tech companies illustrating practical implementation and lessons learned.

Big DataData Governance
0 likes · 7 min read
Data Governance Strategies: Principles, Practices, and Real‑World Case Studies
Data Thinking Notes
Data Thinking Notes
Jan 29, 2023 · Big Data

How to Turn Data Assets into Business Value: A Roadmap for Enterprises

Enterprises must shift their perception of data assets and embed data‑value into every digital process, establishing governance, unified asset catalogs, operational metrics, security controls, integration, services, and visualization to transform raw data into strategic business outcomes.

Big DataData GovernanceData Integration
0 likes · 12 min read
How to Turn Data Assets into Business Value: A Roadmap for Enterprises
DataFunSummit
DataFunSummit
Jan 27, 2023 · Big Data

Data Governance Strategies: Principles, Practices, and Case Studies

The article explains the importance of data governance, distinguishes narrow and broad governance, outlines strategic principles such as systemic engineering and prioritization, and presents eight case studies from leading Chinese tech companies illustrating practical implementations and effective strategies.

Big DataCase StudyData Governance
0 likes · 8 min read
Data Governance Strategies: Principles, Practices, and Case Studies
DataFunTalk
DataFunTalk
Jan 26, 2023 · Big Data

Tencent Data Governance Practices and the WeData Platform

This article outlines Tencent's data governance challenges, internal practices across three maturity stages, and introduces the WeData platform that provides comprehensive capabilities for data assetization, cost control, quality assurance, security, and metadata management to support large‑scale big‑data operations.

Big DataData GovernanceTencent
0 likes · 15 min read
Tencent Data Governance Practices and the WeData Platform
DataFunTalk
DataFunTalk
Jan 26, 2023 · Big Data

Data Governance Strategies: Principles, Practices, and Real‑World Case Studies

This article explains why data is a company's most valuable asset, distinguishes narrow and broad data‑governance approaches, outlines strategic design principles, and presents eight detailed case studies from leading Chinese tech firms illustrating practical governance implementations and lessons learned.

Big DataData Governance
0 likes · 8 min read
Data Governance Strategies: Principles, Practices, and Real‑World Case Studies
DataFunSummit
DataFunSummit
Jan 21, 2023 · Big Data

Building and Evolving Data Management Systems: From IT to DT Era, Standards, Models, and Marketization

This article outlines the evolution of data management in the big‑data era, covering the history of the industry, key governance frameworks such as DMBOK, DCMM and DMM, the steps to construct a data‑management system, the requirements for a data‑factor market, and an introduction to the DataEasy company and its services.

Big DataDCMMDMBOK
0 likes · 15 min read
Building and Evolving Data Management Systems: From IT to DT Era, Standards, Models, and Marketization
Huolala Tech
Huolala Tech
Jan 16, 2023 · Big Data

How Leading Logistics Companies Master Data Governance for Cost and Stability

At the 2022 DataFun Summit, data governance experts from Huolala, Zhongtong, and SF Express shared comprehensive practices—including governance drivers, quality monitoring, model management, master data processes, platform architecture, cost control, and stability measures—illustrating how large logistics firms implement end‑to‑end data governance to boost efficiency, compliance, and business value.

Big DataCost ManagementData Governance
0 likes · 13 min read
How Leading Logistics Companies Master Data Governance for Cost and Stability
DataFunTalk
DataFunTalk
Jan 14, 2023 · Big Data

Lean Data Methodology: A Visual Guide to Data‑Driven Digital Transformation

Lean Data Methodology combines lean thinking, design thinking, the Cynefin framework, and agile principles to create a data‑driven digital transformation system that defines value, eliminates waste, and equips enterprises with strategic, product, governance, collaboration, platform, and cultural capabilities for building lean digital enterprises.

Data GovernanceData PlatformDigital Transformation
0 likes · 11 min read
Lean Data Methodology: A Visual Guide to Data‑Driven Digital Transformation
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 12, 2023 · Operations

What Is DataOps and How Can It Transform Your Data Management?

DataOps, the data‑centric counterpart of DevOps, combines agile principles, standardized tools, and cross‑team collaboration to manage the full data lifecycle—from integration and development to storage, governance, and service—enabling organizations to handle massive, diverse datasets efficiently, reduce silos, and turn data into actionable value.

Big DataData GovernanceData Integration
0 likes · 15 min read
What Is DataOps and How Can It Transform Your Data Management?
DataFunSummit
DataFunSummit
Jan 9, 2023 · Big Data

JD Data‑Driven Business Development: Building a Business Metric Data System and Marketplace Governance

The article outlines JD's data‑driven business development strategy, describing the current challenges of its business data marketplace, the governance framework—including layered architecture, standardization, ClickHouse dictionary refresh, and optimization measures—and the resulting performance improvements and future outlook.

Big DataClickHouseData Governance
0 likes · 13 min read
JD Data‑Driven Business Development: Building a Business Metric Data System and Marketplace Governance
DataFunTalk
DataFunTalk
Jan 8, 2023 · Big Data

ByteDance Event‑Tracking Data Cost Governance Practices

This article describes ByteDance's comprehensive approach to managing the massive volume of event‑tracking (埋点) data, detailing the background, cost‑reduction strategies, experience review, future plans, and a Q&A session that together illustrate how systematic data governance can dramatically cut storage and processing expenses.

Big DataByteDanceData Governance
0 likes · 18 min read
ByteDance Event‑Tracking Data Cost Governance Practices
DataFunSummit
DataFunSummit
Jan 4, 2023 · Big Data

Data Intelligence Expert Interview – Maturity, Trends, and Practices of Data Middle Platforms

The interview gathers insights from data‑platform experts on the maturity stages, technology trends, implementation methodologies, open‑source ecosystems, system architectures, governance, security, and assessment criteria of modern data middle platforms, offering a comprehensive guide for practitioners.

Big DataData GovernanceData Observability
0 likes · 28 min read
Data Intelligence Expert Interview – Maturity, Trends, and Practices of Data Middle Platforms
Meituan Technology Team
Meituan Technology Team
Dec 29, 2022 · Artificial Intelligence

Top 20 Most Popular Meituan Tech Blog Articles of 2022

Meituan’s technology team highlights its twenty most‑read 2022 blog posts, spanning observability, system design, data governance, AI, cloud‑native engineering, and practical innovations such as visual log tracing, Kafka scaling, functional programming, Elasticsearch optimization, CI/CD pipelines, and advanced object‑detection frameworks.

2022 HighlightsData GovernanceMeituan
0 likes · 13 min read
Top 20 Most Popular Meituan Tech Blog Articles of 2022
DataFunTalk
DataFunTalk
Dec 26, 2022 · Product Management

How Product and Business Teams Should Participate in Building Data Metric Systems

The article explains how product and business teams should collaborate with data teams to build and promote data metric systems, emphasizing mutual empowerment, joint methodology, pilot testing, and scaling, while also announcing DataFun's 5‑year anniversary activities and upcoming big‑data and AI publications.

AnalyticsData Governancebusiness collaboration
0 likes · 3 min read
How Product and Business Teams Should Participate in Building Data Metric Systems
DataFunSummit
DataFunSummit
Dec 24, 2022 · Operations

Understanding DataOps: Evolution, Technology Stacks, and Industry Applications

This article explores DataOps from its historical evolution through the digital 3.0 era, outlines its core technology stacks such as Data Fabric, Data Mesh, and Modern Data Stack, and demonstrates practical applications across finance, manufacturing, telecom, and public services, highlighting its role in agile, cloud‑native data management.

Big DataData GovernanceDataOps
0 likes · 18 min read
Understanding DataOps: Evolution, Technology Stacks, and Industry Applications
DataFunTalk
DataFunTalk
Dec 21, 2022 · Fundamentals

The Closed‑Loop Logic of Data Governance at Kuaikan Manhua

Kuaikan Manhua ensures continuous data governance by establishing a closed‑loop of business scope management, data asset standards, and feedback mechanisms that keep data pollution slower than governance speed, enabling systematic, long‑term data quality improvement.

Closed‑LoopData GovernanceData Quality
0 likes · 6 min read
The Closed‑Loop Logic of Data Governance at Kuaikan Manhua
DataFunTalk
DataFunTalk
Dec 20, 2022 · Big Data

ByteDance's Practices for Tracking Data Governance and Pipeline Management

This article explains ByteDance's end‑to‑end tracking data lifecycle management, including pre‑report validation, the rationale for using BMQ over Kafka, quality governance examples, and how Flink‑based pipelines ensure data accuracy through SLA monitoring and checkpoint strategies.

Data GovernanceData TrackingFlink
0 likes · 5 min read
ByteDance's Practices for Tracking Data Governance and Pipeline Management
Data Thinking Notes
Data Thinking Notes
Dec 19, 2022 · Big Data

Data Quality Mastery: From Expectations to Operational Assurance

This article outlines a comprehensive data quality management framework, covering expectations, measurement, assurance, and operational practices, and provides concrete templates, rule designs, and governance processes to help data teams systematically assess, monitor, and improve data reliability throughout the lifecycle.

Big DataData GovernanceData Quality
0 likes · 18 min read
Data Quality Mastery: From Expectations to Operational Assurance
iQIYI Technical Product Team
iQIYI Technical Product Team
Dec 16, 2022 · Databases

Database Mesh 2.0 and Pisanix: Cloud‑Native Database Governance at iQIYI

iQIYI adopts SphereEx’s Database Mesh 2.0 by extending ShardingSphere‑JDBC and integrating the cloud‑native Pisanix proxy, creating a unified, encrypted configuration center that enables dynamic sharding, load‑balancing, read‑write separation, hot‑updates and observability, dramatically simplifying database governance and cloud migration.

Data GovernanceDatabase MeshPisanix
0 likes · 14 min read
Database Mesh 2.0 and Pisanix: Cloud‑Native Database Governance at iQIYI
DataFunSummit
DataFunSummit
Dec 13, 2022 · Big Data

Introducing the Star River Big Data Development Platform: Architecture, Core Capabilities, and Future Plans

This article presents an in‑depth overview of 58.com’s self‑built Star River big data platform, covering its evolution across three eras, resource management hierarchy, core technical capabilities such as metadata services, data maps and lineage, governance practices, and the roadmap for further enhancements.

Big DataData GovernanceData Platform
0 likes · 14 min read
Introducing the Star River Big Data Development Platform: Architecture, Core Capabilities, and Future Plans
DataFunTalk
DataFunTalk
Dec 12, 2022 · Big Data

Cloud‑Native and Intelligent Fusion: Key Trends Shaping the Future of Big Data

The article explains how cloud‑native architectures, data governance, intelligent fusion, and privacy computing are driving the evolution of big data, recounting the history from Google’s early papers and Hadoop to modern managed services, compute‑storage separation, AI‑powered recommendation platforms, and real‑world success cases.

Big DataCloud NativeData Governance
0 likes · 10 min read
Cloud‑Native and Intelligent Fusion: Key Trends Shaping the Future of Big Data
DataFunSummit
DataFunSummit
Dec 7, 2022 · Big Data

Modern Data Governance at NetEase DataFan: Evolution, Challenges, and Solutions

This article details NetEase DataFan's journey in building a full‑stack big‑data platform, explains the design‑first data‑mid‑platform approach, analyzes cost, quality, and security problems encountered, and presents the modern data‑governance framework that integrates development, governance, and consumption into a closed loop.

Big DataCost ManagementData Governance
0 likes · 22 min read
Modern Data Governance at NetEase DataFan: Evolution, Challenges, and Solutions
Data Thinking Notes
Data Thinking Notes
Dec 5, 2022 · Big Data

How NetEase Cloud Music Cut Storage Costs by 30% Through Data Governance

This article details NetEase Cloud Music's year‑long data governance initiative, covering data background, governance strategy, project plan, practical actions, results, and future outlook, and shows how metadata‑driven management reduced storage by over 30% while improving reliability and efficiency.

Big DataCost OptimizationData Governance
0 likes · 17 min read
How NetEase Cloud Music Cut Storage Costs by 30% Through Data Governance
Bilibili Tech
Bilibili Tech
Dec 2, 2022 · Big Data

Data Quality Management: Expectations, Measurement, Assurance, and Operation

The article outlines a complete data‑quality‑management framework that first captures business expectations, then translates them into basic and personalized measurement rules, defines four assurance approaches for handling violations, and scales operation with indicators, tooling, and metrics to continuously improve data quality across the lifecycle.

Data GovernanceData QualityMetrics
0 likes · 19 min read
Data Quality Management: Expectations, Measurement, Assurance, and Operation
Data Thinking Notes
Data Thinking Notes
Nov 24, 2022 · Fundamentals

How to Build an Enterprise Data Governance System from Scratch

This article explains what data governance is, why enterprises need it, the key components such as data quality, metadata, master data, asset and security management, and provides a step‑by‑step framework, organizational structure, platform features, evaluation methods and common pitfalls.

Data AssetsData GovernanceData Quality
0 likes · 17 min read
How to Build an Enterprise Data Governance System from Scratch
Efficient Ops
Efficient Ops
Nov 22, 2022 · Operations

Why Data Quality Is the Hidden Cost Killer and How to Master Its Governance

This article explains why data quality is critical for business success, outlines common data quality problems and their root causes, and presents a practical governance framework with monitoring rules, alerts, full‑link monitoring, and a seven‑dimensional evaluation model to continuously improve data reliability.

Data GovernanceData Qualitydata monitoring
0 likes · 12 min read
Why Data Quality Is the Hidden Cost Killer and How to Master Its Governance
TAL Education Technology
TAL Education Technology
Nov 17, 2022 · Big Data

Real-Time Data Warehouse: Background, Value Assessment, and Half-Year Progress

This article outlines the background and terminology of data warehousing, presents a formula for evaluating warehouse value, and details the team's half‑year efforts—including architecture selection, quality assurance, stability governance, and data‑value externalization—to improve efficiency, quality, stability, and cost in real‑time data services.

Data GovernanceReal-time analyticsdata operations
0 likes · 10 min read
Real-Time Data Warehouse: Background, Value Assessment, and Half-Year Progress
Data Thinking Notes
Data Thinking Notes
Nov 16, 2022 · Big Data

Why Metadata Management Is Essential for Data Warehouses

This article explains the concept of metadata, its role in data warehouses, why managing metadata is critical for building, maintaining, and scaling data warehouse systems, and outlines practical steps, use cases, and tools for effective metadata management.

Data GovernanceData WarehouseETL
0 likes · 15 min read
Why Metadata Management Is Essential for Data Warehouses
DataFunSummit
DataFunSummit
Nov 15, 2022 · Big Data

Industrial Data Governance: Challenges, Practices, and Insights

Industrial data governance, essential for digital transformation, faces challenges such as data heterogeneity, volume, quality, and integration across the value chain, and the presentation outlines background, practical approaches, strategic thinking, and a phased, demand‑driven model to enhance data quality, assetization, and business value.

Big DataData GovernanceDigital Transformation
0 likes · 24 min read
Industrial Data Governance: Challenges, Practices, and Insights
DataFunSummit
DataFunSummit
Nov 12, 2022 · Big Data

SF Express Technology Data Governance Practice and Framework

This article details SF Express Technology’s decade‑long data governance journey, outlining its three‑phase evolution, comprehensive framework, key policies, organizational structure, and practical implementations such as master data management, data quality, metadata, data market, and security, highlighting lessons and best practices for enterprise data management.

Data GovernanceEnterprise DataMaster Data Management
0 likes · 17 min read
SF Express Technology Data Governance Practice and Framework
Data Thinking Notes
Data Thinking Notes
Nov 10, 2022 · Big Data

Building Kuaishou’s Scalable Metadata Management Platform for Big Data

This article details Kuaishou’s evolution of its metadata management platform—from early Hive‑centric beginnings to a unified 2.0 architecture and a forward‑looking 3.0 vision—highlighting challenges, key technologies, and how metadata drives data production, consumption, governance, and cost optimization across the big‑data middle platform.

Data GovernanceData Platformmetadata lineage
0 likes · 17 min read
Building Kuaishou’s Scalable Metadata Management Platform for Big Data
DataFunSummit
DataFunSummit
Nov 7, 2022 · Big Data

Huolala's Data Governance Practices: Data Quality, Metadata, and Cost Management Platforms

This article details Huolala's end‑to‑end data governance practice, covering the construction of a data governance framework, the implementation of a zero‑code data quality platform, a metadata management platform, and a cost‑governance system that together improve data reliability, reduce waste, and support scalable big‑data operations.

Big DataCost ManagementData Governance
0 likes · 14 min read
Huolala's Data Governance Practices: Data Quality, Metadata, and Cost Management Platforms
Tencent Cloud Developer
Tencent Cloud Developer
Nov 7, 2022 · Big Data

Data Engineering and Data Warehouse Design: Principles, Practices, and Governance

The article outlines comprehensive data‑engineering and warehouse‑design principles—covering collection (four Ws and methods like SDK, point‑code, binlog), reporting strategies, source selection, modeling with fact, aggregation, dimension and model tables, quality checks, and governance practices such as standardized SDKs, metric libraries, automated lineage, and cost optimization—to share actionable experience for any organization.

Big DataData GovernanceData Warehouse
0 likes · 32 min read
Data Engineering and Data Warehouse Design: Principles, Practices, and Governance
DevOps Cloud Academy
DevOps Cloud Academy
Nov 5, 2022 · Fundamentals

Understanding Data Architecture: Definitions, Problems Solved, Core Components, and Future Trends

This article explains what data architecture is, why it is essential for linking business and technology, outlines its main components such as data models, data flows, value streams and standards, and discusses emerging trends toward service‑oriented, consumption‑focused data architectures.

Data ArchitectureData GovernanceData Management
0 likes · 9 min read
Understanding Data Architecture: Definitions, Problems Solved, Core Components, and Future Trends
DataFunSummit
DataFunSummit
Nov 1, 2022 · Big Data

Case Study of DCMM Standard Implementation at State Grid Tianjin Electric Power

This article details State Grid Tianjin Electric Power's early adoption and successful certification of the national DCMM data management maturity model, outlining background, certification milestones, systematic practices, and lessons learned that illustrate how data governance, architecture, and application strategies drive digital transformation.

Big DataCase StudyDCMM
0 likes · 11 min read
Case Study of DCMM Standard Implementation at State Grid Tianjin Electric Power
DevOps Cloud Academy
DevOps Cloud Academy
Oct 27, 2022 · Big Data

Understanding DataOps: Concepts, Standards, and Enterprise Practices

This article explains DataOps as a methodology for improving data analysis quality and efficiency, outlines its origins, standards, and maturity model, and presents practical insights and case studies from Chinese enterprises on how DataOps addresses common data engineering challenges and drives digital transformation.

Big DataData GovernanceData Management
0 likes · 12 min read
Understanding DataOps: Concepts, Standards, and Enterprise Practices
DataFunTalk
DataFunTalk
Oct 26, 2022 · Big Data

Metadata Management and Governance Practices at Wing Payment: Architecture, Techniques, and Future Outlook

This article explains how metadata serves as the foundation of enterprise data governance, outlines common data governance challenges, describes Wing Payment's metadata governance framework and platform architecture, and presents future directions such as multi‑source management, cross‑cluster disaster recovery, and intelligent recommendation.

Big DataData GovernanceData Lineage
0 likes · 18 min read
Metadata Management and Governance Practices at Wing Payment: Architecture, Techniques, and Future Outlook
Kuaishou Big Data
Kuaishou Big Data
Oct 25, 2022 · Big Data

How Kuaishou Built a Scalable Big Data Platform with Unified Data Quality and Metric Services

This article details Kuaishou's end‑to‑end big data platform, describing its organizational model, unified data governance framework, comprehensive data‑quality solution, the design of a headless metric platform, key technologies such as automatic modeling and code generation, and future directions toward a decentralized, smart data fabric.

Big DataData GovernanceData Quality
0 likes · 21 min read
How Kuaishou Built a Scalable Big Data Platform with Unified Data Quality and Metric Services
DataFunSummit
DataFunSummit
Oct 22, 2022 · Big Data

Tencent Music's Data Asset Management and Governance Practices

The article details Tencent Music's data governance journey, describing the background of rapid resource growth, challenges in cost management, a multi‑layered governance methodology—including metadata, tiered storage, and a Lego metadata platform—and the resulting improvements in resource utilization and data quality.

Big DataData GovernanceResource Optimization
0 likes · 14 min read
Tencent Music's Data Asset Management and Governance Practices
Youzan Coder
Youzan Coder
Sep 29, 2022 · Big Data

Implementing Spark Data Lineage with Spline: A Step‑by‑Step Guide

This article explains the growing importance of data lineage in large data warehouses, evaluates three Spark lineage extraction approaches, and provides a detailed, step‑by‑step guide to integrating the open‑source Spline agent—including codeless and programmatic initialization, configuration, dispatcher setup, post‑processing, and known limitations.

Apache SparkBig DataData Governance
0 likes · 16 min read
Implementing Spark Data Lineage with Spline: A Step‑by‑Step Guide
Meituan Technology Team
Meituan Technology Team
Sep 22, 2022 · Information Security

Tokenization for Data Security: Design, Implementation, and Engineering Practices

The article explains how tokenization transforms data security into a built‑in attribute that automatically scales with data growth, detailing its design principles, generation methods, architectural layers, security safeguards, and practical engineering experiences to address exposure risks in modern digital businesses.

Data GovernancePIISecurity Architecture
0 likes · 24 min read
Tokenization for Data Security: Design, Implementation, and Engineering Practices
ShiZhen AI
ShiZhen AI
Sep 7, 2022 · Big Data

Getting Started with DataHub: A One‑Stop Guide to Metadata Governance

This article walks you through the fundamentals of data governance, explains metadata management concepts, compares traditional tools with DataHub, and provides a step‑by‑step tutorial for installing Docker, Python, and DataHub 0.8.20 on CentOS 7, ingesting MySQL metadata, and exploring the UI.

Big DataData GovernanceDataHub
0 likes · 19 min read
Getting Started with DataHub: A One‑Stop Guide to Metadata Governance
DataFunSummit
DataFunSummit
Aug 19, 2022 · Big Data

Taobao Data Model Governance: Challenges, Analysis, and Solutions

This article presents a comprehensive overview of Taobao's data model governance, detailing the background and problems of the current data architecture, analyzing root causes, proposing a structured governance framework with DataWorks automation, and outlining future plans to improve efficiency, standardization, and product tooling.

AlibabaBig DataData Governance
0 likes · 13 min read
Taobao Data Model Governance: Challenges, Analysis, and Solutions
DataFunSummit
DataFunSummit
Aug 17, 2022 · Big Data

Data Governance Practices and Frameworks: Insights from Alibaba

This article presents an overview of data governance concepts, common enterprise challenges, and Alibaba's comprehensive data governance framework, covering theory, demand layers, practical solutions for stability, quality, standards, security, cost control, and the supporting platforms and operational practices.

AlibabaBig DataData Governance
0 likes · 13 min read
Data Governance Practices and Frameworks: Insights from Alibaba
High Availability Architecture
High Availability Architecture
Aug 15, 2022 · Big Data

Comprehensive Guide to Event Tracking Governance and the One‑Stop Tracking Management Platform

This article explains why event‑tracking (埋点) governance is essential, outlines the methodology and practice of full‑link tracking management, and introduces the one‑stop tracking platform with its innovative features such as standardized processes, verification tools, real‑time dashboards, cross‑platform data unification, and future roadmap.

AnalyticsBig DataData Governance
0 likes · 15 min read
Comprehensive Guide to Event Tracking Governance and the One‑Stop Tracking Management Platform
DataFunTalk
DataFunTalk
Aug 13, 2022 · Big Data

Data Governance Practices and Logical Closed‑Loop at KuaiKan

The talk outlines KuaiKan's data governance journey, describing the rapid business growth challenges, the three‑step logical closed‑loop framework, practical experiences in business scope management, data asset governance, collaboration techniques, and future outlook, highlighting evaluation metrics and ongoing improvements.

Big DataData GovernanceData Quality
0 likes · 16 min read
Data Governance Practices and Logical Closed‑Loop at KuaiKan
Python Crawling & Data Mining
Python Crawling & Data Mining
Aug 6, 2022 · Operations

Why Operations Data Quality Is the Key to Successful Digital Transformation

In the era of big data, poor operations data quality undermines analytics, decision‑making and digital transformation, so organizations must adopt a three‑dimensional governance approach—covering organization, processes and technology—to ensure completeness, consistency, accuracy, uniqueness, relevance and timeliness of their operational data.

AnalyticsData GovernanceData Quality
0 likes · 17 min read
Why Operations Data Quality Is the Key to Successful Digital Transformation
Snowball Engineer Team
Snowball Engineer Team
Aug 5, 2022 · Big Data

Snowball Data Warehouse Modeling and OneData System Implementation

This article outlines Snowball's data warehouse background, compares major modeling approaches such as ER, dimensional, DataVault and Anchor models, describes the current challenges of their dimensional model, and details the OneData methodology—including OneModel, OneID, and OneService—along with its practical implementation, results, and future plans.

Big DataData GovernanceData Warehouse
0 likes · 23 min read
Snowball Data Warehouse Modeling and OneData System Implementation
High Availability Architecture
High Availability Architecture
Aug 5, 2022 · Big Data

Innovative Marketing Practices on the Cloud: How an Intelligent Data Lake Enables Flexible and Efficient Marketing Capabilities

The presentation details how Amazon Web Services’ intelligent data lake architecture integrates big data and machine learning to overcome marketing challenges, improve data governance, and provide scalable, real‑time analytics for personalized, data‑driven marketing across enterprises.

AWSBig DataData Governance
0 likes · 13 min read
Innovative Marketing Practices on the Cloud: How an Intelligent Data Lake Enables Flexible and Efficient Marketing Capabilities
Architecture Digest
Architecture Digest
Aug 1, 2022 · Big Data

Understanding Data Lakes: Concepts, Features, Architectures, and Vendor Solutions

This article provides a comprehensive overview of data lakes, explaining their definition, key characteristics, architectural evolution, and detailed comparisons of major cloud providers' solutions, while also presenting typical use cases, construction processes, and future development directions for this emerging big‑data infrastructure.

AWSAlibaba CloudAzure
0 likes · 52 min read
Understanding Data Lakes: Concepts, Features, Architectures, and Vendor Solutions
Big Data Technology Architecture
Big Data Technology Architecture
Jul 28, 2022 · Big Data

Reflections on Data Governance Challenges and Approaches

The author shares a candid account of transitioning from a non‑data role to confronting data‑centric bottlenecks, describing the current state of data projects, common pitfalls, and practical thoughts on simplifying data governance within limited resources and budget constraints.

Big DataDAMAData Governance
0 likes · 7 min read
Reflections on Data Governance Challenges and Approaches
Architects Research Society
Architects Research Society
Jul 26, 2022 · Information Security

Data Governance: Securing the Data Lifecycle in Cloud Environments

This article explains how enterprises can implement data governance to protect data throughout its lifecycle—collection, storage, processing, and deletion—especially in public and hybrid cloud settings, outlining SABSA categories, key questions, and practical considerations for secure data management.

Data GovernanceSABSAcloud security
0 likes · 6 min read
Data Governance: Securing the Data Lifecycle in Cloud Environments
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jul 26, 2022 · Big Data

How Alibaba’s Big Data Model Governance Boosted Efficiency and Cut Costs

This report details Alibaba’s large‑scale data model governance initiative for the DaTao ecosystem, analyzing current data issues such as naming inconsistencies, low reuse, and application‑layer inefficiencies, and presents a comprehensive solution—including a model evaluation system, DataWorks co‑development, intelligent modeling, data map enhancements, and future roadmap—to improve data health, reduce costs, and increase operational efficiency.

Big DataData GovernanceDataWorks
0 likes · 15 min read
How Alibaba’s Big Data Model Governance Boosted Efficiency and Cut Costs
DataFunTalk
DataFunTalk
Jul 25, 2022 · Big Data

Taobao Data Model Governance and Intelligent Modeling with DataWorks

This article summarizes Guo Jinshi's presentation on Taobao's data model governance, covering the current data landscape, identified problems, analysis of root causes, proposed governance solutions—including DataWorks intelligent modeling—and future plans, while also providing a Q&A session on practical implementation.

AlibabaBig DataData Governance
0 likes · 13 min read
Taobao Data Model Governance and Intelligent Modeling with DataWorks
AntTech
AntTech
Jun 29, 2022 · Information Security

Data Confidentiality Era: Development and Security – Highlights from Wei Tao’s 2022 Big Data Summit Speech

Wei Tao, Vice President of Ant Group, outlined the transition to a data‑confidentiality era, emphasizing the need for privacy‑computing security grading, technical requirements, and industry collaboration to safely circulate data as a new production factor in the post‑2022 big data landscape.

Data GovernancePrivacy Computingconfidential data
0 likes · 11 min read
Data Confidentiality Era: Development and Security – Highlights from Wei Tao’s 2022 Big Data Summit Speech
政采云技术
政采云技术
Jun 21, 2022 · Big Data

Overview of the Traffic Domain and Its Data Governance Architecture

This document presents a comprehensive overview of the traffic domain in a data warehouse, covering its concepts, objectives, guiding principles, core and extension models, data quality, monitoring, scheduling, and operational practices to achieve a complete, accurate, efficient, low‑cost, and high‑value traffic data system while addressing massive data volume, consistency, and SLA challenges.

Big DataData GovernanceData Warehouse
0 likes · 15 min read
Overview of the Traffic Domain and Its Data Governance Architecture
DataFunTalk
DataFunTalk
Jun 2, 2022 · Big Data

Data Governance Practices and Product Strategy at NetEase: Challenges, Solutions, and Future Plans

The article presents NetEase's internal data governance experience, outlining past challenges, current pain points, a comprehensive product strategy covering scope, value quantification, and feature implementation, and shares initial results and future plans to build an automated, end‑to‑end big‑data optimization platform.

Cost OptimizationData GovernanceData Quality
0 likes · 13 min read
Data Governance Practices and Product Strategy at NetEase: Challenges, Solutions, and Future Plans
Architect
Architect
May 25, 2022 · Big Data

Metadata Infrastructure and Governance in Bilibili's Data Platform

The article details how Bilibili built a unified metadata infrastructure—including a URN‑based model, collection pipelines, quality assurance, storage in TiDB/ES/HugeGraph, and query services—to support data discovery, lineage, impact analysis, and governance across its growing data platform.

Big DataData CatalogData Governance
0 likes · 21 min read
Metadata Infrastructure and Governance in Bilibili's Data Platform
Bilibili Tech
Bilibili Tech
May 24, 2022 · Big Data

Metadata Infrastructure and Governance in Bilibili Data Platform

Bilibili’s data platform consolidates scattered metadata into a unified URN‑based model stored across TiDB, Elasticsearch, and HugeGraph, offering batch‑pull and embedded collection, flexible SQL‑like queries, comprehensive lineage mapping, and powering data‑map, lineage‑map, and impact‑analysis tools while planning expanded quality assurance and self‑service dictionaries.

Data GovernanceData LineageData Platform
0 likes · 21 min read
Metadata Infrastructure and Governance in Bilibili Data Platform
Architects Research Society
Architects Research Society
May 17, 2022 · Information Security

Understanding Data Governance, Models, Policies, and Best Practices

The article explains data governance concepts, outlines four common governance models, details key policy elements such as availability, quality, integrity, usability, and security, and highlights the benefits, risks, and best‑practice recommendations for implementing effective data governance in organizations.

Data GovernanceData Managementcompliance
0 likes · 10 min read
Understanding Data Governance, Models, Policies, and Best Practices
DaTaobao Tech
DaTaobao Tech
May 13, 2022 · Big Data

Taobao Big Data Model Governance and DataWorks Co‑development

Taobao’s rapidly expanding technical data system faced naming inconsistencies, low table reuse, and costly, inefficient data usage, prompting a joint effort with DataWorks to digitize model evaluation, enforce standardized governance, deliver intelligent end‑to‑end modeling tools, and launch a development assistant, resulting in a health‑monitoring dashboard, upgraded data maps, and a roadmap for further automation and architecture refinement.

Big DataData GovernanceData Platform
0 likes · 12 min read
Taobao Big Data Model Governance and DataWorks Co‑development
Meituan Technology Team
Meituan Technology Team
May 12, 2022 · Operations

Systematic Data Governance Framework and Practices at Meituan Accommodation

The Meituan Accommodation data governance team shares how they evolved from ad‑hoc, single‑point fixes to a systematic, automated governance framework—covering management, standards, capability, execution, evaluation, and vision—using standardization, digitization, and systematization to achieve measurable quality, cost and efficiency gains across thousands of data assets.

AutomationData GovernanceDigitization
0 likes · 33 min read
Systematic Data Governance Framework and Practices at Meituan Accommodation