Tagged articles
135 articles
Page 1 of 2
DataFunSummit
DataFunSummit
Mar 23, 2026 · Industry Insights

Why Traditional Data Platforms Fail and How Ontology Drives Triple‑Digit ROI

The article analyzes costly data‑platform failures in large enterprises, contrasts traditional data middle‑platforms with Palantir’s ontology‑based approach, and explains a three‑layer architecture that turns raw data into automated business decisions, illustrated with real‑world case outcomes.

Data ManagementData PlatformDigital Twin
0 likes · 5 min read
Why Traditional Data Platforms Fail and How Ontology Drives Triple‑Digit ROI
dbaplus Community
dbaplus Community
Feb 25, 2026 · R&D Management

Escaping Report‑Hell: Practical Steps for Data Team Leaders

Data team leaders overwhelmed by endless ad‑hoc reporting can reclaim strategic time by quantifying workload, negotiating protected work blocks, prioritizing quick‑win automation projects, establishing transparent demand processes, and gradually introducing self‑service BI through focused pilot dashboards and gentle economic incentives.

Data Managementanalytics governanceprocess optimization
0 likes · 10 min read
Escaping Report‑Hell: Practical Steps for Data Team Leaders
DataFunSummit
DataFunSummit
Sep 20, 2025 · Fundamentals

Why Data Governance Fails: Combating Entropy in Integrated Data Systems

This article explains how the natural entropy of massive data sets creates governance challenges, outlines four core obstacles faced by large internet companies, and presents a sustainable, metric‑driven framework—including quality measurement, indicator systems, and future‑oriented operations—to achieve orderly data asset management.

Data GovernanceData ManagementEnterprise Data
0 likes · 18 min read
Why Data Governance Fails: Combating Entropy in Integrated Data Systems
Data Party THU
Data Party THU
Sep 2, 2025 · Industry Insights

How a Tsinghua CTO Is Driving IoTDB Toward Global Leadership

The article profiles Tianmou Technology’s co‑founder and CTO Qiao Jialin, detailing his personal journey, the company’s mission to build the world’s best time‑series database, its technical advantages, adoption by aerospace and other industries, and the cultural values that sustain rapid innovation.

AerospaceData ManagementIoTDB
0 likes · 21 min read
How a Tsinghua CTO Is Driving IoTDB Toward Global Leadership
DataFunTalk
DataFunTalk
Sep 1, 2025 · Big Data

How JD Retail Tackles Data Governance Challenges to Boost Efficiency

JD Retail outlines the growing data management challenges it faces—including asset discovery, architecture agility, development quality, and rising IT costs—and presents a comprehensive data governance framework that leverages standards, agile architecture, development isolation, and resource optimization to improve efficiency and reduce operational expenses.

Big DataData GovernanceData Management
0 likes · 7 min read
How JD Retail Tackles Data Governance Challenges to Boost Efficiency
DataFunTalk
DataFunTalk
Aug 27, 2025 · Big Data

How JD Retail Overcomes Data Governance Challenges to Boost Efficiency

JD Retail confronts growing data volume, redundant models, shared account risks, and rising storage costs, and responds with a comprehensive data governance framework that standardizes data, streamlines architecture, isolates development, and optimizes resources to achieve efficient, secure, and cost‑effective data operations.

Big DataData ArchitectureData Governance
0 likes · 8 min read
How JD Retail Overcomes Data Governance Challenges to Boost Efficiency
MaGe Linux Operations
MaGe Linux Operations
Jul 11, 2025 · Fundamentals

Mastering Ceph: A Deep Dive into Distributed Storage Architecture and Operations

This article provides a comprehensive overview of the open‑source Ceph distributed storage system, covering its core features, architecture components, data placement algorithms, storage interfaces, deployment best practices, operational management, and real‑world use cases for cloud, big data, and backup scenarios.

CephData Managementcloud computing
0 likes · 11 min read
Mastering Ceph: A Deep Dive into Distributed Storage Architecture and Operations
Architects' Tech Alliance
Architects' Tech Alliance
Jun 19, 2025 · Fundamentals

Mastering Enterprise Storage: 100 Essential Fundamentals Explained

This comprehensive guide walks you through 100 key concepts of enterprise storage—including architectures, media, redundancy, performance optimization, security, cloud integration, emerging technologies, standards, and operational best practices—helping IT professionals build a solid knowledge foundation for modern data‑centric environments.

BackupData ManagementEnterprise
0 likes · 23 min read
Mastering Enterprise Storage: 100 Essential Fundamentals Explained
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Jun 18, 2025 · Fundamentals

How to Build a Digital Ecosystem: Planning, Core Solutions, Management & Data Strategies

This article outlines a comprehensive digital ecosystem framework, covering ecosystem planning, core digital solution construction, enhanced digital management and collaboration capabilities, and improved centralized data management and application, illustrated through a series of detailed diagrams.

Data ManagementDigital Transformationdigital strategy
0 likes · 3 min read
How to Build a Digital Ecosystem: Planning, Core Solutions, Management & Data Strategies
Baidu Tech Salon
Baidu Tech Salon
Jun 17, 2025 · Operations

How Baidu Scaled Its Vertical Search: Elastic Scheduling and Data Management Secrets

This article explains how Baidu's vertical search platform tackled massive data growth and scaling challenges by redesigning its data management system, introducing elastic scheduling, decoupling ETCD access, implementing auto‑scaling, and advancing shard expansion to improve performance, stability, and cost efficiency.

Auto ScalingData ManagementSearch Architecture
0 likes · 18 min read
How Baidu Scaled Its Vertical Search: Elastic Scheduling and Data Management Secrets
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 11, 2025 · Cloud Computing

How Alibaba’s Qi Tian Platform Secures Large-Scale Cloud Networks

This article examines Alibaba Cloud’s Qi Tian integrated operation‑management platform, detailing the challenges of massive cloud network management and the innovative data‑fusion, automated change, intent‑aware monitoring, and multi‑plane self‑healing technologies that enable secure, high‑performance operation at million‑device scale.

AIData Managementcloud computing
0 likes · 11 min read
How Alibaba’s Qi Tian Platform Secures Large-Scale Cloud Networks
Chen Tian Universe
Chen Tian Universe
Jun 10, 2025 · Operations

Mastering Reconciliation Systems: Design, Architecture, Data Management & Error Handling

This comprehensive guide walks through the fundamentals of building a reconciliation system, covering overview, data source management, transaction and fund reconciliation models, system architecture, data entity relationships, data acquisition, result handling, project design, engine control, error processing, and the strategic decision of reconciling before or after settlement, providing practical configurations, diagrams, and best‑practice recommendations for reliable financial operations.

Data ManagementError HandlingReconciliation
0 likes · 37 min read
Mastering Reconciliation Systems: Design, Architecture, Data Management & Error Handling
Big Data Tech Team
Big Data Tech Team
May 15, 2025 · Industry Insights

What a Decade of Data Governance Taught Me: From Chaos to AI‑Driven Automation

Over ten years, the author chronicles the evolution of data governance across finance, government, and manufacturing, highlighting early chaos, tool migrations from Excel to Apache Atlas, AI‑powered quality monitoring, strict compliance across jurisdictions, cross‑department collaboration challenges, and the shift toward autonomous, value‑driven data ecosystems.

AIData Managementcompliance
0 likes · 18 min read
What a Decade of Data Governance Taught Me: From Chaos to AI‑Driven Automation
DaTaobao Tech
DaTaobao Tech
Apr 28, 2025 · Frontend Development

Front‑End Architecture and Performance Optimization for a Large‑Scale Chinese New Year Interactive Activity

The article details a large‑scale Chinese New Year interactive activity’s front‑end architecture, describing a layered system for business logic, data abstraction, and animation engines, unified data handling, dynamic animation rendering with downgrade paths, high‑concurrency QPS reduction, resilience measures, and extensive performance and workflow optimizations.

Data ManagementResilienceanimation
0 likes · 15 min read
Front‑End Architecture and Performance Optimization for a Large‑Scale Chinese New Year Interactive Activity
Sohu Tech Products
Sohu Tech Products
Mar 19, 2025 · Artificial Intelligence

Easy DataSet: An Open‑Source Tool for Building Domain‑Specific Datasets and Fine‑Tuning Large Language Models

The article introduces Easy DataSet, an open‑source tool that streamlines the creation of domain‑specific datasets by aggregating public data sources, chunking Markdown documents, generating and managing QA pairs with configurable LLM endpoints, and exporting them in common formats, while outlining its architecture and future roadmap.

AIData ManagementLLM fine-tuning
0 likes · 30 min read
Easy DataSet: An Open‑Source Tool for Building Domain‑Specific Datasets and Fine‑Tuning Large Language Models
JavaEdge
JavaEdge
Feb 2, 2025 · Artificial Intelligence

Mastering LLMOps: From Model Deployment to Scalable AI Operations

This article explains LLMOps—its goals, core activities, benefits, best practices, and how using an LLMOps platform like Dify can dramatically cut development time, simplify prompt engineering, data preparation, monitoring, and deployment of large language models.

AI OperationsData ManagementLLMOps
0 likes · 13 min read
Mastering LLMOps: From Model Deployment to Scalable AI Operations
IT Architects Alliance
IT Architects Alliance
Dec 29, 2024 · Fundamentals

Five Common Mistakes in IT Architecture Design and How to Avoid Them

This article outlines five common IT architecture design errors—neglecting connectivity, postponing security, poor compatibility, uncontrolled data duplication, and unsynchronized environments—illustrated with real cases and provides practical strategies to prevent each pitfall and build resilient, efficient systems.

CompatibilityData ManagementEnvironment Sync
0 likes · 11 min read
Five Common Mistakes in IT Architecture Design and How to Avoid Them
Data Thinking Notes
Data Thinking Notes
Dec 10, 2024 · Big Data

Why Data Asset Inclusion in Financial Statements Is the Next Competitive Edge for Enterprises

The article explains how recent policies make data asset inclusion in financial statements essential, outlines the concepts of data resources, assets and factors, describes the governance, assessment and lifecycle processes, and shows how this practice can boost financing, valuation and digital transformation for companies, economies and nations.

Data AssetData GovernanceData Management
0 likes · 30 min read
Why Data Asset Inclusion in Financial Statements Is the Next Competitive Edge for Enterprises
DataFunSummit
DataFunSummit
Dec 6, 2024 · Artificial Intelligence

Xiaomi AI Data Management Platform: Design, Implementation, and Practice

This article presents the background, design principles, architecture, and practical deployment of Xiaomi's AI Data Management Platform, highlighting how unified cataloging, Fileset integration, and notebook‑based development address AI data governance, cost reduction, and workflow efficiency for both structured and non‑structured data.

AI dataData ManagementFileset
0 likes · 15 min read
Xiaomi AI Data Management Platform: Design, Implementation, and Practice
Big Data Technology & Architecture
Big Data Technology & Architecture
Nov 7, 2024 · Big Data

Douyin Group's Data Management Strategies: Enhancing Metric Stability and Reusability

This article outlines Douyin Group's approach to handling petabyte‑scale data, addressing metric inconsistencies, and improving data product agility through a four‑layer Volcano Engine platform, systematic indicator production‑management‑consumption cycles, organizational design, automation, and future plans for large‑model‑driven metric splitting.

AnalyticsBig DataData Management
0 likes · 20 min read
Douyin Group's Data Management Strategies: Enhancing Metric Stability and Reusability
JD Retail Technology
JD Retail Technology
Oct 29, 2024 · Big Data

JD Unified Storage Practice: Cross‑Region and Tiered Storage on HDFS

This article details JD's large‑scale HDFS unified storage implementation, covering cross‑region storage challenges, topology design, asynchronous block replication, flow‑control mechanisms, tiered storage strategies, automatic hot‑cold data migration, and the resulting performance and cost improvements for big‑data workloads.

Big DataCross-Region StorageData Management
0 likes · 20 min read
JD Unified Storage Practice: Cross‑Region and Tiered Storage on HDFS
Data Thinking Notes
Data Thinking Notes
Oct 15, 2024 · Fundamentals

Why Data Modeling Matters: Unlock Business Value and Governance

This article explains how data modeling drives business value by sparking conversations about data meaning, enabling the creation of useful data objects, and guiding smart decisions on data capture, storage, usage, and integration, while also outlining a governance framework for managing enterprise data models.

Data GovernanceData ManagementEnterprise Data
0 likes · 4 min read
Why Data Modeling Matters: Unlock Business Value and Governance
ITPUB
ITPUB
Sep 30, 2024 · Databases

From SQL‑86 to SQL‑2023: How the Language Evolved Over 38 Years

This article traces the 38‑year evolution of the SQL standard from its first version in 1986 through successive revisions—SQL‑89, SQL‑92, SQL:1999, SQL:2003, SQL:2006/2008, SQL:2011, SQL:2016, and the latest SQL:2023—highlighting key features, extensions, and the growing gap between standards and vendor implementations.

Data ManagementDatabase StandardsJSON
0 likes · 23 min read
From SQL‑86 to SQL‑2023: How the Language Evolved Over 38 Years
Data Thinking Notes
Data Thinking Notes
Aug 13, 2024 · Fundamentals

How to Define, Classify, and Catalog Your Enterprise Data Assets

This article explains what data assets are, how to categorize them by structure and source, outlines a six‑step inventory process, describes a hierarchical catalog architecture, and highlights the four key benefits of a unified data asset directory for modern enterprises.

Data AssetsData CatalogData Governance
0 likes · 11 min read
How to Define, Classify, and Catalog Your Enterprise Data Assets
Continuous Delivery 2.0
Continuous Delivery 2.0
Aug 1, 2024 · Fundamentals

The Essence of Data Governance: Managing Data and People

This article reflects on the challenges of data governance, emphasizing that effective governance involves not only technical data handling but also managing people, aligning responsibilities, fostering cooperation between leadership and business units, and establishing clear ownership and incentive mechanisms.

Data GovernanceData Managementorganizational culture
0 likes · 4 min read
The Essence of Data Governance: Managing Data and People
Software Development Quality
Software Development Quality
Jun 19, 2024 · Operations

Best Practices for Test Data Management and Usage

This guide outlines comprehensive principles for generating, using, and cleaning test data across development, performance, and production environments, emphasizing independence, realism, security, proper permission controls, and systematic synchronization to ensure reliable and safe testing processes.

Data ManagementOperationsSoftware Testing
0 likes · 6 min read
Best Practices for Test Data Management and Usage
StarRocks
StarRocks
Jun 18, 2024 · Databases

How StarRocks Compaction Boosts Query Performance: Mechanics, Tuning, and Best Practices

This article explains StarRocks' compaction process that merges multiple data versions into larger files to reduce I/O, details the scheduler and executor roles, shows how to monitor and control compaction via SQL commands, and provides tuning parameters and best‑practice recommendations for optimal performance.

Data ManagementStarRockscompaction
0 likes · 21 min read
How StarRocks Compaction Boosts Query Performance: Mechanics, Tuning, and Best Practices
21CTO
21CTO
May 29, 2024 · Databases

Why PostgreSQL Is the 2024 SQL Powerhouse Reviving Database Innovation

Amid the rise of vector databases, AI, and cloud computing, this article explains how PostgreSQL’s flexibility, extensibility, and open‑source community are reigniting SQL’s relevance in 2024, uniting OLTP and OLAP workloads and positioning PostgreSQL as the versatile backbone for modern data management.

Data ManagementOpen source databasesextensibility
0 likes · 4 min read
Why PostgreSQL Is the 2024 SQL Powerhouse Reviving Database Innovation
DataFunTalk
DataFunTalk
May 27, 2024 · Big Data

JD Retail’s Unified HDFS Storage: Cross‑Region and Hierarchical Storage Practices

This article details JD Retail’s large‑scale HDFS deployment, describing how cross‑region storage challenges were solved with a full‑copy topology, asynchronous block replication, flow‑control mechanisms, and a tiered storage strategy that automatically moves hot, warm, and cold data among SSD, HDD, and high‑density HDD nodes to improve performance and cut costs.

Big DataData ManagementHDFS
0 likes · 20 min read
JD Retail’s Unified HDFS Storage: Cross‑Region and Hierarchical Storage Practices
dbaplus Community
dbaplus Community
May 1, 2024 · Databases

8 Compelling Reasons SQL Still Dominates After 50 Years

This article outlines eight key reasons why SQL and relational databases remain the dominant data management solution half a century after their invention, covering processing power, proven reliability, community support, simplicity, widespread adoption, open‑source growth, practical query power, and their role alongside NoSQL technologies.

Data ManagementRDBMSSQL vs NoSQL
0 likes · 9 min read
8 Compelling Reasons SQL Still Dominates After 50 Years
DataFunSummit
DataFunSummit
Apr 25, 2024 · Big Data

Paimon Project Overview: Recent Developments, Core Capabilities, and Future Roadmap

This article presents a comprehensive overview of the Apache‑incubated Paimon project, covering its evolution from Flink Table Store, the current features of primary‑key and log tables, management tools such as snapshots, tags and branches, performance optimizations for Flink and Spark, and a detailed roadmap of upcoming functionalities.

Big DataData ManagementFlink
0 likes · 23 min read
Paimon Project Overview: Recent Developments, Core Capabilities, and Future Roadmap
Data Thinking Notes
Data Thinking Notes
Apr 18, 2024 · Information Security

How to Implement Effective Data Classification and Grading for Secure Data Management

Data classification and grading, essential components of data security governance, involve defining data categories, assigning sensitivity levels, adhering to national standards, and establishing organizational processes to ensure compliant, secure, and value‑driven data handling across enterprises.

Data GovernanceData ManagementInformation Security
0 likes · 20 min read
How to Implement Effective Data Classification and Grading for Secure Data Management
DataFunSummit
DataFunSummit
Apr 12, 2024 · Artificial Intelligence

Exploring the Application of AI Large Models in the Automotive Industry

This article provides a comprehensive overview of AI large‑model development, defines what constitutes a large model, discusses current challenges such as cost, privacy and safety, and examines how these models can improve efficiency across automotive marketing, sales, service, data management, infrastructure building, and future automation stages.

AIData Managementautomotive
0 likes · 13 min read
Exploring the Application of AI Large Models in the Automotive Industry
Data Thinking Notes
Data Thinking Notes
Jan 30, 2024 · Operations

How Banks Can Build an Effective Data Governance Framework

This article outlines a two‑step approach for banks to design a data governance system—clarifying organizational responsibilities and constructing a layered institutional framework—while detailing cross‑department collaboration, head‑office and branch coordination, and practical policy, procedure, and work‑detail levels to sustain continuous improvement and support digital transformation.

BankingData GovernanceData Management
0 likes · 10 min read
How Banks Can Build an Effective Data Governance Framework
DevOps
DevOps
Jan 17, 2024 · Operations

Agile Data Management: Principles, Practices, and Implementation Guide

This article explains how agile methodologies can be applied to data management, covering the need for agile data practices, core principles, iterative modeling, governance, CI/CD pipelines, tooling, metrics, security, case studies, challenges, and future outlooks in a comprehensive, step‑by‑step guide.

Data GovernanceData ManagementDataOps
0 likes · 13 min read
Agile Data Management: Principles, Practices, and Implementation Guide
DataFunSummit
DataFunSummit
Dec 20, 2023 · Cloud Native

Building a Cloud‑Native Lakehouse with Apache Iceberg and Amoro

This article introduces the background, challenges, and cloud‑native solutions of lakehouse architecture, explains Apache Iceberg’s open table format and its cloud‑native features, details Amoro’s management and self‑optimizing capabilities, showcases three real‑world cloud migration cases, and outlines future development plans.

AmoroApache IcebergData Management
0 likes · 12 min read
Building a Cloud‑Native Lakehouse with Apache Iceberg and Amoro
Architects Research Society
Architects Research Society
Nov 7, 2023 · Big Data

Five Hidden Costs of Working with Alternative Data

The article outlines five often overlooked expenses that IT managers face when integrating alternative data—vendor selection, skilled staffing, data ownership verification, model updates, and storage tooling—and offers strategies to mitigate each cost.

Data ManagementIT costsalternative data
0 likes · 9 min read
Five Hidden Costs of Working with Alternative Data
Data Thinking Notes
Data Thinking Notes
Nov 5, 2023 · Fundamentals

Why Poor Data Quality Costs Companies $15M Annually and How to Fix It

Low‑quality data can cost enterprises up to $15 million each year, making data quality management essential for accurate decision‑making, compliance, and operational efficiency, and this article explains its importance, evaluation dimensions, common issues, monitoring metrics, responsible roles, and a three‑phase management framework of prevention, control, and remediation.

Big DataBusiness IntelligenceData Governance
0 likes · 32 min read
Why Poor Data Quality Costs Companies $15M Annually and How to Fix It
Architects Research Society
Architects Research Society
Oct 21, 2023 · Fundamentals

Information Governance: Roles, Responsibilities, and Key Processes

The article explains information governance as a business‑driven program that ensures data accuracy, completeness, consistency, accessibility, and security, outlines three essential roles, describes the data administrator’s duties, and details the key procedures and their relationship to corporate and IT governance.

Data ManagementData Qualitydata stewardship
0 likes · 11 min read
Information Governance: Roles, Responsibilities, and Key Processes
DataFunTalk
DataFunTalk
Oct 5, 2023 · Big Data

Building a Unified Streaming‑Batch Lakehouse with Amoro Mixed Iceberg

This article describes how Shanghai Steel Union leveraged Amoro Mixed Iceberg on top of Apache Iceberg to create a unified streaming‑batch lakehouse, addressing small‑file and upsert challenges, simplifying architecture, improving data freshness, and providing a scalable solution for real‑time and batch analytics.

AmoroApache IcebergBig Data
0 likes · 13 min read
Building a Unified Streaming‑Batch Lakehouse with Amoro Mixed Iceberg
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Sep 26, 2023 · Artificial Intelligence

How Baidu’s Autonomous Driving Toolchain Powers Production‑Ready AI

This article summarizes Baidu’s senior manager Xu Peng’s presentation on the evolution from R&D‑focused to production‑ready autonomous driving toolchains, highlighting cloud simulation, data‑closed‑loop, AI‑driven labeling, compliance, efficiency, service, and cost challenges, and outlining Baidu’s integrated solutions for the automotive industry.

AICloud ServicesData Management
0 likes · 11 min read
How Baidu’s Autonomous Driving Toolchain Powers Production‑Ready AI
Data Thinking Notes
Data Thinking Notes
Sep 3, 2023 · Big Data

How to Build an Effective Data Governance Framework: Steps & Best Practices

This article outlines a comprehensive data governance framework for Chinese enterprises, covering organizational structures, data asset inventory, six‑stage methodology, and the creation of unified data standards and quality rules to support effective digital transformation and data‑driven decision making.

Big DataData GovernanceData Management
0 likes · 13 min read
How to Build an Effective Data Governance Framework: Steps & Best Practices
Baidu Geek Talk
Baidu Geek Talk
Aug 30, 2023 · Industry Insights

Midgard: Adaptive Storage Management for Search – From Simple Tables to Intelligent Layers

This article examines how Baidu's search service evolved its storage architecture—from a basic key‑value table to a hybrid HDD/Redis cache and finally to a sharded, multi‑collection design—culminating in Midgard, an intelligent storage‑layer manager that abstracts and optimizes data access for changing business needs.

BackendData ManagementMidgard
0 likes · 11 min read
Midgard: Adaptive Storage Management for Search – From Simple Tables to Intelligent Layers
Architects Research Society
Architects Research Society
Jul 21, 2023 · Big Data

Understanding Data Fabric Architecture: Key Pillars for Modern Data Management and Integration

The article explains what Data Fabric (also called data weaving) is, outlines its four essential pillars—metadata collection, active metadata, knowledge‑graph management, and a robust integration backbone—and shows how D&A leaders can adopt this design to achieve agile, AI‑enabled data integration across hybrid and multi‑cloud environments.

AI/MLData Managementmetadata
0 likes · 10 min read
Understanding Data Fabric Architecture: Key Pillars for Modern Data Management and Integration
Data Thinking Notes
Data Thinking Notes
Jul 12, 2023 · Fundamentals

Why Metadata Governance Is the Backbone of Modern Data Platforms

This article explains how metadata serves as essential infrastructure for data platforms, detailing Huawei's classification framework, governance challenges, management architecture, integrated modeling, data lake handling, service management, and data map construction to bridge business and IT domains.

Data GovernanceData LakeData Management
0 likes · 24 min read
Why Metadata Governance Is the Backbone of Modern Data Platforms
Data Thinking Notes
Data Thinking Notes
May 14, 2023 · Big Data

Why Data Governance Matters: Boosting Data Quality and Business Value

Data governance, the overarching framework for evaluating, guiding, and supervising an organization’s data lifecycle—from collection to utilization—ensures high data quality, compliance, and security, ultimately maximizing data value and supporting AI-driven initiatives, while distinguishing itself from data management and data control through a strategic, top‑down approach.

Big DataData GovernanceData Management
0 likes · 8 min read
Why Data Governance Matters: Boosting Data Quality and Business Value
DataFunSummit
DataFunSummit
Apr 23, 2023 · Fundamentals

Data Governance Practices and Implementation Path at Dipu Technology

This article presents Dipu Technology's comprehensive data governance methodology, covering construction paths, a typical enterprise digital platform framework, core governance components, practical case studies, and a Q&A session that together illustrate how businesses can design, implement, and sustain effective data governance across the organization.

Data CatalogData GovernanceData Management
0 likes · 19 min read
Data Governance Practices and Implementation Path at Dipu Technology
DataFunSummit
DataFunSummit
Apr 19, 2023 · Fundamentals

Data Governance Construction Path and Practice by Dipu Technology

The article presents Dipu Technology's comprehensive approach to data governance, outlining construction pathways, a typical enterprise digital platform framework, core governance concepts, implementation steps, case studies, and a Q&A session that together illustrate how to design, execute, and sustain effective data governance across business domains.

Data GovernanceData ManagementEnterprise Data
0 likes · 22 min read
Data Governance Construction Path and Practice by Dipu Technology
Big Data Technology Architecture
Big Data Technology Architecture
Apr 19, 2023 · Big Data

Why the Big Data Era Is Over

The article argues that the era of big data is ending, showing that most organizations store only modest amounts of data, that storage costs outweigh benefits, and that modern cloud and analytics tools allow efficient processing without needing massive datasets.

AnalyticsBig DataData Management
0 likes · 16 min read
Why the Big Data Era Is Over
Data Thinking Notes
Data Thinking Notes
Apr 9, 2023 · Big Data

Why Data Quality Is the Hidden Driver of Big Data Success

In the big‑data era, high‑quality data are essential for reliable analytics, and this article explains data‑quality concepts, key dimensions, analysis methods for missing values, outliers, inconsistencies and duplicates, as well as practical management practices to ensure data assets become a competitive advantage.

Big DataData GovernanceData Management
0 likes · 15 min read
Why Data Quality Is the Hidden Driver of Big Data Success
Model Perspective
Model Perspective
Apr 8, 2023 · Fundamentals

Boost Your Math Modeling Success with Essential Online Collaboration Tools

Effective use of online collaboration platforms—such as Shimo Docs, WPS, Feishu, Google Docs, Notion, and Microsoft Teams—can streamline team communication, document sharing, version control, and data management in math modeling competitions, ensuring every minute counts and teams stay coordinated under tight deadlines.

Data Managementmath modelingonline collaboration
0 likes · 4 min read
Boost Your Math Modeling Success with Essential Online Collaboration Tools
Data Thinking Notes
Data Thinking Notes
Mar 26, 2023 · Big Data

Why Data Governance Is the Key to Unlocking Your Data’s True Value

This article explains how effective data governance transforms raw data into a trusted enterprise asset, outlines common pitfalls such as backward and passive governance, and presents a structured, four‑phase approach—including organizational setup, standards, platform selection, and continuous operations—to successfully implement data governance at scale.

Big DataData GovernanceData Management
0 likes · 10 min read
Why Data Governance Is the Key to Unlocking Your Data’s True Value
Data Thinking Notes
Data Thinking Notes
Mar 14, 2023 · Fundamentals

Practical Data Governance Guide for SMEs: Strategies, Steps, and Tools

This article explains why data governance matters for small‑to‑medium enterprises, outlines its four key values, describes essential governance components, and provides a step‑by‑step framework—including timing, roles, standards, execution mechanisms, tools, and common pitfalls—to help organizations implement effective data governance.

Data GovernanceData ManagementPerformance Monitoring
0 likes · 16 min read
Practical Data Governance Guide for SMEs: Strategies, Steps, and Tools
Tencent Cloud Developer
Tencent Cloud Developer
Mar 8, 2023 · Artificial Intelligence

Building a Scalable Recommendation System for WeChat Games: Architecture and Implementation

The article describes WeChat Games’ scalable recommendation system, detailing its four‑component architecture—offline ML platform, unified management, online DAG‑based engine, and peripheral services—along with a hybrid algorithm library, feature engineering, real‑time monitoring, and solutions that boost engagement across diverse game recommendation scenarios.

Data ManagementDeep LearningReal-time Processing
0 likes · 28 min read
Building a Scalable Recommendation System for WeChat Games: Architecture and Implementation
Big Data Technology & Architecture
Big Data Technology & Architecture
Mar 3, 2023 · Fundamentals

Understanding Data Management Principles and Governance: Insights from DMBOK

This article explains the core principles, strategies, frameworks, and governance practices of data management based on DAMA's DMBOK, covering data lifecycle, value, leadership responsibilities, strategic planning, governance models, metrics, and implementation guidelines to help organizations derive business value from high‑quality data.

DMBOKData GovernanceData Management
0 likes · 17 min read
Understanding Data Management Principles and Governance: Insights from DMBOK
Data Thinking Notes
Data Thinking Notes
Feb 9, 2023 · Fundamentals

Why Data Standards Are the Key to Unlocking Business Value

This article explains how data standards form the foundation of data governance, clarifies data assets, breaks silos, accelerates data flow, and outlines their definitions, benefits, common challenges, essential components, and best practices for effective implementation.

Data GovernanceData Managementbest practices
0 likes · 14 min read
Why Data Standards Are the Key to Unlocking Business Value
Data Thinking Notes
Data Thinking Notes
Feb 2, 2023 · Fundamentals

Why Metadata Management Is the Key to Unlocking Data Value

This article explains how effective metadata management provides context, improves data quality, enables data lineage tracing, supports governance, and ultimately turns raw data into valuable assets for enterprises navigating complex, evolving data environments.

Data GovernanceData LineageData Management
0 likes · 35 min read
Why Metadata Management Is the Key to Unlocking Data Value
DataFunTalk
DataFunTalk
Jan 31, 2023 · Big Data

Data Governance Strategies: Concepts, Practices, and Case Studies

This article explains the importance of data governance for organizations, distinguishes narrow and broad governance scopes, outlines strategic principles, and presents multiple real‑world case studies from leading companies, offering practical insights for building effective data governance frameworks.

Data Managementcase studystrategy
0 likes · 7 min read
Data Governance Strategies: Concepts, Practices, and Case Studies
DataFunSummit
DataFunSummit
Jan 27, 2023 · Big Data

Data Governance Strategies: Principles, Practices, and Case Studies

The article explains the importance of data governance, distinguishes narrow and broad governance, outlines strategic principles such as systemic engineering and prioritization, and presents eight case studies from leading Chinese tech companies illustrating practical implementations and effective strategies.

Big DataData GovernanceData Management
0 likes · 8 min read
Data Governance Strategies: Principles, Practices, and Case Studies
DataFunSummit
DataFunSummit
Jan 21, 2023 · Big Data

Building and Evolving Data Management Systems: From IT to DT Era, Standards, Models, and Marketization

This article outlines the evolution of data management in the big‑data era, covering the history of the industry, key governance frameworks such as DMBOK, DCMM and DMM, the steps to construct a data‑management system, the requirements for a data‑factor market, and an introduction to the DataEasy company and its services.

Big DataDCMMDMBOK
0 likes · 15 min read
Building and Evolving Data Management Systems: From IT to DT Era, Standards, Models, and Marketization
DataFunTalk
DataFunTalk
Jan 19, 2023 · Big Data

Data Governance Strategies: Concepts, Practices, and Case Studies

The article explains the importance of data governance for organizations handling big data, outlines narrow and broad governance approaches, presents strategic design principles, and shares practical case studies from leading companies, while also offering a downloadable ebook of governance strategies.

Case StudiesData Managementdata security
0 likes · 7 min read
Data Governance Strategies: Concepts, Practices, and Case Studies
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 12, 2023 · Operations

What Is DataOps and How Can It Transform Your Data Management?

DataOps, the data‑centric counterpart of DevOps, combines agile principles, standardized tools, and cross‑team collaboration to manage the full data lifecycle—from integration and development to storage, governance, and service—enabling organizations to handle massive, diverse datasets efficiently, reduce silos, and turn data into actionable value.

Big DataData GovernanceData Integration
0 likes · 15 min read
What Is DataOps and How Can It Transform Your Data Management?
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jan 11, 2023 · Artificial Intelligence

How Baidu Cloud Powers End-to-End Autonomous Driving Data Ops and AI

This article outlines Baidu Intelligent Cloud's comprehensive, low‑cost solution for autonomous‑driving data pipelines—from road data collection and compliance, through annotation, management, and model training, to simulation—highlighting the platform's tools, services, and security measures that accelerate development.

AIData ManagementModel Training
0 likes · 18 min read
How Baidu Cloud Powers End-to-End Autonomous Driving Data Ops and AI
DataFunTalk
DataFunTalk
Dec 30, 2022 · Fundamentals

Financial Data Governance: From 0 to 1 – Practices and Insights

This article examines the current state of financial data governance, outlines the external and internal drivers, presents a step‑by‑step architecture framework, discusses domain‑level coordination, shares practical implementations and Q&A, and highlights how AI and automation can enhance governance in the financial sector.

AIData ManagementFinancial Services
0 likes · 25 min read
Financial Data Governance: From 0 to 1 – Practices and Insights
ByteDance Data Platform
ByteDance Data Platform
Dec 28, 2022 · Big Data

How Cloud Data Warehouses Are Shaping the Future of Big Data and DataOps

This article examines the four‑stage evolution of data warehouses, highlights the cost‑effective, scalable advantages of cloud‑native warehouses, explores the rapid growth of data‑management infrastructure, and discusses the emerging practices of DataOps and AI integration that are redefining modern data stacks.

AIBig DataData Management
0 likes · 15 min read
How Cloud Data Warehouses Are Shaping the Future of Big Data and DataOps
DataFunSummit
DataFunSummit
Dec 23, 2022 · Artificial Intelligence

Data‑Centric AI Practices for Content Moderation at NetEase Yidun

The article presents NetEase Yidun’s data‑centric AI approach to content moderation, covering the background of Data‑Centric AI, the specific business and data challenges of content safety, comprehensive data pipelines—including collection, labeling, augmentation, selection, cleaning, iteration and testing—and the role of self‑, semi‑ and weak‑supervised learning in enhancing algorithm performance.

Algorithm InnovationData ManagementData‑Centric AI
0 likes · 19 min read
Data‑Centric AI Practices for Content Moderation at NetEase Yidun
Laravel Tech Community
Laravel Tech Community
Nov 20, 2022 · Databases

11 Popular MySQL Graphical Management Tools

This article introduces eleven widely used graphical tools for MySQL administration, describing each tool's features, platform support, and official download links, helping developers and DBAs choose the most suitable solution for managing MySQL databases efficiently.

Administration ToolsData ManagementDatabase GUI
0 likes · 8 min read
11 Popular MySQL Graphical Management Tools
DevOps Cloud Academy
DevOps Cloud Academy
Nov 5, 2022 · Fundamentals

Understanding Data Architecture: Definitions, Problems Solved, Core Components, and Future Trends

This article explains what data architecture is, why it is essential for linking business and technology, outlines its main components such as data models, data flows, value streams and standards, and discusses emerging trends toward service‑oriented, consumption‑focused data architectures.

Data ArchitectureData GovernanceData Management
0 likes · 9 min read
Understanding Data Architecture: Definitions, Problems Solved, Core Components, and Future Trends
DataFunSummit
DataFunSummit
Nov 1, 2022 · Big Data

Case Study of DCMM Standard Implementation at State Grid Tianjin Electric Power

This article details State Grid Tianjin Electric Power's early adoption and successful certification of the national DCMM data management maturity model, outlining background, certification milestones, systematic practices, and lessons learned that illustrate how data governance, architecture, and application strategies drive digital transformation.

Big DataDCMMData Governance
0 likes · 11 min read
Case Study of DCMM Standard Implementation at State Grid Tianjin Electric Power
DevOps Cloud Academy
DevOps Cloud Academy
Oct 27, 2022 · Big Data

Understanding DataOps: Concepts, Standards, and Enterprise Practices

This article explains DataOps as a methodology for improving data analysis quality and efficiency, outlines its origins, standards, and maturity model, and presents practical insights and case studies from Chinese enterprises on how DataOps addresses common data engineering challenges and drives digital transformation.

Big DataData GovernanceData Management
0 likes · 12 min read
Understanding DataOps: Concepts, Standards, and Enterprise Practices
Laravel Tech Community
Laravel Tech Community
Aug 17, 2022 · Databases

Comprehensive SQL Server Database Operations and Techniques Guide

This article provides a comprehensive collection of SQL Server commands and techniques, covering database creation, table manipulation, queries, indexing, backup, replication, and advanced operations, offering practical examples and code snippets for developers and database administrators.

Data ManagementSQL ServerT-SQL
0 likes · 23 min read
Comprehensive SQL Server Database Operations and Techniques Guide
Big Data Technology Architecture
Big Data Technology Architecture
Jul 28, 2022 · Big Data

Reflections on Data Governance Challenges and Approaches

The author shares a candid account of transitioning from a non‑data role to confronting data‑centric bottlenecks, describing the current state of data projects, common pitfalls, and practical thoughts on simplifying data governance within limited resources and budget constraints.

Big DataDAMAData Governance
0 likes · 7 min read
Reflections on Data Governance Challenges and Approaches
Architects Research Society
Architects Research Society
May 17, 2022 · Information Security

Understanding Data Governance, Models, Policies, and Best Practices

The article explains data governance concepts, outlines four common governance models, details key policy elements such as availability, quality, integrity, usability, and security, and highlights the benefits, risks, and best‑practice recommendations for implementing effective data governance in organizations.

Data GovernanceData Managementcompliance
0 likes · 10 min read
Understanding Data Governance, Models, Policies, and Best Practices
Architect
Architect
Apr 25, 2022 · Cloud Native

Designing a Cloud‑Native Intelligent Data Architecture for Baidu Search Platform

This article presents a cloud‑native redesign of Baidu's search middle‑platform that introduces intelligent data management, elastic scaling, on‑demand resource allocation, precise fan‑out, and localized computation to address efficiency, cost, stability, and performance challenges of large‑scale search workloads.

Data ManagementSearch Architecturecloud-native
0 likes · 14 min read
Designing a Cloud‑Native Intelligent Data Architecture for Baidu Search Platform
DataFunTalk
DataFunTalk
Mar 30, 2022 · Big Data

NetEase Big Data Platform: HDFS Optimization and Practice

This article presents NetEase's big data platform architecture, detailing multi‑layer storage and compute design, HDFS deployment challenges, NameNode and NameSpace performance optimizations, cluster scaling strategies, data tiering, hardware upgrades, and real‑world business use cases, illustrating practical large‑scale big data engineering.

Big DataCluster OptimizationData Management
0 likes · 23 min read
NetEase Big Data Platform: HDFS Optimization and Practice
DataFunSummit
DataFunSummit
Jan 23, 2022 · Big Data

MobTech's Integrated Data Governance Practices and Architecture

This article presents MobTech's comprehensive data governance and security practices, covering the necessity of governance, challenges in large‑scale data environments, the full‑link governance chain, modular architecture, and specific implementations for financial risk‑control scenarios.

Big DataData ArchitectureData Governance
0 likes · 19 min read
MobTech's Integrated Data Governance Practices and Architecture
DataFunTalk
DataFunTalk
Jan 8, 2022 · Big Data

Lakehouse: Concepts, Architecture, Implementation, and Cloud Practices

This article provides a comprehensive overview of the Lakehouse paradigm, tracing its origins from traditional data warehouses and data lakes, comparing architectures, detailing core components such as Delta Lake and Iceberg, and illustrating practical cloud implementations and future directions.

Apache IcebergBig DataCloud Data Platform
0 likes · 14 min read
Lakehouse: Concepts, Architecture, Implementation, and Cloud Practices
Baidu Geek Talk
Baidu Geek Talk
Dec 15, 2021 · Cloud Native

Cloud-Native Intelligent Data Management Architecture for Baidu Search Platform

Cloud-native redesign of Baidu's search middle platform introduces partition, shard, replica, and addressing controllers that enable elastic scaling, on-demand resource allocation, precise fan‑out, and localized computation, reducing capacity adjustment time from weeks to hours, cutting costs by 30‑80%, raising availability above 99.9% and halving query latency.

Data ManagementSearch Architecturecloud-native
0 likes · 17 min read
Cloud-Native Intelligent Data Management Architecture for Baidu Search Platform
DataFunSummit
DataFunSummit
Dec 14, 2021 · Big Data

Data Map: Background, Definition, and Youzan’s Practical Implementation

This article introduces the concept of a data map, explains its background and goals, describes Youzan’s end‑to‑end data‑map practice—including full data lineage, search, management, link analysis, impact estimation, and optimization—and concludes with a summary and future outlook.

Big DataData GovernanceData Lineage
0 likes · 16 min read
Data Map: Background, Definition, and Youzan’s Practical Implementation
Architects Research Society
Architects Research Society
Nov 13, 2021 · Databases

Choosing the Right Databases for IoT Applications

The article explains why the Internet of Things generates massive, diverse data streams that require specialized databases, outlines key selection criteria, describes common IoT data types, and reviews several open‑source databases—InfluxDB, CrateDB, MongoDB, RethinkDB, SQLite, and Cassandra—highlighting their strengths for IoT workloads.

Data ManagementIoTNoSQL
0 likes · 10 min read
Choosing the Right Databases for IoT Applications