Tagged articles
548 articles
Page 1 of 6
Digital Planet
Digital Planet
May 16, 2026 · Industry Insights

Why Data Capability Is the New Moat in the AI Era

The article argues that as AI models become commoditized, the decisive factor for enterprises is mastering data governance, data‑AI integration, and data flow, turning data into a strategic asset that creates a three‑layer moat and drives sustainable AI ROI.

AIAI industry trendsData Assets
0 likes · 13 min read
Why Data Capability Is the New Moat in the AI Era
dbaplus Community
dbaplus Community
May 14, 2026 · Big Data

Building a ‘One‑Sentence Bank’: Big Data and AI Fusion for Small Banks

The article outlines the evolution of big data in banking, compares management models for heterogeneous data, describes the shift from data engineering to knowledge engineering, introduces LLMOps for high‑quality knowledge bases, and details how integrating AI and data can enable a “one‑sentence bank” that answers queries and executes tasks.

BankingBig DataData Governance
0 likes · 22 min read
Building a ‘One‑Sentence Bank’: Big Data and AI Fusion for Small Banks
Smart Workplace Lab
Smart Workplace Lab
May 10, 2026 · Artificial Intelligence

When Your Internal AI Is Fed Bad Data, How to Fix It?

The article recounts a real incident where an AI‑generated SOP cited outdated policy because a knowledge base was overloaded with unchecked historical documents, then outlines a step‑by‑step protocol—including corpus cleaning, version locking, and isolation zones—to prevent data contamination and ensure reliable AI outputs.

AIData GovernanceKnowledge Base
0 likes · 7 min read
When Your Internal AI Is Fed Bad Data, How to Fix It?
DataFunSummit
DataFunSummit
May 10, 2026 · Big Data

How Lance File Format v2.2 Accelerates, Cuts Costs, and Governs Multimodal Data

Lance File Format v2.2 tackles the AI data explosion by delivering hundred‑fold random‑read performance, advanced two‑layer compression, zero‑cost schema evolution, Git‑style versioning, external blob handling, and a roadmap toward native media support and intelligent encoding, positioning it as a core infrastructure for large‑scale multimodal workloads.

Data GovernanceFile FormatIO performance
0 likes · 14 min read
How Lance File Format v2.2 Accelerates, Cuts Costs, and Governs Multimodal Data
Digital Planet
Digital Planet
May 7, 2026 · Industry Insights

DRP vs. ERP: Why the New Digital Platform Complements, Not Replaces, Existing Systems

The article analyzes the three meanings of DRP, explains its role as a group‑level data‑driven control hub, contrasts it with ERP’s execution focus, debunks the myth that DRP will replace ERP, and outlines four practical obstacles—cognitive bias, data silos, organizational resistance, and talent shortage—along with concrete steps to ensure successful implementation.

DRPData GovernanceDigital Transformation
0 likes · 15 min read
DRP vs. ERP: Why the New Digital Platform Complements, Not Replaces, Existing Systems
DataFunSummit
DataFunSummit
May 1, 2026 · Artificial Intelligence

From “Lobster” to Ontology: Unveiling the Next Wave of Self‑Evolving AI Agents and Data Governance

The DACon conference in Shanghai gathered over 8,000 developers, managers and experts, delivering 50 talks that explored self‑evolving AI agents, data‑centric ontology, Agent‑Ready big‑data infrastructure, AI‑AR ecosystem evolution, and the emerging challenges of Agentic data governance.

AI agentsAI+ARAgentic Data Protocol
0 likes · 11 min read
From “Lobster” to Ontology: Unveiling the Next Wave of Self‑Evolving AI Agents and Data Governance
DataFunSummit
DataFunSummit
Apr 30, 2026 · Industry Insights

Why Palantir’s Edge Isn’t Unique – Chinese Enterprises Can Replicate Its Methodology

A panel of industry experts dissected Palantir’s rapid growth, revealing that its advantage lies in a systematic ontology‑driven methodology rather than exclusive technology, and argued that Chinese firms can adopt the same approach if they first resolve data governance, semantic consistency, and management challenges.

AI agentsCapability vs CompetencyData Governance
0 likes · 26 min read
Why Palantir’s Edge Isn’t Unique – Chinese Enterprises Can Replicate Its Methodology
DataFunTalk
DataFunTalk
Apr 28, 2026 · Artificial Intelligence

From “Lobster” to Ontology: DACon Reveals the Next Trend in Self‑Evolving AI Agents

The DACon conference in Shanghai gathered over 8,000 developers and experts, showcasing 50 talks that explored self‑evolving AI agents, the open‑source GenericAgent framework, data‑governance ontology, Agent‑Ready big‑data infrastructure, and AI+AR ecosystems, while highlighting practical case studies and future industry directions.

AI agentsAI+ARBig Data
0 likes · 11 min read
From “Lobster” to Ontology: DACon Reveals the Next Trend in Self‑Evolving AI Agents
Smart Workplace Lab
Smart Workplace Lab
Apr 27, 2026 · Industry Insights

Data‑Application Illusion, Agentic AI, and New‑Hire Employment – US‑China AI Workplace Weekly (Apr 21‑27)

The report analyzes why AI project failure rates remain 70‑85%, how data‑application illusion and workslop erode productivity, and why integrating Agentic AI into native workflows is the only viable path, while highlighting a 16% drop in Gen Z AI‑related job placements and practical mitigation strategies.

AI workplaceAgentic AIData Governance
0 likes · 8 min read
Data‑Application Illusion, Agentic AI, and New‑Hire Employment – US‑China AI Workplace Weekly (Apr 21‑27)
DataFunSummit
DataFunSummit
Apr 27, 2026 · Artificial Intelligence

How Tencent Games Leverages AI to Turn Data Governance into a Service

Tencent Games’ data governance team details an AI‑driven, end‑to‑end semantic framework that shifts traditional rule‑based data management to a service‑oriented model, cutting storage waste by 30 %, halving development time, and boosting asset recommendation accuracy to 95 % across its global gaming platform.

AIBig DataData Governance
0 likes · 19 min read
How Tencent Games Leverages AI to Turn Data Governance into a Service
DataFunSummit
DataFunSummit
Apr 26, 2026 · Artificial Intelligence

How AI Powers an Immersive Vibe Analyzing Experience for Data Exploration

The article analyzes how AskTable uses AI agents to replace static BI dashboards with an immersive, real‑time data‑analysis canvas, enabling business users to query multiple data sources in seconds, while addressing accuracy, table‑finding, and fine‑grained permission challenges.

AIAI AgentAskTable
0 likes · 15 min read
How AI Powers an Immersive Vibe Analyzing Experience for Data Exploration
Digital Planet
Digital Planet
Apr 26, 2026 · Industry Insights

Why Most Companies Aren’t Ready for AI Yet

The article argues that the failure of many enterprises to benefit from AI is not due to a lack of technology but to insufficient digital foundations, disorganized processes, poor data quality, cultural resistance, and a shortage of skilled talent, turning AI projects into costly showpieces.

AI adoptionData GovernanceDigital Transformation
0 likes · 9 min read
Why Most Companies Aren’t Ready for AI Yet
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 24, 2026 · Artificial Intelligence

How to Build a Truly Usable AI‑Powered Natural Language Query System from Scratch

The article analyzes why natural‑language database queries often fail, outlines four technical routes, presents a five‑layer architecture with a business‑semantic middle layer, shares engineering best practices, a real‑world case study, and a product comparison to guide data companies in designing an effective intelligent query system.

AIData GovernanceNL2SQL
0 likes · 16 min read
How to Build a Truly Usable AI‑Powered Natural Language Query System from Scratch
DataFunSummit
DataFunSummit
Apr 24, 2026 · Artificial Intelligence

AI‑Driven Data Governance as a Service: Tencent Games' Paradigm Shift

This talk details how Tencent Games leverages AI to transform its data governance from rule‑based, passive processes into a semantic, service‑oriented paradigm, addressing resource waste, low collaboration efficiency, and scalability challenges while delivering measurable improvements in cost, speed, and asset quality.

AIAutomationBig Data
0 likes · 19 min read
AI‑Driven Data Governance as a Service: Tencent Games' Paradigm Shift
Big Data Tech Team
Big Data Tech Team
Apr 22, 2026 · Big Data

Inside Big Tech: Full Breakdown of AI Agents for Data Warehouse Governance

The article analyzes how leading internet companies embed AI agents across the entire data‑warehouse lifecycle to automate governance, presenting real‑world case studies from Alibaba, ByteDance, JD.com and Tencent, and quantifies benefits such as over 65% reduction in manual effort, 50% drop in metric duplication, and a 40% boost in resource utilization.

AI agentsAutomationBig Data
0 likes · 10 min read
Inside Big Tech: Full Breakdown of AI Agents for Data Warehouse Governance
DataFunTalk
DataFunTalk
Apr 21, 2026 · Industry Insights

How AI Agents Are Redefining Data Governance: 5 Key Shifts and 3 Strategic Solutions

In the AI era, data consumption moves from a few technical users to all business staff, forcing a fundamental redesign of data governance across five dimensions—resource consumption, frequency, semantics, knowledge base, and modality—and proposing three actionable strategies to make data semantically rich, fully multimodal, and AI‑consumable.

AIData GovernanceEnterprise Analytics
0 likes · 18 min read
How AI Agents Are Redefining Data Governance: 5 Key Shifts and 3 Strategic Solutions
Big Data Tech Team
Big Data Tech Team
Apr 20, 2026 · Artificial Intelligence

How AI is Redefining Data Workflows: 4 Game‑Changing Paradigms Explained

The article outlines four AI‑driven breakthroughs reshaping data work—AI‑for‑Data automation, generative‑AI‑enhanced governance, NoETL real‑time lake ingestion, and next‑generation SQL analysis—detailing their problems, concrete case studies, implementation steps, pitfalls, and measurable efficiency gains.

AI for DataData GovernanceNoETL
0 likes · 12 min read
How AI is Redefining Data Workflows: 4 Game‑Changing Paradigms Explained
DataFunTalk
DataFunTalk
Apr 19, 2026 · Industry Insights

From ChatBI to DataAgent: Turning AI Demos into Trusted Enterprise Decision Engines

The live discussion breaks down the practical challenges of building enterprise‑grade Data Agents—from unified semantic layers and prompt engineering versus model fine‑tuning, to table discovery, multi‑turn memory, trust, cost control, and continuous improvement—showing why real‑world AI success hinges on system reliability rather than raw model power.

AIData AgentData Governance
0 likes · 17 min read
From ChatBI to DataAgent: Turning AI Demos into Trusted Enterprise Decision Engines
DataFunSummit
DataFunSummit
Apr 18, 2026 · Industry Insights

Why Palantir’s Ontology Beats Traditional Data Models – Insights from Industry Leaders

A closed‑door forum gathered experts from academia and leading Chinese tech firms to dissect Palantir’s ontology‑driven approach, comparing it with conventional data modeling, exploring AI integration, and highlighting the managerial and technical challenges that determine its success in enterprise environments.

Data GovernanceEnterprise AIKnowledge Graph
0 likes · 27 min read
Why Palantir’s Ontology Beats Traditional Data Models – Insights from Industry Leaders
Big Data Tech Team
Big Data Tech Team
Apr 15, 2026 · Industry Insights

How to Harness Large Language Models for Effective Data Governance: Real Scenarios, Pitfalls, and Best Practices

This article analyzes how large language models can be integrated into data governance workflows, outlines three practical use cases, identifies five common implementation traps, offers best‑practice recommendations, and presents a real hospital case that demonstrates measurable performance gains.

AIData Governancebest practices
0 likes · 13 min read
How to Harness Large Language Models for Effective Data Governance: Real Scenarios, Pitfalls, and Best Practices
dbaplus Community
dbaplus Community
Apr 2, 2026 · Operations

Why Most CMDB Projects Fail and How to Build a Sustainable Data Engine

The article analyzes common pitfalls of CMDB implementations, explains why overly comprehensive models collapse, and proposes a consumption‑driven, federated, and automation‑focused approach that integrates monitoring, ITSM, and FinOps to achieve continuous data quality and business value.

AutomationCMDBData Governance
0 likes · 13 min read
Why Most CMDB Projects Fail and How to Build a Sustainable Data Engine
dbaplus Community
dbaplus Community
Mar 31, 2026 · Industry Insights

Why Most Data Governance Projects Fail and How to Build a Practical, Engineer‑Friendly Solution

Most companies see data governance fail not because of technology but because they start with the wrong direction, focusing on rules, platforms, and processes that add friction instead of improving data usability, and the article provides a step‑by‑step, low‑overhead approach with concrete SQL and Python templates to fix it.

Data GovernanceEngineering ProductivityPython
0 likes · 25 min read
Why Most Data Governance Projects Fail and How to Build a Practical, Engineer‑Friendly Solution
Big Data Tech Team
Big Data Tech Team
Mar 30, 2026 · Big Data

2026 Data Warehouse Interview Guide: Essential Questions for All Three Rounds

This article compiles a comprehensive set of data‑warehouse interview questions—including self‑introduction prompts, SQL and window‑function challenges, data‑skew solutions, architecture design, file‑format trade‑offs, governance, and team‑leadership topics—to help candidates prepare for first, second, and third‑round interviews at leading tech firms.

Big DataCareer DevelopmentData Governance
0 likes · 7 min read
2026 Data Warehouse Interview Guide: Essential Questions for All Three Rounds
DataFunSummit
DataFunSummit
Mar 25, 2026 · Big Data

How Apache Gravitino and OpenLineage Transform Data Governance for AI‑Driven Enterprises

In the era of AI and multi‑cloud, this article analyzes the core challenges of data governance—data silos, quality gaps, and compliance risks—and explains how Apache Gravitino’s unified metadata architecture together with OpenLineage’s standardized lineage model provide a scalable, automated solution for intelligent, real‑time data management.

Apache GravitinoBig DataData Governance
0 likes · 15 min read
How Apache Gravitino and OpenLineage Transform Data Governance for AI‑Driven Enterprises
ITPUB
ITPUB
Mar 17, 2026 · Interview Experience

Expert Links Microservices to Financial AI: Architecture and Data Governance

In this interview, senior technology specialist Chen Ke shares how he adapts internet‑scale microservice and PaaS practices to the highly regulated financial sector, discusses building enterprise knowledge‑base platforms with large language models, outlines data‑governance and compliance strategies, and predicts the evolving skill set engineers will need.

AIData GovernanceMicroservices
0 likes · 15 min read
Expert Links Microservices to Financial AI: Architecture and Data Governance
Wuming AI
Wuming AI
Mar 2, 2026 · Industry Insights

How China’s New AI Training Data Standard Bridges Data Delivery and Model Performance

The article explains how the newly released "AI Training Data Set Delivery and Quality Acceptance Specification" addresses gaps in existing data‑quality standards by defining a three‑layer acceptance framework, quantitative metrics, and a pre‑negotiated quality‑baseline mechanism to make dataset delivery verifiable and directly supportive of model training goals.

AI data standardsData GovernanceData Quality
0 likes · 7 min read
How China’s New AI Training Data Standard Bridges Data Delivery and Model Performance
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Feb 11, 2026 · Artificial Intelligence

Breaking the Data Ceiling: UltraData’s 2.4 TB Tiered Dataset with the Largest L3 Math Library

UltraData presents a five‑level tiered data‑management system (L0‑L4) for large‑language‑model training, releases the world’s largest open L3 mathematics dataset (2.4 TB), validates the approach with extensive MiniCPM‑1.2B experiments showing consistent performance gains across web, multilingual, math and code domains, and opens a suite of governance tools and a community portal.

Data GovernanceMathematics DatasetMiniCPM
0 likes · 15 min read
Breaking the Data Ceiling: UltraData’s 2.4 TB Tiered Dataset with the Largest L3 Math Library
ITPUB
ITPUB
Jan 20, 2026 · Databases

Boost Data Warehouse Efficiency with Proven Naming Conventions

A well‑defined naming convention for data‑warehouse tables reduces chaos, improves maintainability, speeds up queries, and cuts cross‑team collaboration costs, turning raw data into a strategic asset for modern enterprises.

Data GovernanceData WarehouseDatabase design
0 likes · 8 min read
Boost Data Warehouse Efficiency with Proven Naming Conventions
Woodpecker Software Testing
Woodpecker Software Testing
Dec 25, 2025 · Artificial Intelligence

How AI Testing Platforms Achieve Real-World Efficiency Gains

The article analyzes AI testing platforms, showing how automated test‑case generation, adaptive execution, defect prediction, and a structured rollout process deliver up to 35% higher coverage, 48% faster design, and 40% reduced execution time across finance and e‑commerce case studies.

AI testingData Governanceadaptive testing
0 likes · 8 min read
How AI Testing Platforms Achieve Real-World Efficiency Gains
StarRocks
StarRocks
Dec 25, 2025 · Big Data

How dbt, DataOps, and StarRocks Combine to Accelerate Real‑Time Data Modeling

This article explains how dbt drives automated data modeling and governance, how DataOps practices bring agility and control to data projects, and how StarRocks’ lakehouse architecture enables real‑time and batch analytics, illustrated with concrete workflows, version‑control conventions, and enterprise case studies.

Data GovernanceDataOpsELT
0 likes · 14 min read
How dbt, DataOps, and StarRocks Combine to Accelerate Real‑Time Data Modeling
Zhuanzhuan Tech
Zhuanzhuan Tech
Dec 17, 2025 · Artificial Intelligence

How AI Powers Automatic Security Tagging in Large‑Scale Data Governance

This article details how a Chinese e‑commerce platform leverages large‑language‑model AI, the open‑source Dify platform, and engineered workflows to automate security tagging of massive data assets, covering data‑governance fundamentals, AI‑driven tagging advantages, technical architecture, prompt engineering, optimization cases, and future roadmap.

AIData GovernancePrompt engineering
0 likes · 25 min read
How AI Powers Automatic Security Tagging in Large‑Scale Data Governance
Ray's Galactic Tech
Ray's Galactic Tech
Dec 15, 2025 · Databases

Mastering Database Design: From Core Principles to Modern Distributed Practices

This comprehensive guide walks you through fundamental database design goals, a step‑by‑step lifecycle, nine essential strategies—including normalization, indexing, and security—plus modern distributed and NoSQL considerations, performance tuning, high‑availability tactics, and practical tools for robust data governance.

Data GovernanceDatabase designNoSQL
0 likes · 11 min read
Mastering Database Design: From Core Principles to Modern Distributed Practices
dbaplus Community
dbaplus Community
Dec 7, 2025 · Artificial Intelligence

How AI Agents Can Revolutionize Data Governance: A Step‑by‑Step Blueprint

This article explains how AI agents transform traditional data governance by introducing a four‑layer perception‑decision‑execution‑learning architecture, detailing the required technologies, tool integrations, code examples, deployment steps, team roles, security safeguards, and practical rollout strategies for enterprises seeking automated, intelligent data management.

AI AgentData GovernanceData Quality
0 likes · 10 min read
How AI Agents Can Revolutionize Data Governance: A Step‑by‑Step Blueprint
DataFunSummit
DataFunSummit
Dec 1, 2025 · Artificial Intelligence

Why Palantir’s Ontology Approach Could Transform Enterprise AI – Insights from Industry Leaders

A detailed transcript of a closed‑door forum reveals how Palantir’s ontology methodology, combined with AI agents, addresses data semantics, knowledge governance, and enterprise‑level decision making, while highlighting practical challenges, evaluation frameworks, and the need for strong management and high‑quality data foundations.

Data GovernanceEnterprise AIKnowledge Graph
0 likes · 27 min read
Why Palantir’s Ontology Approach Could Transform Enterprise AI – Insights from Industry Leaders
DaTaobao Tech
DaTaobao Tech
Dec 1, 2025 · Artificial Intelligence

How AI Can Automate Repetitive Work: From Simple Tools to Intelligent Agents

This article shares the author's practical experience in using AI to tackle complex repetitive tasks, presenting a reusable methodology that abstracts human actions into a perception‑decision‑execution loop, and demonstrates three automation modes—tool assistant, workflow, and intelligent agent—through real‑world cases in data governance, ticket handling, and baseline operations.

AI automationData Governanceintelligent agent
0 likes · 23 min read
How AI Can Automate Repetitive Work: From Simple Tools to Intelligent Agents
Baidu Tech Salon
Baidu Tech Salon
Nov 26, 2025 · Big Data

How Baidu MEG Cut Data Costs: Inside a Big Data Governance Playbook

This article details Baidu's MEG data cost governance practice, covering background challenges, a unified governance framework, health‑score metrics, platform and engine capabilities, concrete compute and storage optimization techniques, achieved results, and future plans for continuous cost reduction.

Cost OptimizationData Governance
0 likes · 23 min read
How Baidu MEG Cut Data Costs: Inside a Big Data Governance Playbook
JD Cloud Developers
JD Cloud Developers
Nov 24, 2025 · Artificial Intelligence

JoyAgent: Open‑Source Enterprise‑Grade Multi‑Agent Platform from JD

The 2025 Open Atom Developer Conference highlighted JD's JoyAgent project, an open‑source, 100% enterprise‑grade multi‑agent platform that excels in AI, data governance, and diagnostic analysis, with detailed features, performance metrics, and deployment experiences shared.

AI PlatformData GovernanceDiagnostic Analysis
0 likes · 7 min read
JoyAgent: Open‑Source Enterprise‑Grade Multi‑Agent Platform from JD
DataFunSummit
DataFunSummit
Nov 23, 2025 · Artificial Intelligence

How Large Language Models Are Revolutionizing Banking Data Integration

This article examines the challenges of traditional banking data, explains how large language models can fuse structured and unstructured information, outlines a new data‑centric infrastructure and governance approach, and describes the DiFY platform’s AI‑agent and DataOps capabilities for agile, non‑intrusive integration with core banking systems.

AI agentsBig DataData Governance
0 likes · 16 min read
How Large Language Models Are Revolutionizing Banking Data Integration
Data Thinking Notes
Data Thinking Notes
Nov 2, 2025 · Artificial Intelligence

Why Data Governance Is the Key to Trustworthy AI in the Large Model Era

The article explains how the rapid rise of large‑model AI has shifted the focus from models to data, outlines the concept and stages of AI‑specific data governance, identifies challenges such as low‑quality data, privacy leaks, bias, and proposes a comprehensive framework of principles, processes, and technologies to ensure high‑quality, secure, and ethical AI deployment.

AIData GovernanceData Quality
0 likes · 40 min read
Why Data Governance Is the Key to Trustworthy AI in the Large Model Era
Big Data Tech Team
Big Data Tech Team
Oct 29, 2025 · Fundamentals

Why Unified Data Modeling Matters: From Conceptual Design to Physical Implementation

The article explains how inconsistent "customer ID" fields across systems stem from a lack of unified data models, defines the difference between data modeling and data models, outlines three modeling stages, and compares three major modeling approaches—normative, dimensional, and entity—highlighting their purposes, processes, and trade‑offs.

Data GovernanceDatabase designconceptual modeling
0 likes · 12 min read
Why Unified Data Modeling Matters: From Conceptual Design to Physical Implementation
DataFunSummit
DataFunSummit
Oct 28, 2025 · Fundamentals

Why Unstructured Data Management Is the Next Frontier for Enterprises

This article explores the evolution, current state, and challenges of enterprise unstructured data management, reviews case studies from traditional firms, Huawei and Ant Group, proposes an ECM‑based reference framework, compares it with structured data governance, and outlines future integration strategies with AI and unified data platforms.

AIBig DataData Governance
0 likes · 28 min read
Why Unstructured Data Management Is the Next Frontier for Enterprises
DataFunTalk
DataFunTalk
Oct 26, 2025 · Big Data

How Kuaishou E‑Commerce Built a Data Metric System to Drive Growth

This article explores Kuaishou E‑Commerce’s journey in constructing a comprehensive data metric system, detailing its business context, the necessity of metrics, challenges across data production, querying and usage, practical implementation steps, management practices, and a concluding Q&A.

Data GovernanceKuaishoudata metrics
0 likes · 6 min read
How Kuaishou E‑Commerce Built a Data Metric System to Drive Growth
Big Data Tech Team
Big Data Tech Team
Oct 23, 2025 · Industry Insights

How to Build a Reusable, Well‑Designed Data Warehouse Model

This article analyzes why analysts and data engineers clash over non‑reusable data models, presents metrics such as cross‑layer reference rate and model reuse coefficient, and outlines a step‑by‑step framework—including ODS takeover, subject‑domain mapping, dimension consistency, fact‑table integration, development best practices, and tool support—to transform siloed warehouses into a shared data‑platform.

Big DataData GovernanceData Platform
0 likes · 15 min read
How to Build a Reusable, Well‑Designed Data Warehouse Model
DataFunTalk
DataFunTalk
Oct 18, 2025 · Big Data

Inside Ant Group’s Big Data Governance: Key Practices and Insights

This article shares Ant Group’s practical experience in large-scale data governance, outlining four main topics—overall governance overview, data quality management, data storage-processing governance, and future considerations—while emphasizing the five critical aspects of architecture, security, compliance, quality, and value that drive effective big-data operations.

Data ArchitectureData GovernanceData Quality
0 likes · 4 min read
Inside Ant Group’s Big Data Governance: Key Practices and Insights
DataFunSummit
DataFunSummit
Oct 14, 2025 · Big Data

How Douyin’s Data Asset Platform Redefines Big Data Lineage

This article introduces Douyin Group’s one‑stop Data Asset Management Platform, explains why the company focuses on data assets rather than raw metadata, and details the evolution, architecture, applications, and future outlook of its comprehensive big‑data lineage system.

Big DataData Asset ManagementData Governance
0 likes · 5 min read
How Douyin’s Data Asset Platform Redefines Big Data Lineage
DataFunSummit
DataFunSummit
Oct 12, 2025 · Big Data

How Douyin’s Data Asset Platform Revolutionizes Big Data Lineage

This article introduces Douyin Group’s Data Asset Management Platform, explaining its shift from traditional metadata to comprehensive data assets, detailing the evolution, architecture, and applications of its full‑link big data lineage, and offering strategic guidance for building effective lineage systems.

Data AssetData GovernanceData Lineage
0 likes · 5 min read
How Douyin’s Data Asset Platform Revolutionizes Big Data Lineage
DataFunSummit
DataFunSummit
Oct 11, 2025 · Big Data

What Small Banks Can Learn from Cutting-Edge Data Governance Practices

This article shares a data‑governance roadmap for small and medium banks, covering industry pain points, high‑quality data sets, a three‑step governance path, data standards, metadata management, master‑data strategy, business data modeling, a hybrid Greenplum‑Hadoop platform, quality monitoring, and a maturity assessment framework.

BankingBig DataData Architecture
0 likes · 21 min read
What Small Banks Can Learn from Cutting-Edge Data Governance Practices
DataFunTalk
DataFunTalk
Oct 6, 2025 · Big Data

What Ant Group Learned: 5 Pillars of Effective Data Governance

Ant Group shares its practical experience in big data governance, outlining five key focus areas—architecture, security, compliance, quality, and value—through four structured sections and detailed discussions on data quality and storage governance, while also exploring future challenges and the economics of data.

Ant GroupBig DataData Architecture
0 likes · 4 min read
What Ant Group Learned: 5 Pillars of Effective Data Governance
DataFunSummit
DataFunSummit
Sep 30, 2025 · Artificial Intelligence

How to Govern AI Ethically: Frameworks, Risks, and Real‑World Practices

This article explores AI governance and ethics, outlining five key parts: AI business scenarios, data and AI risks, a comprehensive governance framework, practical implementation steps, and measurable benefits, while also providing expert insights and a Q&A session for deeper understanding.

AI FrameworkAI GovernanceAI risk management
0 likes · 16 min read
How to Govern AI Ethically: Frameworks, Risks, and Real‑World Practices
Alibaba Cloud Observability
Alibaba Cloud Observability
Sep 29, 2025 · Cloud Native

How Alibaba Cloud SLS Soft Delete Enables Instant, Low‑Cost Data Cleanup

This article explains Alibaba Cloud's Log Service (SLS) soft‑delete feature, describing its mark‑and‑filter mechanism, implementation steps, and real‑world scenarios where it replaces costly hard‑delete or ETL solutions with near‑instant, low‑impact data removal for compliance, emergencies, and test‑data contamination.

Alibaba CloudCloud NativeData Governance
0 likes · 9 min read
How Alibaba Cloud SLS Soft Delete Enables Instant, Low‑Cost Data Cleanup
DataFunSummit
DataFunSummit
Sep 20, 2025 · Fundamentals

Why Data Governance Fails: Combating Entropy in Integrated Data Systems

This article explains how the natural entropy of massive data sets creates governance challenges, outlines four core obstacles faced by large internet companies, and presents a sustainable, metric‑driven framework—including quality measurement, indicator systems, and future‑oriented operations—to achieve orderly data asset management.

Data GovernanceData ManagementEnterprise Data
0 likes · 18 min read
Why Data Governance Fails: Combating Entropy in Integrated Data Systems
DataFunSummit
DataFunSummit
Sep 19, 2025 · Big Data

Unlocking Data Lineage: SQL Bloodline for Discovery, Governance & Protection

This article explains how SQL lineage (bloodline) technology can be leveraged in offline data warehouses to enable precise data discovery, automated tag propagation, fine‑grained data governance, column‑level TTL management, and dynamic masking for data protection, illustrating implementation steps, strategies, and real‑world use cases.

Data GovernanceDynamic MaskingSQL lineage
0 likes · 28 min read
Unlocking Data Lineage: SQL Bloodline for Discovery, Governance & Protection
DataFunTalk
DataFunTalk
Sep 16, 2025 · Artificial Intelligence

Top AI Data Governance & Large Model Innovations: A Comprehensive Catalog

This article presents a curated catalog of cutting‑edge topics covering financial large‑model data governance, proactive metadata systems, data cleaning and compliance technologies, AI‑driven intelligent operations, and generative data analysis solutions, inviting readers to explore the latest AI innovations.

AIData GovernanceIntelligent Operations
0 likes · 2 min read
Top AI Data Governance & Large Model Innovations: A Comprehensive Catalog
DataFunTalk
DataFunTalk
Sep 15, 2025 · Artificial Intelligence

Unlocking the Future: AI-Driven Data Governance and Large Model Innovations

This article presents a curated catalog of cutting‑edge topics covering AI‑powered data governance, large‑model applications, data cleaning, compliance, lakehouse integration, intelligent operations, and generative analytics, inviting readers to explore the latest innovations and download the full e‑book via QR code.

AIAnalyticsData Governance
0 likes · 2 min read
Unlocking the Future: AI-Driven Data Governance and Large Model Innovations
DataFunSummit
DataFunSummit
Sep 6, 2025 · Artificial Intelligence

Explore Cutting-Edge AI‑Driven Data Governance: Full Topic Catalog

This article presents a comprehensive catalog of cutting‑edge AI and large‑model topics, covering financial data governance, proactive metadata systems, data cleaning compliance, lake‑warehouse integration, intelligent operations, generative analytics, and QR‑code access to the full e‑book.

AIAnalyticsData Governance
0 likes · 2 min read
Explore Cutting-Edge AI‑Driven Data Governance: Full Topic Catalog
Baidu Geek Talk
Baidu Geek Talk
Sep 3, 2025 · Big Data

How Baidu’s TDS Platform Achieves End‑to‑End Data Governance and Smart Operations

This article details Baidu MEG’s TDS (Turing Data Studio) platform, explaining its three‑pillar governance framework—process standardization, quality controllability, and intelligent operations—along with concrete mechanisms, automation, and measurable results that dramatically improve data reliability, operational efficiency, and fault‑tolerance in large‑scale data production.

AutomationData GovernanceData Quality
0 likes · 20 min read
How Baidu’s TDS Platform Achieves End‑to‑End Data Governance and Smart Operations
DataFunTalk
DataFunTalk
Sep 1, 2025 · Big Data

How JD Retail Tackles Data Governance Challenges to Boost Efficiency

JD Retail outlines the growing data management challenges it faces—including asset discovery, architecture agility, development quality, and rising IT costs—and presents a comprehensive data governance framework that leverages standards, agile architecture, development isolation, and resource optimization to improve efficiency and reduce operational expenses.

Big DataData GovernanceData Management
0 likes · 7 min read
How JD Retail Tackles Data Governance Challenges to Boost Efficiency
DataFunTalk
DataFunTalk
Aug 28, 2025 · Big Data

How JD Retail Tackles Data Governance Challenges to Boost Efficiency

JD Retail faces growing data volume, redundant models, and resource‑intensive storage, prompting a comprehensive data‑governance strategy that defines standards, streamlines architecture, isolates development, and optimizes compute and storage costs, ultimately enabling more efficient, secure, and agile data operations across the enterprise.

Big DataData ArchitectureData Governance
0 likes · 8 min read
How JD Retail Tackles Data Governance Challenges to Boost Efficiency
DataFunTalk
DataFunTalk
Aug 27, 2025 · Big Data

How JD Retail Overcomes Data Governance Challenges to Boost Efficiency

JD Retail confronts growing data volume, redundant models, shared account risks, and rising storage costs, and responds with a comprehensive data governance framework that standardizes data, streamlines architecture, isolates development, and optimizes resources to achieve efficient, secure, and cost‑effective data operations.

Big DataData ArchitectureData Governance
0 likes · 8 min read
How JD Retail Overcomes Data Governance Challenges to Boost Efficiency
Big Data Tech Team
Big Data Tech Team
Aug 25, 2025 · Interview Experience

Essential Big Data Interview Questions for Data Warehouse Engineer Roles

A comprehensive list of interview topics covering self‑introduction, career moves, data‑warehouse design, team building, architecture comparisons, fact‑table classification, common dimensions, performance tuning, and data‑governance for aspiring big‑data engineers.

Big DataData GovernanceFlink
0 likes · 4 min read
Essential Big Data Interview Questions for Data Warehouse Engineer Roles
Data Party THU
Data Party THU
Aug 1, 2025 · Industry Insights

How Data Elements Drive Continuous Growth in Manufacturing: Challenges and Solutions

This report analyzes how treating data as a production factor reshapes manufacturing, outlines three major challenges—scenario explosion, business‑application enrichment, and intelligent‑application expansion—and shares concrete governance, platform, and AI‑model practices that enable agile, data‑driven digital transformation.

AIData AssetsData Governance
0 likes · 17 min read
How Data Elements Drive Continuous Growth in Manufacturing: Challenges and Solutions
Bilibili Tech
Bilibili Tech
Jul 25, 2025 · Big Data

How Unified Metadata Lineage Transforms Big Data Governance and Security

This article introduces the comprehensive design and evolution of a unified metadata lineage platform for big data, covering background, data processing chain, lineage models, system architecture, quality metrics, application scenarios, and future plans to enhance data governance, quality, and security.

Big DataData GovernanceData Quality
0 likes · 27 min read
How Unified Metadata Lineage Transforms Big Data Governance and Security
DataFunTalk
DataFunTalk
Jul 9, 2025 · Big Data

How Lakehouse Is Transforming Real‑Time Multi‑Dimensional Analytics

This article compiles a series of expert case studies and insights on real‑time intelligent fully‑managed Lakehouse technology, illustrating how companies such as SalesEasy, Chang’an Auto, Kuaishou, Tencent, and JD.com leverage lakehouse architectures to achieve advanced multi‑dimensional analytics, cost‑performance balance, and effective data governance in the digital economy.

Case StudiesData ArchitectureData Governance
0 likes · 2 min read
How Lakehouse Is Transforming Real‑Time Multi‑Dimensional Analytics
DataFunTalk
DataFunTalk
Jul 8, 2025 · Big Data

Explore Cutting-Edge Lakehouse Solutions: Real-Time Analytics & Data Governance

This guide presents a curated collection of case studies and insights on cloud-native Lakehouse architectures, real‑time analytics, data‑driven user experiences, and data governance, showcasing implementations from companies like SalesEasy, Changan Auto, TikTok, Tencent, JD.com, and more.

Case StudiesData GovernanceLakehouse
0 likes · 2 min read
Explore Cutting-Edge Lakehouse Solutions: Real-Time Analytics & Data Governance
DataFunTalk
DataFunTalk
Jul 7, 2025 · Big Data

Unlock Real-Time Analytics with Cloud Lakehouse: A Complete Guide

This article presents a curated list of sessions covering cloud Lakehouse technology for real-time, multidimensional data analysis, including case studies from SalesEasy, Changan Auto, Tencent, and JD, as well as discussions on data lake adoption, streaming lake Paimon, and the relevance of metadata‑driven data governance in the digital economy.

Big DataCase StudyData Governance
0 likes · 2 min read
Unlock Real-Time Analytics with Cloud Lakehouse: A Complete Guide
DataFunTalk
DataFunTalk
Jul 6, 2025 · Big Data

How Cloud Lakehouse Is Redefining Real-Time Multi-Dimensional Data Analytics

This article presents a curated list of case studies and insights on cloud Lakehouse technology, covering real-time intelligent analytics, data architecture simplification, IoT big‑data platforms, integrated data platforms, and the evolving role of metadata‑driven data governance in the digital economy.

Big DataCase StudiesData Governance
0 likes · 2 min read
How Cloud Lakehouse Is Redefining Real-Time Multi-Dimensional Data Analytics
Instant Consumer Technology Team
Instant Consumer Technology Team
Jul 3, 2025 · Artificial Intelligence

Why Buying an AI Appliance Is a Strategic Pitfall for Enterprises

Enterprises rushing to purchase DeepSeek AI appliances and smart‑agent platforms often face hidden technical, data, and organizational challenges that turn promised "plug‑and‑play" solutions into costly missteps, highlighting the need for realistic strategy, robust data governance, and continuous capability building.

AI capability buildingAI deploymentData Governance
0 likes · 28 min read
Why Buying an AI Appliance Is a Strategic Pitfall for Enterprises
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Jun 13, 2025 · Artificial Intelligence

Designing AI-Ready Data Architecture: Key Features and Future Trends

AI-era data architecture must handle massive, multimodal datasets with real-time processing, prioritize data quality over quantity, support scalability, provenance, and native ML/AI integration, while addressing governance, security, and ethical challenges through emerging technologies like data fabric, mesh, and federated learning.

AIBig DataData Architecture
0 likes · 6 min read
Designing AI-Ready Data Architecture: Key Features and Future Trends
DataFunSummit
DataFunSummit
Jun 6, 2025 · Big Data

How Unicom Digital’s Integrated Data Platform Revolutionizes Metadata Management

This article details Unicom Digital’s metadata management practice on its integrated data platform, covering the strategic background of data, key challenges, award-winning capabilities, three-pronged solutions—automation, linking+, and AI—along with practical implementations, full‑chain lineage, data responsibility, lifecycle management, and future AI‑driven enhancements.

AIAutomationBig Data
0 likes · 18 min read
How Unicom Digital’s Integrated Data Platform Revolutionizes Metadata Management
Continuous Delivery 2.0
Continuous Delivery 2.0
May 30, 2025 · Artificial Intelligence

Data Quality and Diversity: The Critical Battlefield Beyond AI Models

The article explains why high‑quality, diverse data—rather than just advanced models—has become the decisive factor for enterprise AI success, outlining key dimensions of data quality, strategies for building diverse datasets, and practical steps for establishing a data‑first AI strategy.

AIData GovernanceData Quality
0 likes · 12 min read
Data Quality and Diversity: The Critical Battlefield Beyond AI Models
Big Data Tech Team
Big Data Tech Team
May 19, 2025 · Fundamentals

Why Unified Data Modeling Matters: From Concepts to Physical Design

The article explains why establishing a unified data model is essential, differentiates data modeling from data models, outlines three modeling stages, compares normative, dimensional, and entity modeling methods, and provides practical steps and diagrams to help organizations build robust, business‑driven data architectures.

Data GovernanceDatabase designdimensional modeling
0 likes · 12 min read
Why Unified Data Modeling Matters: From Concepts to Physical Design
Big Data Technology & Architecture
Big Data Technology & Architecture
May 16, 2025 · Big Data

Apache Gravitino: An Open‑Source Metadata Lake for Unified Data and AI Asset Management

Apache Gravitino is an open‑source metadata service platform that provides a unified, high‑performance, geographically distributed metadata lake, enabling end‑to‑end data governance, multi‑engine access, and direct management of both structured and unstructured data assets across diverse systems.

Apache GravitinoData GovernanceData Lake
0 likes · 9 min read
Apache Gravitino: An Open‑Source Metadata Lake for Unified Data and AI Asset Management
Bilibili Tech
Bilibili Tech
May 13, 2025 · Big Data

Live Streaming Ecosystem Governance Architecture and Data Mining Engine Design

The article outlines a comprehensive live‑streaming ecosystem governance framework that combines data‑mining engines, tagging platforms, rule‑based disposal mechanisms, and multi‑stage user touchpoints to improve content quality, compliance, and platform sustainability.

Data GovernanceTagging Systemcontent moderation
0 likes · 14 min read
Live Streaming Ecosystem Governance Architecture and Data Mining Engine Design
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
May 9, 2025 · Big Data

Mastering Multi‑AZ Replication in HDFS with AZ Mover

This article introduces AZ Mover, a lightweight HDFS client‑side tool that intelligently scans, schedules, and migrates block replicas across multiple availability zones, detailing its design goals, core workflow, command‑line options, concurrency controls, and future enhancements for robust big‑data disaster recovery.

AZ MoverData GovernanceHDFS
0 likes · 9 min read
Mastering Multi‑AZ Replication in HDFS with AZ Mover
Big Data Technology & Architecture
Big Data Technology & Architecture
Apr 29, 2025 · Big Data

Big Data Interview Preparation: Data Governance, Iceberg Metadata, Lakehouse Best Practices, and Xiaohongshu HR Updates

The article reports Xiaohongshu’s cancellation of the big‑small week schedule and non‑compete clause, then provides a collection of big‑data interview questions—including data governance, Iceberg metadata management, and lakehouse production best practices—along with concise answers and resources for candidates.

Data GovernanceIcebergLakehouse
0 likes · 7 min read
Big Data Interview Preparation: Data Governance, Iceberg Metadata, Lakehouse Best Practices, and Xiaohongshu HR Updates
Big Data Tech Team
Big Data Tech Team
Apr 28, 2025 · Big Data

Mastering Metadata, Master Data, and Data Governance: A Complete Guide

This article explains the core concepts of metadata, master data, data resources, data governance, and data management, outlines their roles, compares governance with management, and provides practical steps and best‑practice recommendations for building a robust enterprise data framework.

Big DataData GovernanceMaster Data
0 likes · 15 min read
Mastering Metadata, Master Data, and Data Governance: A Complete Guide
DataFunSummit
DataFunSummit
Apr 7, 2025 · Artificial Intelligence

Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents

This article examines how Retrieval‑Augmented Generation (RAG) and multi‑agent technologies narrow the gap between large language models and practical deployment, highlighting their roles in operations automation, financial risk control, intelligent data governance, database localization, edge inference, and future AI‑driven solutions.

Data GovernanceOperations AutomationRAG
0 likes · 8 min read
Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents
Data Thinking Notes
Data Thinking Notes
Mar 19, 2025 · Big Data

How to Maximize Data Asset Value: From DataOps to Monetization

This report outlines a comprehensive framework for turning raw data into valuable assets, introducing DataOps and panoramic data architecture, and detailing practical methods for data value assessment, asset circulation, and operational mechanisms to help enterprises build a solid value baseline and expand data asset applications.

Big DataData Asset ManagementData Governance
0 likes · 4 min read
How to Maximize Data Asset Value: From DataOps to Monetization
DataFunSummit
DataFunSummit
Jan 17, 2025 · Databases

Graph Database Applications and Architectures in DataFun Knowledge Map 3.0

The DataFun Knowledge Map 3.0’s graph database module, presented by Ant Group expert Cui Anqi, outlines how graph databases enhance complex analysis through risk‑control architectures, user‑relationship recommendation, data‑governance, a new graph‑based data management system, and the GraphRAG framework, while also offering a free download link.

AIData Governancegraph database
0 likes · 3 min read
Graph Database Applications and Architectures in DataFun Knowledge Map 3.0
DataFunSummit
DataFunSummit
Jan 1, 2025 · Big Data

Douyin Group Data Asset Management Platform: Full‑Stack Data Lineage Evolution and Applications

This article introduces Douyin Group’s end‑to‑end data asset management platform, explains the evolution and architecture of its large‑scale data lineage system, presents quality metrics and ecosystem components, and outlines practical applications and future directions for data governance, development, and security.

Data Asset PlatformData GovernanceData Lineage
0 likes · 16 min read
Douyin Group Data Asset Management Platform: Full‑Stack Data Lineage Evolution and Applications
Data Thinking Notes
Data Thinking Notes
Dec 24, 2024 · Big Data

Unlock Business Growth with the Three‑Element and Four‑Movement Data Asset Framework

This article explains why data is a new production factor, introduces the “three elements” (organization & awareness, processes & standards, platforms & tools) and the “four‑movement” (inventory, assessment, governance, sharing) framework for data asset operation, and shows how it drives digital transformation, efficiency and innovative business models.

Big DataData AssetData Governance
0 likes · 4 min read
Unlock Business Growth with the Three‑Element and Four‑Movement Data Asset Framework
JD Retail Technology
JD Retail Technology
Dec 24, 2024 · Industry Insights

How JD Retail Automates AB Experiment Data Pipelines with Data Weaving

This article analyzes JD Retail's approach to automating AB experiment workflows by introducing a data‑weaving framework that unifies metric definitions, streamlines logical data modeling, and enables scalable, real‑time DAG orchestration across multiple experiment scenarios.

AB testingAutomationData Governance
0 likes · 21 min read
How JD Retail Automates AB Experiment Data Pipelines with Data Weaving
DataFunSummit
DataFunSummit
Dec 21, 2024 · Big Data

Big Data Implementation Practices and Architecture in a Foreign Bank

This article shares the foreign bank's big data implementation journey, covering background and goals, overall planning and architecture, practical insights, phased rollout, data governance, security, and Q&A, illustrating how a unified data platform, storage‑compute separation, and AI‑driven tools drive business innovation.

AIBankingData Architecture
0 likes · 19 min read
Big Data Implementation Practices and Architecture in a Foreign Bank
Data Thinking Notes
Data Thinking Notes
Dec 19, 2024 · Information Security

Unveiling 41 Official Data Terms: What They Mean for China’s Data Infrastructure

This article compiles the official definitions released by China’s National Data Bureau and other agencies for 41 data‑related terms, explains the concepts of data infrastructure, privacy‑preserving computing, trusted data spaces, and blockchain, and outlines how these definitions guide the nation’s data‑driven development strategy.

BlockchainData GovernancePrivacy Computing
0 likes · 25 min read
Unveiling 41 Official Data Terms: What They Mean for China’s Data Infrastructure
JD Retail Technology
JD Retail Technology
Dec 19, 2024 · Big Data

JD.com Data Governance: Architecture, Key Technologies, and Future Directions

JD.com’s data‑governance framework combines a health‑score‑driven, automated platform that cross‑verifies audit logs, builds full‑link and operator‑level lineage, introduces standard fields, and optimizes resource mixing, task staggering, and cross‑datacenter scheduling, while targeting real‑time AI‑enhanced detection and full automation.

Data GovernanceData LineageJD.com
0 likes · 15 min read
JD.com Data Governance: Architecture, Key Technologies, and Future Directions
DataFunSummit
DataFunSummit
Dec 16, 2024 · Big Data

Empowering Manufacturing Digital Transformation with Data: Architecture, Challenges, and Solutions

This article explains how data can empower the digital transformation of traditional manufacturing, covering background policies, challenges in building industrial data indicator systems, overall architecture design, technical and business considerations, and practical solutions such as the 4+4 principle, KPI loops, and case studies.

AIData GovernanceDigital Transformation
0 likes · 16 min read
Empowering Manufacturing Digital Transformation with Data: Architecture, Challenges, and Solutions
Data Thinking Notes
Data Thinking Notes
Dec 10, 2024 · Big Data

Why Data Asset Inclusion in Financial Statements Is the Next Competitive Edge for Enterprises

The article explains how recent policies make data asset inclusion in financial statements essential, outlines the concepts of data resources, assets and factors, describes the governance, assessment and lifecycle processes, and shows how this practice can boost financing, valuation and digital transformation for companies, economies and nations.

Data AssetData GovernanceData Management
0 likes · 30 min read
Why Data Asset Inclusion in Financial Statements Is the Next Competitive Edge for Enterprises
DataFunSummit
DataFunSummit
Dec 10, 2024 · Big Data

JD.com’s Big Data Governance: Practices, Key Technologies, and Future Outlook

This article presents JD.com’s comprehensive big‑data governance experience, detailing the background and challenges, the automated governance platform and its core technologies such as audit logs and full‑link lineage, strategies for resource optimization, and the roadmap toward real‑time, intelligent, and fully automated data governance.

AutomationData GovernanceData Lineage
0 likes · 14 min read
JD.com’s Big Data Governance: Practices, Key Technologies, and Future Outlook