Tagged articles
370 articles
Page 1 of 4
DataFunTalk
DataFunTalk
May 19, 2026 · Industry Insights

From Single‑Point Copilot to Platform‑Level Agentic: Real Challenges and Future Forks for Data Platforms

A live discussion dissected the shift from single‑point Copilot assistants to platform‑level Agentic data platforms, exposing hard architectural, security, knowledge‑base, evaluation, stability‑cost, and governance challenges while debating whether the future will favor a super‑agent or a multi‑agent ecosystem.

Agentic AIBig DataData Platform
0 likes · 18 min read
From Single‑Point Copilot to Platform‑Level Agentic: Real Challenges and Future Forks for Data Platforms
DataFunSummit
DataFunSummit
May 18, 2026 · Artificial Intelligence

From Single‑Point Copilot to Platform‑Level Agentic: Real Challenges and Future Paths for Data Platforms

A 90‑minute live discussion examined how data platforms must evolve from simple Copilot assistants to fully agentic systems, covering architectural redesign, security guardrails, knowledge‑base integration, evaluation pitfalls, cost management, and whether the future favors a super‑agent or a multi‑agent ecosystem.

Agentic AICost ManagementData Platform
0 likes · 20 min read
From Single‑Point Copilot to Platform‑Level Agentic: Real Challenges and Future Paths for Data Platforms
Digital Planet
Digital Planet
May 18, 2026 · Industry Insights

Why Xiangpiaopiao Lost 168 Million Cups and Can’t Circle the Earth: The Invisible Digital War

The article analyzes Xiangpiaopiao’s 2025 revenue drop and 2026 quarterly rebound, exposing how its traditional channel‑centric model left it blind to consumer behavior, while digital‑native rivals like Hey Tea and Yuanqi Forest leverage data‑driven operations, prompting a call for bC integration, one‑code tracing, and a data‑platform to achieve true digital transformation.

Data PlatformDigital TransformationFMCG
0 likes · 15 min read
Why Xiangpiaopiao Lost 168 Million Cups and Can’t Circle the Earth: The Invisible Digital War
DataFunSummit
DataFunSummit
May 17, 2026 · Industry Insights

From Single‑point Copilot to Platform‑level Agentic: Real Challenges and Future Paths for Data Platforms

A 90‑minute live discussion with data experts from vivo and YangQianGuan reveals that moving from a simple Copilot assistant to a platform‑level Agentic data system requires fundamental architectural changes, new infrastructure for memory, planning, tool orchestration, security guardrails, knowledge management, robust evaluation, and a clear ROI strategy.

AI GovernanceAgenticBig Data
0 likes · 19 min read
From Single‑point Copilot to Platform‑level Agentic: Real Challenges and Future Paths for Data Platforms
DataFunSummit
DataFunSummit
May 9, 2026 · Industry Insights

Why Palantir’s Ontology Beats Traditional Data Middle Platforms in Decision Making

The article examines costly failures of conventional data middle platforms—such as a $40 million payroll system flop and a chemical firm’s data‑cleaning bottleneck—then shows how Palantir’s ontology‑driven approach delivers triple‑digit ROI for BP, 98% R&D efficiency for Novartis, and $14 million annual savings for General Mills, highlighting the three‑layer semantic, dynamics, and decision architecture that turns data into actionable decisions.

Business IntelligenceData PlatformDecision Systems
0 likes · 5 min read
Why Palantir’s Ontology Beats Traditional Data Middle Platforms in Decision Making
DataFunSummit
DataFunSummit
Apr 24, 2026 · Artificial Intelligence

AI‑Driven Data Governance as a Service: Tencent Games' Paradigm Shift

This talk details how Tencent Games leverages AI to transform its data governance from rule‑based, passive processes into a semantic, service‑oriented paradigm, addressing resource waste, low collaboration efficiency, and scalability challenges while delivering measurable improvements in cost, speed, and asset quality.

AIAutomationBig Data
0 likes · 19 min read
AI‑Driven Data Governance as a Service: Tencent Games' Paradigm Shift
DataFunSummit
DataFunSummit
Apr 15, 2026 · Industry Insights

Why Traditional Data Platforms Fail and How Ontology Drives Triple‑Digit ROI

The article analyzes costly data‑platform failures—such as a $40 million payroll system in San Francisco schools and a collapsed Healthcare.gov launch—identifies the root cause as ineffective data middle platforms, and demonstrates how Palantir’s ontology‑based three‑layer architecture (semantic, dynamics, decision) can turn data into actionable insights, delivering triple‑digit ROI for enterprises like BP, Novartis, and General Mills.

Big DataData PlatformOntology
0 likes · 5 min read
Why Traditional Data Platforms Fail and How Ontology Drives Triple‑Digit ROI
DataFunSummit
DataFunSummit
Mar 27, 2026 · Industry Insights

Why Traditional Data Platforms Fail and How Ontology Delivers Triple‑Digit ROI

The article examines costly data platform failures—such as a $40 million payroll system collapse and a healthcare.gov outage—highlighting why traditional data middle platforms become data swamps, then explains how Palantir’s ontology approach, with its three‑layer semantic, dynamics, and decision architecture, can turn data into actionable insights and achieve triple‑digit ROI.

Data ArchitectureData PlatformOntology
0 likes · 4 min read
Why Traditional Data Platforms Fail and How Ontology Delivers Triple‑Digit ROI
DataFunSummit
DataFunSummit
Mar 26, 2026 · Industry Insights

Why Traditional Data Platforms Fail and How Ontology Drives Triple‑Digit ROI

The article analyzes costly data‑platform failures—such as a $40 million school‑district payroll system and a collapsed Healthcare.gov launch—identifies the root cause as ineffective data middle platforms, and explains how Palantir’s ontology‑based three‑layer architecture (semantic, dynamics, decision) transforms raw data into automated business actions, delivering measurable ROI across multiple industries.

Data ArchitectureData PlatformDecision automation
0 likes · 5 min read
Why Traditional Data Platforms Fail and How Ontology Drives Triple‑Digit ROI
DataFunSummit
DataFunSummit
Mar 23, 2026 · Industry Insights

Why Traditional Data Platforms Fail and How Ontology Drives Triple‑Digit ROI

The article analyzes costly data‑platform failures in large enterprises, contrasts traditional data middle‑platforms with Palantir’s ontology‑based approach, and explains a three‑layer architecture that turns raw data into automated business decisions, illustrated with real‑world case outcomes.

Data ManagementData PlatformDigital Twin
0 likes · 5 min read
Why Traditional Data Platforms Fail and How Ontology Drives Triple‑Digit ROI
Baidu Geek Talk
Baidu Geek Talk
Mar 23, 2026 · Databases

How Baidu’s MEG Platform Revamped ClickHouse with a Lakehouse Architecture

This article analyzes the challenges of scaling ClickHouse within Baidu’s MEG data platform and details a lake‑house solution that decouples storage and compute, integrates a meta‑service for transparent data access, optimizes query performance through caching, data roll‑up and layout tuning, and introduces a unified query gateway that gracefully falls back to Spark for complex workloads.

ClickHouseData PlatformLakehouse
0 likes · 25 min read
How Baidu’s MEG Platform Revamped ClickHouse with a Lakehouse Architecture
dbaplus Community
dbaplus Community
Nov 3, 2025 · Artificial Intelligence

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

This article explains how Retrieval‑Augmented Generation (RAG) combines vector databases with large language models to let non‑technical users ask natural‑language questions and receive precise SQL statements, detailing the workflow, architecture, chunking methods, performance gains, and remaining challenges.

Data PlatformLLMRAG
0 likes · 17 min read
How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms
Alibaba Cloud Observability
Alibaba Cloud Observability
Oct 27, 2025 · Operations

From Data Silos to Intelligent Insights: Building Future‑Ready Operation Intelligence

This article explains how enterprises can transform massive, fragmented operation data—technical, business, and security—into high‑value intelligent signals by unifying storage, enriching context, applying AI, and delivering a single, observable platform that enables proactive, data‑driven decision making.

AIData PlatformObservability
0 likes · 18 min read
From Data Silos to Intelligent Insights: Building Future‑Ready Operation Intelligence
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Oct 24, 2025 · Big Data

How Leapmotor Scaled to 1M Cars with a Real‑Time Flink Data Platform

Leapmotor’s rapid growth to one million production cars drove a shift from daily batch data to minute‑level real‑time analytics, prompting the adoption of Flink as the core engine of a multi‑layered big‑data platform that handles massive IoT signals, supports fault diagnosis, and integrates batch and streaming workloads on the cloud.

Big DataData PlatformFlink
0 likes · 13 min read
How Leapmotor Scaled to 1M Cars with a Real‑Time Flink Data Platform
Big Data Tech Team
Big Data Tech Team
Oct 23, 2025 · Industry Insights

How to Build a Reusable, Well‑Designed Data Warehouse Model

This article analyzes why analysts and data engineers clash over non‑reusable data models, presents metrics such as cross‑layer reference rate and model reuse coefficient, and outlines a step‑by‑step framework—including ODS takeover, subject‑domain mapping, dimension consistency, fact‑table integration, development best practices, and tool support—to transform siloed warehouses into a shared data‑platform.

Big DataData GovernanceData Platform
0 likes · 15 min read
How to Build a Reusable, Well‑Designed Data Warehouse Model
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Oct 22, 2025 · Big Data

Li Auto’s Trillion‑Row Real‑Time Car‑Network Analytics Using Hologres + Flink

Li Auto’s data team tackled the explosion of vehicle‑telemetry data—over a trillion rows and millions of signals per second—by redesigning their data foundation with Alibaba Cloud’s Hologres and Flink, achieving sub‑second latency, elastic scaling, high availability, and significant cost reductions across real‑time and offline workloads.

Car TelemetryData PlatformFlink
0 likes · 16 min read
Li Auto’s Trillion‑Row Real‑Time Car‑Network Analytics Using Hologres + Flink
DataFunTalk
DataFunTalk
Oct 19, 2025 · Big Data

How Zhihu’s Big Data Strategy Cuts Costs and Boosts Efficiency

This article outlines Zhihu’s big‑data cost‑reduction journey, covering its background, the FinOps‑driven financial management system, technical strategies for lowering expenses, and a forward‑looking summary of challenges and sustainable efficiency gains within the organization and industry context.

Big DataData PlatformFinOps
0 likes · 4 min read
How Zhihu’s Big Data Strategy Cuts Costs and Boosts Efficiency
DataFunSummit
DataFunSummit
Oct 18, 2025 · Big Data

How Zhihu’s Big Data FinOps Cuts Costs and Boosts Efficiency

This article outlines Zhihu’s practical use of big‑data FinOps, describing its hybrid‑cloud architecture, the challenges of multi‑vendor cost management, and how a systematic billing system launched in 2022 drives sustainable cost reduction across the organization.

Big DataCost reductionData Platform
0 likes · 4 min read
How Zhihu’s Big Data FinOps Cuts Costs and Boosts Efficiency
Baidu Geek Talk
Baidu Geek Talk
Oct 15, 2025 · Artificial Intelligence

Can LLMs Automate Data Ingestion and Cut Integration Time from Months to Days?

This article presents an LLM‑driven intelligent data platform ingestion solution that automates schema recognition, mapping, quality rule extraction, and package building, reducing integration cycles from three months to three days while eliminating manual effort and enhancing scalability and control.

AIAutomationCode Generation
0 likes · 21 min read
Can LLMs Automate Data Ingestion and Cut Integration Time from Months to Days?
DataFunSummit
DataFunSummit
Oct 10, 2025 · Artificial Intelligence

How Ping An Life Built ChatBI: An AI‑Powered Intelligent BI Platform

This article details Ping An Life's self‑developed large‑model reporting product ChatBI, covering its background, goals, solution architecture, technical stack, real‑world use cases, deployment challenges, and future outlook, offering practical insights for enterprises adopting AI‑driven business intelligence.

AIBusiness IntelligenceChatbot
0 likes · 17 min read
How Ping An Life Built ChatBI: An AI‑Powered Intelligent BI Platform
Bilibili Tech
Bilibili Tech
Sep 26, 2025 · Artificial Intelligence

How RAG Transforms Natural Language Queries into Accurate SQL for Business Users

This article explains how Retrieval‑Augmented Generation (RAG) combines large language models with vector databases to let non‑technical staff query massive membership data using plain language, detailing the workflow, technical architecture, optimization challenges, and real‑world impact on data‑driven decision making.

AIData PlatformLLM
0 likes · 17 min read
How RAG Transforms Natural Language Queries into Accurate SQL for Business Users
DataFunTalk
DataFunTalk
Sep 25, 2025 · Big Data

How Tencent Cloud’s AI‑Ready Data Platform Redefines Big Data for AI

This article outlines the challenges of high‑quality data for AI, introduces Tencent Cloud’s AI‑Ready data platform with three core capabilities—DIaaS, Setats, and ES‑based knowledge search—covers the end‑to‑end WeData integration, intelligent agents for automation, and showcases ecosystem partnerships driving industry‑wide intelligent transformation.

AIBig DataData Platform
0 likes · 14 min read
How Tencent Cloud’s AI‑Ready Data Platform Redefines Big Data for AI
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Sep 22, 2025 · Big Data

How Dataverse’s Notebook Supercharges Data+AI Development at Xiaohongshu

The article details Xiaohongshu’s Dataverse platform evolution into a Data+AI system, highlighting inefficiencies in algorithm and data‑science workflows, the introduction of an interactive notebook, comprehensive data lineage, AI‑coding assistance, and future DataAgent plans to automate data engineering tasks.

AI CodingData LineageData Platform
0 likes · 21 min read
How Dataverse’s Notebook Supercharges Data+AI Development at Xiaohongshu
DataFunTalk
DataFunTalk
Sep 22, 2025 · Big Data

How Kuaishou Scales Intelligent BI: Insights from Its Data Platform

This article outlines Kuaishou's Data Platform team's mission to boost data‑driven decision making through advanced compute engines, high‑performance services, and AI‑enhanced BI, detailing its architecture, challenges, solutions, and future outlook for large‑scale intelligent analytics.

AIAnalyticsBI
0 likes · 6 min read
How Kuaishou Scales Intelligent BI: Insights from Its Data Platform
DataFunTalk
DataFunTalk
Sep 19, 2025 · Big Data

How Kuaishou’s Data Platform Powers Intelligent BI with AI and Big Data

This article outlines how Kuaishou’s Data Platform Department enhances decision‑making efficiency by building advanced compute engines and high‑performance services, detailing the platform’s architecture, challenges of intelligent BI, AI‑driven solutions, and the end‑to‑end BI workflow from data ingestion to analysis.

AnalyticsBIBig Data
0 likes · 5 min read
How Kuaishou’s Data Platform Powers Intelligent BI with AI and Big Data
DataFunTalk
DataFunTalk
Sep 17, 2025 · Artificial Intelligence

How Generative AI is Transforming Business Intelligence: Trends and Practices

This article, excerpted from Baidu’s Data Platform technical salon, explores how generative AI reshapes Business Intelligence by outlining three perspectives: the technical trend and business value, the design principles of Baidu’s ChatBI platform, and practical challenges and solutions encountered during its deployment.

AI trendsBusiness IntelligenceChatBI
0 likes · 5 min read
How Generative AI is Transforming Business Intelligence: Trends and Practices
StarRocks
StarRocks
Aug 6, 2025 · Databases

How Qunar Migrated to StarRocks: Architecture, Performance Gains & Best Practices

This article details Qunar's transition to StarRocks as a unified OLAP engine, covering the business background, engine evaluation, architecture redesign, observability, high‑availability strategies, query‑performance optimizations, real‑world application cases, community contributions, and future plans.

Data PlatformOLAPObservability
0 likes · 21 min read
How Qunar Migrated to StarRocks: Architecture, Performance Gains & Best Practices
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 5, 2025 · Big Data

How Alibaba Built a World‑Class Big Data Platform Over a Decade

Over ten years, Alibaba’s data engineers transformed a modest Hadoop‑based system into a globally‑scalable, high‑performance big data platform—ODPS/MaxCompute—supporting massive offline and real‑time workloads, pioneering innovations like the 5K cluster expansion, Blink streaming, and the unified ‘Moon’ migration.

AlibabaBig DataData Platform
0 likes · 25 min read
How Alibaba Built a World‑Class Big Data Platform Over a Decade
High Availability Architecture
High Availability Architecture
Jun 18, 2025 · Backend Development

How WeChat Reading Scaled Its Backend Architecture Over a Decade

Marking ten years of WeChat Reading, this article details the backend's evolution from a monolithic service to a multi‑layered, micro‑service architecture with robust storage, RPC frameworks, book data platforms, account system redesign, and AI‑driven content retrieval, highlighting the technical challenges and solutions behind its scalability.

AI RetrievalBackend ArchitectureData Platform
0 likes · 18 min read
How WeChat Reading Scaled Its Backend Architecture Over a Decade
Big Data Tech Team
Big Data Tech Team
May 26, 2025 · Industry Insights

How to Build a Unified, Scalable Data Metric System for Digital Transformation

This article explains why unified data metrics are critical for digital transformation, outlines the core value of a metric product, details its classification, lifecycle, automation, and service integration, and proposes a three‑layer implementation architecture while looking ahead to AI‑driven enhancements.

Data PlatformDigital Transformationdata metrics
0 likes · 5 min read
How to Build a Unified, Scalable Data Metric System for Digital Transformation
StarRocks
StarRocks
Mar 27, 2025 · Databases

How JD Logistics Boosted Query Speed and Cut Costs with StarRocks Storage‑Compute Separation

JD Logistics transformed its one‑stop self‑service analytics platform, UData, by migrating from an integrated storage‑compute architecture to a storage‑compute separated design powered by StarRocks, achieving sub‑10‑second P95/P99 query latency, reducing storage costs by 90%, and cutting compute expenses around 30% while supporting massive data volumes.

Cost reductionData PlatformKubernetes
0 likes · 20 min read
How JD Logistics Boosted Query Speed and Cut Costs with StarRocks Storage‑Compute Separation
Baidu Geek Talk
Baidu Geek Talk
Mar 24, 2025 · Big Data

How Turing Data Finder Transforms Growth Analysis with a Unified Data Platform

The article provides a detailed technical overview of the Turing Data Finder (TDF) platform, describing its background, core components, data schema, ingestion workflow, and a suite of growth‑analysis features such as event, retention, funnel, path, component, distribution, and attribution analysis, while also outlining performance‑optimisation techniques and future development directions.

Big DataData PlatformSQL Optimization
0 likes · 17 min read
How Turing Data Finder Transforms Growth Analysis with a Unified Data Platform
DataFunSummit
DataFunSummit
Jan 29, 2025 · Artificial Intelligence

Tencent OlaChat: An LLM‑Powered Intelligent Business Intelligence Platform – Architecture, Capabilities, and Practice

This article presents Tencent's OlaChat intelligent BI platform, detailing its evolution from traditional to intelligent BI, the impact of large language models on data analytics, the system's multi‑task dialogue, metadata retrieval enhancements, Text2SQL solutions, and real‑world deployment insights.

AIBusiness IntelligenceData Platform
0 likes · 21 min read
Tencent OlaChat: An LLM‑Powered Intelligent Business Intelligence Platform – Architecture, Capabilities, and Practice
JD Retail Technology
JD Retail Technology
Dec 24, 2024 · Industry Insights

How JD Retail Automates AB Experiment Data Pipelines with Data Weaving

This article analyzes JD Retail's approach to automating AB experiment workflows by introducing a data‑weaving framework that unifies metric definitions, streamlines logical data modeling, and enables scalable, real‑time DAG orchestration across multiple experiment scenarios.

AB testingAutomationData Governance
0 likes · 21 min read
How JD Retail Automates AB Experiment Data Pipelines with Data Weaving
DataFunSummit
DataFunSummit
Dec 20, 2024 · Big Data

Douyin Group's Data Management: Strategies for Metric Construction, Management, Production, and Consumption

This article outlines Douyin Group's approach to handling massive EB‑scale data, describing the challenges of metric quality and efficiency, the Volcano Engine data platform architecture, three‑layer solutions for metric production, management and consumption, and future plans for automation and governance.

AnalyticsBig DataData Platform
0 likes · 19 min read
Douyin Group's Data Management: Strategies for Metric Construction, Management, Production, and Consumption
DataFunSummit
DataFunSummit
Nov 25, 2024 · Big Data

Kuaishou Big Data Analytics Practices Driven by NoETL

This article presents Kuaishou's big‑data analytics system, describing its current capabilities, the pain points of traditional ETL workflows, the NoETL concept, the implementation of a metric‑center platform, and practical features such as custom fields, automated modeling and acceleration, followed by future plans and a Q&A session.

Automated ModelingBig DataCustom Fields
0 likes · 20 min read
Kuaishou Big Data Analytics Practices Driven by NoETL
Ctrip Technology
Ctrip Technology
Nov 21, 2024 · Big Data

Performance Governance and Optimization of Ctrip's Nova Data Reporting Platform

This article details the performance challenges of Ctrip's Nova data reporting platform and describes a series of governance measures—including multi‑dimensional data caching, materialized view acceleration, query strategy optimization, and SQL quality improvements—that collectively reduced average query latency by over 50% and stabilized the system.

Data PlatformSQL Performancecaching
0 likes · 26 min read
Performance Governance and Optimization of Ctrip's Nova Data Reporting Platform
Big Data Technology & Architecture
Big Data Technology & Architecture
Nov 7, 2024 · Big Data

Douyin Group's Data Management Strategies: Enhancing Metric Stability and Reusability

This article outlines Douyin Group's approach to handling petabyte‑scale data, addressing metric inconsistencies, and improving data product agility through a four‑layer Volcano Engine platform, systematic indicator production‑management‑consumption cycles, organizational design, automation, and future plans for large‑model‑driven metric splitting.

AnalyticsAutomationBig Data
0 likes · 20 min read
Douyin Group's Data Management Strategies: Enhancing Metric Stability and Reusability
ByteDance Data Platform
ByteDance Data Platform
Nov 6, 2024 · Big Data

How Douyin’s Data Platform Overcomes EB‑Scale Metric Challenges

This article explains how Douyin Group tackles massive data volume, quality, and efficiency issues by building a four‑layer intelligent platform, standardizing metric management, automating metric decomposition, and creating reusable metric services that boost agility, stability, and cross‑team collaboration.

Big DataData PlatformData Quality
0 likes · 20 min read
How Douyin’s Data Platform Overcomes EB‑Scale Metric Challenges
Data Thinking Notes
Data Thinking Notes
Nov 5, 2024 · Big Data

How a Next‑Gen Data Management Platform Boosts Efficiency and Innovation

This article outlines the motivations, objectives, and architectural design of a next‑generation data management platform, detailing its four‑layer “four‑ization” approach, core services such as data integration, modeling, API provisioning, componentization, as well as governance, security, and operational best practices.

Big DataData GovernanceData Integration
0 likes · 20 min read
How a Next‑Gen Data Management Platform Boosts Efficiency and Innovation
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Oct 31, 2024 · Big Data

How EMR Serverless Spark Powers the Next‑Gen Lakehouse Era

This article traces the evolution of data platforms, explains the rise of lakehouse architecture, and details how Alibaba Cloud's EMR Serverless Spark delivers one‑stop development, high performance, and full ecosystem compatibility, illustrated with real‑world case studies from Midea and Eagle Network.

AIBig DataData Platform
0 likes · 16 min read
How EMR Serverless Spark Powers the Next‑Gen Lakehouse Era
DataFunSummit
DataFunSummit
Oct 26, 2024 · Big Data

Kuaishou Metric Middle Platform: Design, Architecture, and Practices

This article presents Kuaishou's metric middle platform, detailing its background, design principles, architecture, metric management, data modeling, unified analysis language OAX, federated query engine OCTO, acceleration strategies, and future directions, illustrating how it improves data quality, development efficiency, and analytical capabilities at scale.

AnalyticsBig DataData Platform
0 likes · 64 min read
Kuaishou Metric Middle Platform: Design, Architecture, and Practices
DataFunSummit
DataFunSummit
Oct 13, 2024 · Big Data

Enterprise Digital Intelligence Capability Maturity Model (EDMM): Definitions, Framework, and Future Roadmap

This article presents the China Information and Communications Research Institute’s research on the Enterprise Digital Intelligence Capability Maturity Model (EDMM), detailing the concepts of data, intelligent, and knowledge middle platforms, the model’s four‑layer framework, its development stages, value propositions, long‑term mechanisms, and upcoming work plans.

Big DataData PlatformEnterprise Maturity Model
0 likes · 24 min read
Enterprise Digital Intelligence Capability Maturity Model (EDMM): Definitions, Framework, and Future Roadmap
Ctrip Technology
Ctrip Technology
Oct 11, 2024 · Big Data

Design and Implementation of Ctrip International Ticketing Data Middle Platform

This article details Ctrip's data middle‑platform solution for international ticketing, covering background challenges, design principles, key technical practices such as version control, P2P distribution, data timeliness, robustness, consumption‑process optimization, overall architecture, achieved benefits, and future plans.

Data ConsistencyData PlatformVersion Control
0 likes · 16 min read
Design and Implementation of Ctrip International Ticketing Data Middle Platform
DataFunSummit
DataFunSummit
Sep 25, 2024 · Big Data

Evolution of Big Data AI Development Paradigm and Alibaba Cloud’s Integrated Architecture

This article examines how large‑scale big‑data platforms can simplify AI application development, outlines the shift from model‑centric to data‑centric paradigms, and shares Alibaba Cloud’s practical experiences in building an integrated big‑data‑AI architecture, including MaxCompute, Hologres, MaxFrame, and vector search capabilities.

AI integrationBig DataData Platform
0 likes · 19 min read
Evolution of Big Data AI Development Paradigm and Alibaba Cloud’s Integrated Architecture
Data Thinking Notes
Data Thinking Notes
Sep 9, 2024 · Fundamentals

Master the 6‑Step Blueprint for Building an Enterprise Data Middle Platform

This guide outlines a practical six‑step methodology—covering overall planning, data integration, model construction, data development, asset management, and data services—to help enterprises build a robust data middle platform that unlocks business value and supports agile digital transformation.

Data GovernanceData IntegrationData Platform
0 likes · 10 min read
Master the 6‑Step Blueprint for Building an Enterprise Data Middle Platform
Baidu Geek Talk
Baidu Geek Talk
Sep 9, 2024 · Big Data

TDS Platform Overview: Architecture, Modules, and Features of Baidu MEG's Turing 3.0 Data Ecosystem

The TDS platform, central to Baidu MEG’s Turing 3.0 ecosystem, unifies data development, warehouse management, monitoring, and resource control through Spark‑based TDE, a visual studio, and AI‑enhanced tools like Smart Diagnosis and Text2SQL, enabling standardized workflows, scalable scheduling, and handling over 30 k daily tasks.

AIBig DataData Development
0 likes · 21 min read
TDS Platform Overview: Architecture, Modules, and Features of Baidu MEG's Turing 3.0 Data Ecosystem
DataFunSummit
DataFunSummit
Sep 8, 2024 · Big Data

Building and Optimizing a Cross‑Border E‑Commerce Data Platform: Architecture, Challenges, and Protonbase‑Based Solutions

This article presents Xide International's cross‑border e‑commerce data platform, detailing its multi‑layer business architecture, the scalability and data‑access problems encountered, and how a Protonbase‑driven data‑warehouse and micro‑service redesign dramatically improved query speed, operational efficiency, and cost.

Big DataData PlatformData Warehouse
0 likes · 11 min read
Building and Optimizing a Cross‑Border E‑Commerce Data Platform: Architecture, Challenges, and Protonbase‑Based Solutions
Baidu Geek Talk
Baidu Geek Talk
Sep 2, 2024 · Industry Insights

How a R&D Data Platform Leverages Large Language Models to Accelerate Issue Diagnosis

The article explains how the R&D data middle platform integrates large language models to automate data collection, real‑time monitoring, intelligent analysis, and rapid root‑cause identification for online issues, detailing the architecture, wide‑table modeling, generative BI, attribution algorithms, RAG enhancements, and future optimization plans.

Data PlatformRetrieval Augmented Generationgenerative BI
0 likes · 37 min read
How a R&D Data Platform Leverages Large Language Models to Accelerate Issue Diagnosis
Data Thinking Notes
Data Thinking Notes
Aug 29, 2024 · Big Data

How ICBC Evolved Its Data Intelligence Architecture for Real‑Time Insights

At the 2024 Data Intelligence Conference, ICBC's Big Data and AI Lab detailed the evolution of its data intelligence platform, covering architectural redesign, real‑time data warehouse technology, unified intelligent data tools, and future development directions to boost efficiency and innovation.

Big DataData Platformarchitecture evolution
0 likes · 3 min read
How ICBC Evolved Its Data Intelligence Architecture for Real‑Time Insights
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Aug 14, 2024 · Big Data

Understanding Data Middle Platform: Value, Architecture, and Real‑World Cases

This article explains the concept, value, three‑layer architecture, and practical implementations of a data middle platform, illustrating how it standardizes data, forms a middle‑office organization, and drives cost‑effective business empowerment through examples from Alibaba, NetEase, and other enterprises.

Big DataCase StudyData Governance
0 likes · 9 min read
Understanding Data Middle Platform: Value, Architecture, and Real‑World Cases
DataFunSummit
DataFunSummit
Aug 14, 2024 · Big Data

Solving Typical Issues in Migrating to Spark 3.1: Multiple Catalog, Hive‑SQL to Spark‑SQL Migration, and Performance & Stability Optimizations at Xiaomi

This article shares Xiaomi's experience building a next‑generation one‑stop data development platform on Spark 3.1, covering typical challenges such as Multiple Catalog implementation, Hive‑SQL to Spark‑SQL migration, offline Spark performance and stability optimizations, and future roadmap plans.

Apache SparkBig DataData Platform
0 likes · 18 min read
Solving Typical Issues in Migrating to Spark 3.1: Multiple Catalog, Hive‑SQL to Spark‑SQL Migration, and Performance & Stability Optimizations at Xiaomi
DataFunSummit
DataFunSummit
Aug 9, 2024 · Big Data

Design and Practice of Ant Group's Metric System

This article presents a comprehensive overview of Ant Group's metric system, covering its definition, three-layer architecture, common challenges, concept consensus methods, semantic layer options, mechanism design, productization capabilities, platform improvements, business outcomes, future directions, and a detailed Q&A session.

Big DataData Platformdata modeling
0 likes · 28 min read
Design and Practice of Ant Group's Metric System
DataFunSummit
DataFunSummit
Aug 6, 2024 · Big Data

Implementing a Multi‑Tenant Lakehouse Data Platform for Real‑Time Analytics at a SaaS CRM Company

This article details how a SaaS CRM provider built a cloud‑native Lakehouse platform to support multi‑tenant real‑time analytics, describing data challenges, metadata‑driven architecture, virtual database design, query optimization, BI integration, AI readiness, migration steps, and the resulting performance and scalability gains.

Big DataData PlatformLakehouse
0 likes · 19 min read
Implementing a Multi‑Tenant Lakehouse Data Platform for Real‑Time Analytics at a SaaS CRM Company
Data Thinking Notes
Data Thinking Notes
Jul 29, 2024 · Big Data

What Is a Data Middle Platform and How Does It Transform Enterprise Data Management?

This article explains the concept, design principles, and core components of a data middle platform, detailing its overall, functional, layered, logical, and data architectures, as well as the specific platforms for data collection, processing, organization, governance, quality, sharing, and visualization, illustrated with diagrams.

Big DataData ArchitectureData Governance
0 likes · 27 min read
What Is a Data Middle Platform and How Does It Transform Enterprise Data Management?
DataFunTalk
DataFunTalk
Jul 16, 2024 · Big Data

E‑commerce Metric Management Practice Based on DataLeap

This article details ByteDance Volcano Engine's DataLeap‑driven e‑commerce metric management practice, covering the background of metric system construction, challenges of inconsistency, the six‑point platform solution, metric DSL query language, consumption workflow, and future plans for intelligent automation and large‑model integration.

Data PlatformDataLeapMetricDSL
0 likes · 18 min read
E‑commerce Metric Management Practice Based on DataLeap
Huolala Tech
Huolala Tech
Jul 4, 2024 · Big Data

How Huolala Built a High‑Impact Metric Library to Power Data‑Driven Decisions

Huolala’s data team created a comprehensive metric library platform that centralizes metric definitions, classifications, data, and analysis, enabling data‑driven decision‑making, operational efficiency, service optimization, and strategic business growth across its freight services.

AB testingBusiness AnalyticsData Platform
0 likes · 11 min read
How Huolala Built a High‑Impact Metric Library to Power Data‑Driven Decisions
DataFunSummit
DataFunSummit
Jul 2, 2024 · Cloud Computing

Global Perspective on Multi-Cloud Data Architecture

The forum presents a series of technical talks on multi‑cloud data architecture, covering Xiaomi’s lake‑warehouse practice, cross‑border e‑commerce data platforms, Alluxio‑based machine‑learning acceleration, Qichacha’s cost‑effective data solutions, and Kuaishou’s Flink on Kubernetes migration, highlighting strategies, implementations, and audience benefits.

Big DataData ArchitectureData Platform
0 likes · 8 min read
Global Perspective on Multi-Cloud Data Architecture
DataFunTalk
DataFunTalk
Jun 26, 2024 · Big Data

Evolution of the Big Data + AI Development Paradigm and Alibaba Cloud’s Integrated Architecture

This article examines how the big‑data AI development paradigm has shifted from model‑centric to data‑centric workflows, outlines the challenges of integrating data and AI teams, and details Alibaba Cloud’s end‑to‑end, serverless big‑data platform—including MaxCompute, Hologres, MaxFrame, Object Table, and vector search—designed to accelerate large‑scale AI applications.

AI integrationBig DataData Platform
0 likes · 20 min read
Evolution of the Big Data + AI Development Paradigm and Alibaba Cloud’s Integrated Architecture
Data Thinking Notes
Data Thinking Notes
Jun 2, 2024 · Big Data

How JD Retail’s Data Platform Boosts Efficiency with Unified Modeling and AI‑Driven Insights

This article details JD Retail’s end‑to‑end data platform, covering data asset certification, 5W2H modeling, unified query DSL, intelligent acceleration, robust governance, visualization components, low‑code orchestration, and large‑model AI applications that together reduce query latency, cut development costs, and empower analysts across the retail business.

AIBig DataData Governance
0 likes · 39 min read
How JD Retail’s Data Platform Boosts Efficiency with Unified Modeling and AI‑Driven Insights
Ctrip Technology
Ctrip Technology
May 30, 2024 · Big Data

Ctrip Data Platform 2.0 Architecture and Evolution: Multi‑IDC Storage, Tiered Data, Scheduling, and Spark/Kyuubi Enhancements

Since 2023, Ctrip’s Data Platform 2.0 has been redesigned to support multi‑IDC storage, tiered hot/warm/cold data, transparent migration, priority scheduling, mixed online/offline resources, and a smooth upgrade from Spark 2 to Spark 3 with Kyuubi as the query engine, delivering higher performance and scalability.

Data PlatformKyuubiScheduling
0 likes · 21 min read
Ctrip Data Platform 2.0 Architecture and Evolution: Multi‑IDC Storage, Tiered Data, Scheduling, and Spark/Kyuubi Enhancements
DataFunSummit
DataFunSummit
May 27, 2024 · Big Data

Design and Optimization of Zhihu's Bridge Platform for DMP/CDP: Architecture, Challenges, and Solutions

This article presents a comprehensive case study of Zhihu's Bridge platform, detailing its background, five core modules, unified architecture built on Spark and Flink, bitmap‑based tagging, and performance optimizations that address query speed, write latency, and high‑QPS online checks while outlining future directions with Doris 2.0 and large language models.

CDPDMPData Platform
0 likes · 27 min read
Design and Optimization of Zhihu's Bridge Platform for DMP/CDP: Architecture, Challenges, and Solutions
Big Data Technology & Architecture
Big Data Technology & Architecture
May 27, 2024 · Big Data

Athena Data Factory: A One‑Stop Data Development and Governance Platform – Architecture, Features, and Impact

The Athena Data Factory, built by Spark Thinking, is a comprehensive one‑stop data development and governance platform that integrates data integration, development, analysis, and services, offering offline, real‑time, and AI pipelines, modular architecture, extensive monitoring, and cost‑optimisation to empower thousands of users across the company.

AirflowBig DataData Platform
0 likes · 26 min read
Athena Data Factory: A One‑Stop Data Development and Governance Platform – Architecture, Features, and Impact
DataFunTalk
DataFunTalk
May 23, 2024 · Big Data

Berserker Big Data Platform: Architecture, Development Practices, and Operational Enhancements

This article presents a comprehensive overview of the Berserker big‑data platform, detailing its overall design, data‑development components, key architectural challenges such as state management, release processes, two‑phase commit, RPC duplication, task routing, message handling, execution isolation, dependency model redesign, and outlines future work including stateless execution nodes, Kubernetes integration, and unified stream‑batch processing.

Big DataData PlatformDistributed Scheduling
0 likes · 15 min read
Berserker Big Data Platform: Architecture, Development Practices, and Operational Enhancements
DataFunSummit
DataFunSummit
May 12, 2024 · Big Data

Practice of Lakehouse‑Integrated Data Platform Architecture in the Financial Innovation Sector

This article presents the evolution of data platform architectures, the specific challenges of financial‑sector information‑technology innovation, and the design, core components, deployment path, and real‑world case studies of the cloud‑native lakehouse solution DataCyber developed by Shuxin Network.

Big DataData PlatformFinancial Innovation
0 likes · 21 min read
Practice of Lakehouse‑Integrated Data Platform Architecture in the Financial Innovation Sector
Data Thinking Notes
Data Thinking Notes
May 9, 2024 · Big Data

How to Build an Effective Indicator System: From Concept to Productization

This article explores the complete lifecycle of an indicator system—from defining metrics and addressing common ambiguities, through designing concept consensus, semantic layers, mechanisms, and governance, to productizing platforms, optimizing development, and envisioning future AI‑driven enhancements.

Big DataData PlatformIndicator System
0 likes · 22 min read
How to Build an Effective Indicator System: From Concept to Productization
DataFunTalk
DataFunTalk
May 2, 2024 · Artificial Intelligence

Algorithmic Role in Building the 58 User Profiling Platform

This article explains how 58's user profiling platform leverages algorithms for tag system construction, audience generation, recommendation pipelines, and smart operations to enable personalized marketing, fine‑grained operations, and user value growth across multiple business scenarios.

Data PlatformLook-alike ModelingTag Engineering
0 likes · 12 min read
Algorithmic Role in Building the 58 User Profiling Platform
JD Retail Technology
JD Retail Technology
Apr 28, 2024 · Big Data

From Confusion to Mastery: A Newcomer's Journey in Big Data Testing

This article recounts a junior tester's two‑year growth at JD.com, detailing early uncertainties, practical learning methods, step‑by‑step big‑data testing tasks, big‑sale preparation experiences, and actionable advice for newcomers aiming to thrive in the big‑data testing field.

AdviceData PlatformPerformance Testing
0 likes · 10 min read
From Confusion to Mastery: A Newcomer's Journey in Big Data Testing
DataFunSummit
DataFunSummit
Apr 19, 2024 · Big Data

Design Insights of Bilibili's Big Data Development Governance Platform

This article outlines Bilibili's data‑driven approach, describing the five‑year development of its big‑data development governance platform, its user segmentation, product positioning, data‑map and governance product designs, operational methods, value evaluation, and future roadmap, highlighting significant efficiency gains and user impact.

Big DataBilibiliData Platform
0 likes · 10 min read
Design Insights of Bilibili's Big Data Development Governance Platform
DataFunTalk
DataFunTalk
Apr 13, 2024 · Artificial Intelligence

Integrating Generative AI with Business Intelligence: Design, Implementation, and Lessons from Baidu's ChatBI Platform

The article explores how generative AI transforms business intelligence by detailing BI's evolution, the ChatBI platform's architecture, NL2SQL challenges, performance and accuracy optimizations, and real‑world deployment outcomes that demonstrate reduced user barriers and enhanced analytical efficiency.

AIBusiness IntelligenceChatBI
0 likes · 13 min read
Integrating Generative AI with Business Intelligence: Design, Implementation, and Lessons from Baidu's ChatBI Platform
Baidu Geek Talk
Baidu Geek Talk
Apr 10, 2024 · Big Data

TDA: A One‑Stop Self‑Service BI Platform – Architecture, Challenges, and Solutions

The article presents Turing Data Analysis (TDA), a self‑service BI platform that replaces fragile traditional pipelines with a unified DWD‑based data model, drag‑and‑drop analytics, multi‑engine query optimization and caching, delivering sub‑10‑second queries on billions of rows, fine‑grained permissions, and rapid dashboard creation, while reporting significant usage growth and outlining AI‑driven future enhancements.

BIBig DataData Platform
0 likes · 15 min read
TDA: A One‑Stop Self‑Service BI Platform – Architecture, Challenges, and Solutions
Data Thinking Notes
Data Thinking Notes
Apr 9, 2024 · Big Data

What Is a Data Middle Platform and Why It’s Essential for Modern Enterprises

Data middle platforms transform raw enterprise data into reusable assets by integrating collection, storage, processing, governance, and service layers, enabling faster deployment, consistent metrics, improved data quality, and business value across digital transformation, while addressing challenges like siloed data, low efficiency, and inconsistent standards.

Big DataData GovernanceData Integration
0 likes · 23 min read
What Is a Data Middle Platform and Why It’s Essential for Modern Enterprises
DataFunSummit
DataFunSummit
Apr 4, 2024 · Big Data

Design Principles and Future Directions of DataOps

This article outlines the core capabilities of data-driven development, the background and architecture of DataOps, its research challenges and focus areas, and explores future directions such as data virtualization, platform governance, and data value assessment, providing a comprehensive overview of DataOps practices.

Big DataData Platform
0 likes · 8 min read
Design Principles and Future Directions of DataOps
DataFunSummit
DataFunSummit
Apr 1, 2024 · Big Data

DataOps at ByteDance: Challenges, Implementation, and Future Outlook

This article examines ByteDance's DataOps journey, detailing the data‑engineering challenges faced, the concrete solutions and productization through the DataLeap platform, the metrics and best‑practice framework adopted, and the future directions involving AI‑assisted development and open‑source collaboration.

Big DataData Platformmetrics
0 likes · 20 min read
DataOps at ByteDance: Challenges, Implementation, and Future Outlook
MaGe Linux Operations
MaGe Linux Operations
Mar 8, 2024 · Cloud Computing

Choosing Between Cloud, On-Premises, and Cloud‑Near Storage: Which Wins?

This article compares the advantages and disadvantages of cloud storage, on‑premises storage, and the hybrid cloud‑near storage model, explaining how each impacts scalability, cost, control, security, and integration for modern data platforms, and helps organizations select the most suitable solution.

Cost OptimizationData PlatformHybrid storage
0 likes · 12 min read
Choosing Between Cloud, On-Premises, and Cloud‑Near Storage: Which Wins?
Baidu Tech Salon
Baidu Tech Salon
Mar 7, 2024 · Artificial Intelligence

How Generative AI is Transforming Business Intelligence: Inside Baidu’s ChatBI

This article examines the evolution of BI through generative AI, outlines the design and implementation of Baidu’s ChatBI platform, and discusses technical challenges such as NL2SQL integration, performance, accuracy, and user experience improvements that enable intelligent, low‑cost data analysis.

AIBusiness IntelligenceChatBI
0 likes · 14 min read
How Generative AI is Transforming Business Intelligence: Inside Baidu’s ChatBI
DataFunTalk
DataFunTalk
Mar 5, 2024 · Big Data

Changan Automotive Big Data Platform: Challenges and Practices in Connected Vehicle Scenarios

This article outlines the rapid growth of data in the smart automotive sector and details Changan's big data platform challenges—high cost, data accessibility, and operational complexity—and the practical migration from a Lambda to a unified Kappa architecture that delivers significant storage, compute, and maintenance efficiencies.

Big DataConnected VehiclesCost Optimization
0 likes · 14 min read
Changan Automotive Big Data Platform: Challenges and Practices in Connected Vehicle Scenarios
DataFunTalk
DataFunTalk
Mar 4, 2024 · Big Data

Design and Implementation of a Lakehouse‑Integrated Data Platform for Financial Innovation by Shuxin Network

This article presents Shuxin Network's practical experience in building a cloud‑native, lakehouse‑integrated data platform for the financial sector, covering architecture evolution, challenges of domestic‑innovation (信创), the DataCyber solution, core components, deployment roadmap, and real‑world case studies.

Big DataCloud NativeData Platform
0 likes · 21 min read
Design and Implementation of a Lakehouse‑Integrated Data Platform for Financial Innovation by Shuxin Network
JD Retail Technology
JD Retail Technology
Feb 22, 2024 · Big Data

JD Retail Data Platform: Data Asset Governance, Metric Middle Platform, Visualization Tools, and AI‑Driven Applications

This technical article details JD Retail’s 2023 data platform advancements, covering data‑asset certification and governance, a metric middle‑platform for unified indicator management, sophisticated visualization components, low‑code orchestration, and large‑model AI applications that together improve data retrieval efficiency, reduce storage‑compute costs, and support rapid business decision‑making.

AIData GovernanceData Platform
0 likes · 39 min read
JD Retail Data Platform: Data Asset Governance, Metric Middle Platform, Visualization Tools, and AI‑Driven Applications
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Feb 21, 2024 · Artificial Intelligence

Cloud Music Public Opinion Analysis Platform: Architecture and GPT-Based Implementation

The article describes NetEase Cloud Music’s public‑opinion analysis platform, which integrates external and internal data streams into a layered architecture—ingestion, processing, storage in Elasticsearch, visualization, and monitoring—and employs GPT‑based analyzers for clustering, sentiment, summarization, and intelligent alerts while optimizing costs and planning automated GPT‑driven reports.

Data PlatformElasticsearchGPT analysis
0 likes · 13 min read
Cloud Music Public Opinion Analysis Platform: Architecture and GPT-Based Implementation
DataFunTalk
DataFunTalk
Feb 8, 2024 · Big Data

Design and Practice of Ant Group's Metric System

This talk by Ant Group’s senior technical expert Wang Gaohang details the definition, design, mechanism, productization, and future outlook of the company’s metric system, covering concept consensus, semantic layers, workflow, AI assistance, performance optimization, and practical case studies.

AIBig DataData Platform
0 likes · 28 min read
Design and Practice of Ant Group's Metric System
DataFunSummit
DataFunSummit
Jan 31, 2024 · Big Data

iQIYI Magic Mirror: Evolution of a Big Data Analysis Platform

iQIYI's Magic Mirror platform, evolving from 1.0 to 3.0, addresses the growing data analysis demands of the internet industry by empowering self‑service analytics, introducing multi‑stage architectures, advanced computation engines, customizable SQL, and visual dashboards, thereby improving efficiency, scalability, and data security for business users.

Big DataData PlatformSQL
0 likes · 18 min read
iQIYI Magic Mirror: Evolution of a Big Data Analysis Platform
DataFunTalk
DataFunTalk
Jan 28, 2024 · Databases

Practical Experience of StarRocks Materialized Views at Didi

This article presents Didi's practical experience with StarRocks materialized views, covering the evolution of its OLAP architecture, the challenges of previous engines, the adoption of StarRocks, the design of materialized view acceleration for real‑time dashboards, and future optimization directions.

Big DataData PlatformOLAP
0 likes · 17 min read
Practical Experience of StarRocks Materialized Views at Didi
DataFunTalk
DataFunTalk
Jan 21, 2024 · Cloud Native

Building a System Observability Framework with YHP: Practices, Challenges, and Integrated Solutions

This article explains how YHP enables cloud‑native systems to achieve comprehensive observability by defining the three core signals—metrics, traces, and logs—addressing common enterprise pain points, and presenting an integrated platform that unifies data collection, storage, analysis, and visualization for efficient fault diagnosis and performance monitoring.

Cloud NativeData Platformlogs
0 likes · 22 min read
Building a System Observability Framework with YHP: Practices, Challenges, and Integrated Solutions
DataFunTalk
DataFunTalk
Jan 12, 2024 · Big Data

Building a Unified Data Empowerment Layer with Apache Kyuubi at GF Securities

The article describes how GF Securities designed and implemented a unified big‑data empowerment layer based on Apache Kyuubi to address data‑centric challenges, improve efficiency, ensure controllable governance, and support agile data scenarios across ingestion, processing, storage, and security.

Apache KyuubiBig DataData Empowerment
0 likes · 33 min read
Building a Unified Data Empowerment Layer with Apache Kyuubi at GF Securities