Tagged articles
270 articles
Page 1 of 3
DataFunTalk
DataFunTalk
May 6, 2026 · Artificial Intelligence

Why Palantir’s Ontology, Not Just Large Models, Drives Its Valuation Surge

In a 90‑minute round‑table, experts from banking risk control and cloud observability explain how Palantir’s ontology—viewed as the skeleton and memory that structures massive, heterogeneous data—bridges three data gaps, enables large‑model reasoning, and offers concrete steps for building practical knowledge graphs in enterprises.

Digital TwinEnterprise AIKnowledge Graph
0 likes · 16 min read
Why Palantir’s Ontology, Not Just Large Models, Drives Its Valuation Surge
DataFunTalk
DataFunTalk
May 2, 2026 · Industry Insights

Why Palantir’s Ontology Fuels Its Valuation: The Skeleton and Memory Behind AI

In a 90‑minute round‑table, experts from banking risk control and cloud observability explain how Palantir’s ontology bridges three data gaps, turns raw logs into a graph of entities and relationships, and works with large models as a skeleton and memory to make AI trustworthy and scalable.

AI trustworthinessDigital TwinKnowledge Graph
0 likes · 16 min read
Why Palantir’s Ontology Fuels Its Valuation: The Skeleton and Memory Behind AI
DataFunTalk
DataFunTalk
Apr 25, 2026 · Artificial Intelligence

How Palantir Ontology Modeling Turns Real Estate Ops into an AI‑Driven Enterprise

Healthpeak, a large medical‑real‑estate REIT, replaced fragmented spreadsheets and manual data entry with Palantir AIP’s ontology‑driven AI operating system, achieving automated billing, voice‑driven workflows, reduced errors, and a scalable, data‑centric operation that frees managers to focus on tenant relationships.

AI PlatformAutomationEnterprise AI
0 likes · 17 min read
How Palantir Ontology Modeling Turns Real Estate Ops into an AI‑Driven Enterprise
DataFunTalk
DataFunTalk
Apr 23, 2026 · Artificial Intelligence

Why Palantir’s Valuation Soars: Large Models as the Brain, Ontology as the Skeleton and Memory

In a 90‑minute round‑table hosted by DataFun, experts from banking risk control and cloud observability dissect how Palantir’s ontology—structured as a graph that links entities, metrics and logs—complements large‑model AI, solves data chaos, and becomes the practical backbone for trustworthy enterprise AI.

Enterprise AIKnowledge GraphObservability
0 likes · 16 min read
Why Palantir’s Valuation Soars: Large Models as the Brain, Ontology as the Skeleton and Memory
DataFunTalk
DataFunTalk
Apr 20, 2026 · Artificial Intelligence

Why Palantir’s Ontology Is the Secret Behind AI Success in Banking and Cloud Ops

In a 90‑minute round‑table hosted by DataFun, experts from Shanghai Bank, Alibaba Cloud, and academia dissect how ontology bridges data chaos, model opacity, and engineering scale, enabling trustworthy AI for financial risk control and cloud observability while outlining practical steps for building usable knowledge graphs.

AIDigital TwinEnterprise AI
0 likes · 17 min read
Why Palantir’s Ontology Is the Secret Behind AI Success in Banking and Cloud Ops
Big Data Tech Team
Big Data Tech Team
Feb 12, 2026 · Big Data

Mastering the DWS Layer: Core Strategies for Scalable Data Warehouses

This article provides a comprehensive, business‑driven analysis of the Data Warehouse Service (DWS) layer, covering its core positioning, design goals, modeling and aggregation tactics, storage optimizations, typical challenges with practical solutions, and best‑practice recommendations for building efficient, cost‑effective data services.

DWS LayerData WarehousePerformance Optimization
0 likes · 8 min read
Mastering the DWS Layer: Core Strategies for Scalable Data Warehouses
Big Data Tech Team
Big Data Tech Team
Jan 15, 2026 · Big Data

Mastering Data Warehousing: Core Concepts, Tools, and Future Trends

This article outlines a comprehensive roadmap for data warehousing, covering fundamental concepts, essential big‑data tools, practical implementation steps, advanced architectural topics, and emerging trends such as cloud‑native warehouses and machine‑learning integration, helping readers build a solid knowledge base.

Data WarehouseETLOLAP
0 likes · 9 min read
Mastering Data Warehousing: Core Concepts, Tools, and Future Trends
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 12, 2026 · Operations

Why Traditional Monitoring Fails and How UModel Redefines Observability for AI‑Powered Ops

The article explains how legacy monitoring based on isolated metrics, traces, and logs cannot keep up with the massive, fragmented, and dynamic data of modern IT systems, and introduces UModel—a graph‑based observability model that bridges data, model, and engineering gaps to enable AI‑driven operations.

Graph ModelingObservabilityOperations
0 likes · 11 min read
Why Traditional Monitoring Fails and How UModel Redefines Observability for AI‑Powered Ops
Big Data Tech Team
Big Data Tech Team
Jan 12, 2026 · Big Data

Avoid the 5 Fatal DWS Design Traps and Build Scalable Data Warehouses

This article dissects the five most common pitfalls when transitioning from DWD to DWS aggregation tables—such as chimney‑style designs, over‑wide tables, grain mismatches, missing drill‑down keys, and performance neglect—and offers concrete, production‑ready solutions to create reusable, efficient, and cost‑effective data‑warehouse layers.

DWS DesignData WarehouseETL
0 likes · 9 min read
Avoid the 5 Fatal DWS Design Traps and Build Scalable Data Warehouses
Instant Consumer Technology Team
Instant Consumer Technology Team
Jan 8, 2026 · Big Data

How Vintage Cohort Analysis Transforms Financial Risk Management

This article explains the concept, key terminology, and practical implementation of Vintage (cohort) analysis in financial services, detailing how to build tables and curves, integrate data pipelines, and use the insights to optimize marketing strategies, credit risk assessment, and operational efficiency.

Vintage analysiscohort analysisdata modeling
0 likes · 18 min read
How Vintage Cohort Analysis Transforms Financial Risk Management
Big Data Tech Team
Big Data Tech Team
Jan 5, 2026 · Big Data

Top 10 Data Warehouse Interview Questions Every 2026 Engineer Must Master

This article compiles the most frequently asked interview questions for 2026 data‑warehouse development engineers, covering core concepts, layer architecture, SQL optimization, window functions, Hive vs Spark, data skew solutions, modeling metrics, slowly changing dimensions, scheduling tools, data quality monitoring, and real project experience.

Data WarehouseHiveSQL Optimization
0 likes · 8 min read
Top 10 Data Warehouse Interview Questions Every 2026 Engineer Must Master
StarRocks
StarRocks
Dec 25, 2025 · Big Data

How dbt, DataOps, and StarRocks Combine to Accelerate Real‑Time Data Modeling

This article explains how dbt drives automated data modeling and governance, how DataOps practices bring agility and control to data projects, and how StarRocks’ lakehouse architecture enables real‑time and batch analytics, illustrated with concrete workflows, version‑control conventions, and enterprise case studies.

Data GovernanceDataOpsELT
0 likes · 14 min read
How dbt, DataOps, and StarRocks Combine to Accelerate Real‑Time Data Modeling
Xiaokun's Architecture Exploration Notes
Xiaokun's Architecture Exploration Notes
Nov 16, 2025 · Backend Development

How to Choose and Implement Architecture Contracts for Distributed Systems

This article explains why architecture‑level contract decisions are needed in distributed systems, compares strict and loose data contracts, illustrates schema‑on‑read/write patterns, and shows how to ensure forward and backward compatibility when evolving protocols such as JSON and Protobuf.

Distributed SystemsProtobufarchitecture contracts
0 likes · 11 min read
How to Choose and Implement Architecture Contracts for Distributed Systems
Architect-Kip
Architect-Kip
Nov 13, 2025 · Databases

Mastering Database Table Design: 12 Essential Table Types and Best Practices

Effective data modeling is crucial for system stability, and this guide walks you through core principles, twelve common table patterns—from batch and log tables to hierarchical and bitmap structures—detailing design rules, trade‑offs, usage scenarios, and practical examples to help you avoid costly redesigns.

SQLdata modelingnormalization
0 likes · 24 min read
Mastering Database Table Design: 12 Essential Table Types and Best Practices
Java Companion
Java Companion
Nov 9, 2025 · Databases

Why Big Companies Avoid SET for User Data: A Redis Storage Guide

The article compares storing user objects in Redis using plain SET with JSON versus using HASH fields, providing code demos, benchmark results, memory and concurrency analysis, and practical guidelines on when to choose each approach for optimal performance and safety.

HashJavaString
0 likes · 9 min read
Why Big Companies Avoid SET for User Data: A Redis Storage Guide
DeWu Technology
DeWu Technology
Nov 5, 2025 · Backend Development

How We Cut Rule‑Update Cycle from Weeks to Days: A Full‑Stack Case Study

This article details the end‑to‑end technical redesign of an e‑commerce management‑category system, covering business pain points, a layered backend architecture, core Java modules, data‑model design, data‑warehouse computation, automated rule validation, approval workflows, and the resulting efficiency gains that shrink rule‑update cycles to just one or two days.

Javaarchitecturedata modeling
0 likes · 13 min read
How We Cut Rule‑Update Cycle from Weeks to Days: A Full‑Stack Case Study
Big Data Tech Team
Big Data Tech Team
Oct 26, 2025 · Big Data

Data Domain vs Subject Area: Clear Differences and Practical Guide

This article explains the distinct concepts of data domain and subject area, uses a library‑vs‑bookstore analogy, presents a real e‑commerce case, compares them in a concise table, and offers best‑practice steps and common pitfalls to help data teams design efficient data architectures.

Data DomainData WarehouseSubject Area
0 likes · 8 min read
Data Domain vs Subject Area: Clear Differences and Practical Guide
Huolala Tech
Huolala Tech
Oct 17, 2025 · Big Data

How HuoLala Accelerated User Profiling 30× Faster with Apache Doris

This article details how HuoLala built a high‑performance user profiling platform on Apache Doris, redesigning data models, leveraging bitmap storage, and applying query‑level optimizations to achieve up to 30‑fold speed gains, lower memory usage, and scalable real‑time analytics.

Apache DorisBig DataBitmap
0 likes · 17 min read
How HuoLala Accelerated User Profiling 30× Faster with Apache Doris
Model Perspective
Model Perspective
Sep 26, 2025 · Fundamentals

Unlocking Insights with Grey Relational Analysis and Grey Prediction Models

This article introduces the core principles of Grey Relational Analysis and the Grey Prediction Model, explains their calculation steps, and demonstrates how they can be applied across engineering, economics, and environmental fields to analyze limited data, select key indicators, evaluate systems, and forecast trends.

Grey Prediction ModelGrey Theorydata modeling
0 likes · 8 min read
Unlocking Insights with Grey Relational Analysis and Grey Prediction Models
php Courses
php Courses
Sep 11, 2025 · Databases

How to Efficiently Store and Manage JSON Data in Relational and NoSQL Databases

JSON has become the de‑facto format for data exchange, and modern relational databases like PostgreSQL and MySQL now support native JSON types alongside NoSQL solutions such as MongoDB, offering developers flexible storage, indexing, and query capabilities while balancing schema rigidity, performance, and scalability.

JSONNoSQLRelational
0 likes · 7 min read
How to Efficiently Store and Manage JSON Data in Relational and NoSQL Databases
Kuaishou Tech
Kuaishou Tech
Jul 31, 2025 · Big Data

How Kuaishou Overcame the ‘Impossible Triangle’ of Performance, Flexibility, and Cost in Real‑Time Big Data Analytics

This article details how Kuaishou’s content middle platform tackled the massive challenges of real‑time, flexible, and cost‑effective data analysis at trillion‑scale by redesigning its architecture, adopting ClickHouse, splitting wide tables, and implementing a scatter‑gather execution model with pre‑shuffle and bitmap optimizations.

Big DataClickHousePerformance Optimization
0 likes · 17 min read
How Kuaishou Overcame the ‘Impossible Triangle’ of Performance, Flexibility, and Cost in Real‑Time Big Data Analytics
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Jul 25, 2025 · Backend Development

Pitfall Diary: Practical Lessons on Using Elasticsearch Nested Types

After a failed flatten‑field migration from MySQL to Elasticsearch caused incorrect product matches, the team introduced nested types, redesigned mappings, rewrote queries with nested and inner_hits, optimized performance, documented pitfalls, and concluded that nested types solve one‑to‑many relations but require careful evaluation.

ElasticsearchNested Typedata modeling
0 likes · 15 min read
Pitfall Diary: Practical Lessons on Using Elasticsearch Nested Types
macrozheng
macrozheng
Jul 18, 2025 · Databases

MySQL vs Elasticsearch: Which Data Store Fits Your Needs?

This article compares MySQL and Elasticsearch across data models, query languages, indexing, distributed architecture, performance, scalability, and typical use cases, helping readers decide which system best fits their application requirements in modern software development.

Elasticsearchdata modelingdatabase comparison
0 likes · 12 min read
MySQL vs Elasticsearch: Which Data Store Fits Your Needs?
Architect
Architect
Jul 7, 2025 · Big Data

How Baidu’s New Search Data Warehouse Architecture Boosts Performance by 5×

This article explains how Baidu’s search data team redesigned its data warehouse with wide‑table modeling, Parquet columnar storage, and a Spark‑ClickHouse fusion engine, eliminating redundancy, cutting query latency from minutes to seconds, and enabling self‑service analytics for thousands of users.

Data WarehouseETLParquet
0 likes · 21 min read
How Baidu’s New Search Data Warehouse Architecture Boosts Performance by 5×
macrozheng
macrozheng
Apr 3, 2025 · Databases

MySQL vs Elasticsearch: Which Data Store Wins for Your Use Case?

This article compares MySQL and Elasticsearch across data models, query languages, indexing, distributed architecture, performance, scalability, and typical use cases, helping developers choose the right system or combine them effectively for various application scenarios.

ElasticsearchSearchdata modeling
0 likes · 12 min read
MySQL vs Elasticsearch: Which Data Store Wins for Your Use Case?

Exploring Data Models: From Hierarchical to Graph and Schema-on-Read/Write

This article examines the evolution of data models—from conceptual, logical, and physical layers to hierarchical, network, relational, document, and graph structures—explaining their characteristics, implementation examples, and the contrasting schema‑on‑read versus schema‑on‑write approaches for modern data storage systems.

data modelingdatabasesgraph database
0 likes · 10 min read
Exploring Data Models: From Hierarchical to Graph and Schema-on-Read/Write
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Mar 26, 2025 · Backend Development

Solving Marketing Activity Product Search with Elasticsearch: When to Use Join

The article examines why front‑end product search fails during large marketing events, evaluates Elasticsearch's join feature and its drawbacks, compares nested, reverse‑modeling and flattened approaches, recommends reverse modeling for massive activity‑product data, and provides concrete DSL code, pagination and caching tips.

ElasticsearchJOINdata modeling
0 likes · 10 min read
Solving Marketing Activity Product Search with Elasticsearch: When to Use Join
Didi Tech
Didi Tech
Mar 20, 2025 · Big Data

Key Questions and Value Assessment in Data Warehouse Modeling and Development

The article explores nine fundamental questions about data‑warehouse modeling—why and when to model, how to evaluate and compare models, the warehouse’s unique role versus business systems, modern architectural shifts, a quantitative value‑proof scoring framework, industry‑standard versus custom approaches, demonstrating business impact, and career insights—concluding that true value lies in enabling informed decisions rather than technology hype.

AIBig DataData Value
0 likes · 12 min read
Key Questions and Value Assessment in Data Warehouse Modeling and Development
Big Data Tech Team
Big Data Tech Team
Mar 17, 2025 · Big Data

How to Design and Review a Data Warehouse Model: A Complete Guide

This document outlines a comprehensive data warehouse model design and review process, covering revision records, project overview, business requirements, conceptual and logical modeling, ETL workflow, exception handling, and acceptance criteria with practical examples and templates.

Data WarehouseETLModel Design
0 likes · 6 min read
How to Design and Review a Data Warehouse Model: A Complete Guide
Ma Wei Says
Ma Wei Says
Mar 9, 2025 · Big Data

Mastering DWD Layer Design: Principles, Fact Tables, and Performance Tips

This article provides a comprehensive guide to designing the Data Warehouse Detail (DWD) layer, covering Kimball‑based design principles, step‑by‑step modeling, table and field naming conventions, concrete Hive DDL/DML examples, and optimization techniques such as partitioning, bucketing, and compression.

Big DataDWDData Warehouse
0 likes · 21 min read
Mastering DWD Layer Design: Principles, Fact Tables, and Performance Tips
JD Cloud Developers
JD Cloud Developers
Feb 5, 2025 · Databases

Cutting Procurement Query Times by 92%: Data Heterogeneity & ES Strategies

This case study details how the BIP procurement system tackled massive data volume, complex queries, and slow SQL by segmenting inbound orders, leveraging Elasticsearch, introducing a dynamic routing layer, and implementing robust ES high‑availability and monitoring, ultimately reducing query load by over 90%.

Big DataPerformance Optimizationdata modeling
0 likes · 14 min read
Cutting Procurement Query Times by 92%: Data Heterogeneity & ES Strategies
DataFunTalk
DataFunTalk
Jan 27, 2025 · Artificial Intelligence

Improving AI Agent Planning and Reasoning: Challenges and Practical Solutions

The article examines current limitations of AI agents in planning and complex reasoning, critiques existing methods like COT/TOT and ReAct, and proposes practical strategies—including combined COT‑Reflection approaches, structured memory algorithms, and white‑box interaction designs—to enhance agent performance within the DataFun knowledge map framework.

AI AgentCoTPlanning
0 likes · 3 min read
Improving AI Agent Planning and Reasoning: Challenges and Practical Solutions
Big Data Technology & Architecture
Big Data Technology & Architecture
Dec 26, 2024 · Fundamentals

Detailed Granularity Fact Tables (DWD): Types, Design Principles, and Comparison

The article explains the three detailed-granularity fact table types—transaction, periodic snapshot, and cumulative snapshot—detailing their purposes, design principles, and comparative usage, and offers a simplified interpretation to help data engineers choose the appropriate fact table for data warehouse modeling.

Big DataDWDData Warehouse
0 likes · 5 min read
Detailed Granularity Fact Tables (DWD): Types, Design Principles, and Comparison
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 12, 2024 · Fundamentals

How to Master a New Project in Record Time: A Practical Guide

This article offers a step‑by‑step guide for programmers to quickly onboard a new business or project by gathering documentation, mapping data models, understanding architecture, learning platform tools, and applying practical tips to accelerate learning and growth.

Documentationdata modelingdevelopment workflow
0 likes · 8 min read
How to Master a New Project in Record Time: A Practical Guide
Chen Tian Universe
Chen Tian Universe
Dec 5, 2024 · Operations

Mastering the Four-Stage Reconciliation Model for Large Payment Institutions

This article explains how major payment institutions ensure the accuracy of tens of millions of daily transactions and billions of dollars by using a four‑segment data model, three verification groups, error classification, and extensible data coding to achieve reliable settlement and accounting.

OperationsReconciliationaccounting
0 likes · 6 min read
Mastering the Four-Stage Reconciliation Model for Large Payment Institutions
DataFunSummit
DataFunSummit
Nov 29, 2024 · Big Data

Standardizing Metric Management in Didi’s Data Platform

The article outlines Didi’s end‑to‑end metric lifecycle—from background, requirements and current pain points to a multi‑stage solution that introduces a unified metric dictionary, management tool, logical modeling, and consumption layer—to achieve accurate, timely, consistent, and efficiently managed indicators across the data warehouse ecosystem.

Big DataData Warehousedata modeling
0 likes · 20 min read
Standardizing Metric Management in Didi’s Data Platform
Test Development Learning Exchange
Test Development Learning Exchange
Nov 22, 2024 · Artificial Intelligence

Introduction to Data Modeling with Scikit-Learn

This article provides a comprehensive guide to using Scikit-Learn for data modeling, covering linear regression and decision tree algorithms, including data preparation, model training, evaluation metrics, and visualization techniques for predictive analysis.

Data ScienceDecision TreesPython
0 likes · 4 min read
Introduction to Data Modeling with Scikit-Learn
High Availability Architecture
High Availability Architecture
Nov 22, 2024 · Backend Development

Designing a High‑Availability, Scalable Feed Stream System

This article introduces feed streams, explains their evolution from RSS to modern social feeds, classifies them by aggregation logic and display, discusses challenges such as real‑time performance and massive data, and presents a backend architecture with data models, pagination, write/read diffusion, and core publishing/reading workflows.

Backend ArchitectureScalabilitydata modeling
0 likes · 21 min read
Designing a High‑Availability, Scalable Feed Stream System
Youzan Coder
Youzan Coder
Nov 13, 2024 · Big Data

How a Unified Metric Service Transforms Data Queries with Headless BI

Facing inconsistent metrics and low reuse in siloed data services, the team built a unified metric service using a headless BI semantic layer and virtual data models, enabling consistent metric definitions, reusable data models, AI-friendly queries, and faster, scalable reporting across the organization.

Big DataHeadless BILLM integration
0 likes · 17 min read
How a Unified Metric Service Transforms Data Queries with Headless BI
DataFunSummit
DataFunSummit
Oct 26, 2024 · Big Data

Kuaishou Metric Middle Platform: Design, Architecture, and Practices

This article presents Kuaishou's metric middle platform, detailing its background, design principles, architecture, metric management, data modeling, unified analysis language OAX, federated query engine OCTO, acceleration strategies, and future directions, illustrating how it improves data quality, development efficiency, and analytical capabilities at scale.

AnalyticsBig DataData Platform
0 likes · 64 min read
Kuaishou Metric Middle Platform: Design, Architecture, and Practices
Data Thinking Notes
Data Thinking Notes
Oct 15, 2024 · Fundamentals

Why Data Modeling Matters: Unlock Business Value and Governance

This article explains how data modeling drives business value by sparking conversations about data meaning, enabling the creation of useful data objects, and guiding smart decisions on data capture, storage, usage, and integration, while also outlining a governance framework for managing enterprise data models.

Data GovernanceData ManagementEnterprise Data
0 likes · 4 min read
Why Data Modeling Matters: Unlock Business Value and Governance
Su San Talks Tech
Su San Talks Tech
Oct 12, 2024 · Databases

Master MySQL: Essential Database Design and Performance Guidelines

This article compiles comprehensive MySQL best‑practice guidelines covering charset selection, storage engine choice, naming conventions, field design, index optimization, SQL development tips, and operational procedures to improve performance, maintainability, and reliability of database systems.

Performance OptimizationSQL Best Practicesdata modeling
0 likes · 23 min read
Master MySQL: Essential Database Design and Performance Guidelines
DataFunTalk
DataFunTalk
Sep 28, 2024 · Big Data

Metric Management and Standardization in Didi's Data Platform

This article outlines Didi's approach to metric management, covering background, data product overview, and challenges in traditional and agile BI models, and presents a comprehensive solution for metric standardization, logical modeling, quality assurance, unified consumption, and future roadmap to improve data warehouse efficiency and consistency.

BIData Warehousedata modeling
0 likes · 21 min read
Metric Management and Standardization in Didi's Data Platform
Data Thinking Notes
Data Thinking Notes
Sep 9, 2024 · Fundamentals

Master the 6‑Step Blueprint for Building an Enterprise Data Middle Platform

This guide outlines a practical six‑step methodology—covering overall planning, data integration, model construction, data development, asset management, and data services—to help enterprises build a robust data middle platform that unlocks business value and supports agile digital transformation.

Data GovernanceData IntegrationData Platform
0 likes · 10 min read
Master the 6‑Step Blueprint for Building an Enterprise Data Middle Platform
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 3, 2024 · Big Data

Mastering Data Modeling: From Raw Data to Insightful Warehouses

This article walks through the fundamentals of data modeling, explaining what data is, the DIKW framework, why modeling matters, and detailing the end‑to‑end process from conceptual design through logical and physical layers, including DIM, DWD, DWS, and ADM tables with practical tips and naming conventions.

Data WarehouseETLdata modeling
0 likes · 11 min read
Mastering Data Modeling: From Raw Data to Insightful Warehouses
ITPUB
ITPUB
Aug 25, 2024 · Databases

How to Allocate Train Seats Across Segments Using SQL

This article recounts L's attempt to change a train ticket, analyzes seat availability across four segments, and presents a minimal SQL model with table creation, data insertion, and a query that determines feasible seat combinations, demonstrating how segment‑wise seat allocation can enable complete journey booking.

OracleSQLTravel Planning
0 likes · 9 min read
How to Allocate Train Seats Across Segments Using SQL
Eric Tech Circle
Eric Tech Circle
Aug 18, 2024 · Industry Insights

How Huawei Tackles Data Silos: Lessons from “Huawei Data Way”

Drawing from the book “Huawei Data Way”, this article explains why Huawei must digitize, describes the resulting data‑island problem, and outlines the four‑part framework of data governance—including data asset catalogs, standards, models, and distribution—while showing how business‑object‑centric information architecture is built and implemented.

Data GovernanceDigital TransformationHuawei
0 likes · 6 min read
How Huawei Tackles Data Silos: Lessons from “Huawei Data Way”
DataFunSummit
DataFunSummit
Aug 9, 2024 · Big Data

Design and Practice of Ant Group's Metric System

This article presents a comprehensive overview of Ant Group's metric system, covering its definition, three-layer architecture, common challenges, concept consensus methods, semantic layer options, mechanism design, productization capabilities, platform improvements, business outcomes, future directions, and a detailed Q&A session.

Big DataData Platformdata modeling
0 likes · 28 min read
Design and Practice of Ant Group's Metric System
21CTO
21CTO
Jul 30, 2024 · Databases

What Goes Around: 20‑Year Evolution of Database Systems and Future Trends

This article reviews two decades of database research, analyzing the rise and decline of various data models—from hierarchical and relational to NoSQL, vector, and graph databases—while highlighting how AI, cloud, and hardware advances are reshaping DBMS architecture and predicting which approaches will dominate tomorrow’s data landscape.

DBMS EvolutionNoSQLSQL
0 likes · 30 min read
What Goes Around: 20‑Year Evolution of Database Systems and Future Trends
DataFunTalk
DataFunTalk
Jul 27, 2024 · Big Data

Design and Implementation of Kuaishou's Metric Middle Platform

This article presents Kuaishou's metric middle platform, detailing its background, design principles, metric management and service architecture, including headless BI concepts, unified analysis language OAX, query engine OCTO, data modeling layers, acceleration strategies, and future directions toward intelligence and high performance.

Big DataHeadless BIKuaishou
0 likes · 19 min read
Design and Implementation of Kuaishou's Metric Middle Platform
Data Thinking Notes
Data Thinking Notes
Jul 22, 2024 · Fundamentals

Why Data Architecture Governance Is the Key to Successful Digital Transformation

Data architecture governance, encompassing standards, security, modeling, quality, and lifecycle management, is essential for digital transformation in fast‑growing industries like express delivery, and this article outlines current challenges, traditional approaches, and a practical, phased methodology with platform support to implement effective governance.

Data ArchitectureData GovernanceDigital Transformation
0 likes · 12 min read
Why Data Architecture Governance Is the Key to Successful Digital Transformation
DataFunSummit
DataFunSummit
Jul 19, 2024 · Artificial Intelligence

Risk Control in the Bulk Commodity Industry: Data‑Driven Solutions and Credit‑Risk Modeling by Ant Group

This article presents Ant Group's data‑driven approach to digital transformation and risk control in the bulk commodity sector, covering background challenges, data‑application pain points, core capabilities, credit‑risk models, data‑asset construction, indicator frameworks, and secure data integration for B2B scenarios.

commodity industrycredit riskdata modeling
0 likes · 14 min read
Risk Control in the Bulk Commodity Industry: Data‑Driven Solutions and Credit‑Risk Modeling by Ant Group
Data Thinking Notes
Data Thinking Notes
Jul 18, 2024 · Artificial Intelligence

How to Build and Apply a Scalable User Profile Tag System

This article explains how companies can integrate independent user‑profile tag systems into a unified framework, covering tag definitions, demand sources, classification, construction methods, update cycles, platform architecture, common algorithms, and practical applications such as marketing, KPI attribution, and A/B test analysis.

CDPdata modelingmarketing analytics
0 likes · 15 min read
How to Build and Apply a Scalable User Profile Tag System
Baidu Tech Salon
Baidu Tech Salon
Jul 11, 2024 · Industry Insights

How Baidu Feed Evolved Its Data Warehouse with Multi‑Version Wide Tables

This article outlines the step‑by‑step evolution of Baidu's Feed data warehouse—from traditional layered modeling to hour‑level core tables, then real‑time wide tables, and finally a flow‑batch integrated multi‑version wide‑table architecture—highlighting the motivations, design choices, challenges, and resulting benefits.

Big DataData WarehouseReal-time analytics
0 likes · 10 min read
How Baidu Feed Evolved Its Data Warehouse with Multi‑Version Wide Tables
Baidu Geek Talk
Baidu Geek Talk
Jul 8, 2024 · Big Data

Evolution of Feed Data Warehouse Wide-Table Modeling at Baidu App

Baidu’s Mobile Ecology team transformed its Feed data warehouse through three progressive stages—hour‑level core tables, a real‑time wide table, and a unified day‑level multi‑version table—consolidating traffic, content, and user data into a single partitioned wide‑table architecture that resolves granularity inconsistencies, cuts processing cost, and delivers real‑time to daily latency for diverse analytics.

Real-TimeSparkWide Table
0 likes · 10 min read
Evolution of Feed Data Warehouse Wide-Table Modeling at Baidu App
Data Thinking Notes
Data Thinking Notes
Jun 27, 2024 · Fundamentals

How to Build Effective Data Standards for Enterprise Governance

This article explains the concept of data standards, outlines the three main categories of data standards, describes a four‑stage implementation process, and provides a real‑world bank case study to illustrate how enterprises can establish and apply data standards for better data quality and value.

Data GovernanceData QualityEnterprise Data
0 likes · 11 min read
How to Build Effective Data Standards for Enterprise Governance
DataFunTalk
DataFunTalk
Jun 27, 2024 · Big Data

Data Warehouse Construction and Data Governance Practices at Wing Payment

This presentation by senior data warehouse engineer Huang Luo details Wing Payment’s end‑to‑end data warehouse build, covering background challenges, governance framework, platform architecture, layered modeling, naming standards, asset management, monitoring, and future plans, illustrating how systematic data governance drives cost reduction, efficiency, and security.

AnalyticsBig DataData Governance
0 likes · 14 min read
Data Warehouse Construction and Data Governance Practices at Wing Payment
Data Thinking Notes
Data Thinking Notes
Jun 2, 2024 · Big Data

How JD Retail’s Data Platform Boosts Efficiency with Unified Modeling and AI‑Driven Insights

This article details JD Retail’s end‑to‑end data platform, covering data asset certification, 5W2H modeling, unified query DSL, intelligent acceleration, robust governance, visualization components, low‑code orchestration, and large‑model AI applications that together reduce query latency, cut development costs, and empower analysts across the retail business.

AIBig DataData Governance
0 likes · 39 min read
How JD Retail’s Data Platform Boosts Efficiency with Unified Modeling and AI‑Driven Insights
Data Thinking Notes
Data Thinking Notes
May 30, 2024 · Databases

Why Your Data Team Is Drowning in Requests—and How OLAP Can Save You

This article examines why data departments get overwhelmed by massive data‑retrieval requests, identifies root causes such as mindset, requirement handling, and lack of tools, and presents a technical solution centered on dimensional modeling and OLAP multi‑dimensional reporting to streamline data access and empower teams.

Big DataData WarehouseOLAP
0 likes · 12 min read
Why Your Data Team Is Drowning in Requests—and How OLAP Can Save You
DataFunTalk
DataFunTalk
May 28, 2024 · Big Data

Building and Managing a Metric System in Data Warehouse: Practices from Dongchedi

This article details how the Dongchedi business team designs, implements, and monitors a comprehensive metric system within its data warehouse, covering metric standards, model construction, metadata management, quality monitoring, application scenarios, and future directions using the DataLeap platform.

Big DataData GovernanceData Warehouse
0 likes · 18 min read
Building and Managing a Metric System in Data Warehouse: Practices from Dongchedi
Top Architect
Top Architect
May 25, 2024 · Mobile Development

Cross‑Platform Architecture for WeChat Pay: Reducing Code, Improving Quality and Productivity

This article describes how a C++‑based cross‑platform framework and a unified routing mechanism were built for WeChat Pay to eliminate platform‑specific bugs, streamline data flow, improve crash stability, cut code size by nearly 45%, and boost development productivity across iOS and Android.

CMobile DevelopmentSoftware Architecture
0 likes · 16 min read
Cross‑Platform Architecture for WeChat Pay: Reducing Code, Improving Quality and Productivity
Data Thinking Notes
Data Thinking Notes
May 9, 2024 · Big Data

How to Build an Effective Indicator System: From Concept to Productization

This article explores the complete lifecycle of an indicator system—from defining metrics and addressing common ambiguities, through designing concept consensus, semantic layers, mechanisms, and governance, to productizing platforms, optimizing development, and envisioning future AI‑driven enhancements.

Big DataData PlatformIndicator System
0 likes · 22 min read
How to Build an Effective Indicator System: From Concept to Productization
Sanyou's Java Diary
Sanyou's Java Diary
Apr 30, 2024 · Fundamentals

Mastering Architecture Diagrams: When, Why, and How to Build Clear System Blueprints

This comprehensive guide explains the purpose of architecture diagrams, the criteria for good diagrams, the optimal moments to create them, and detailed methods for drawing business, application, technical, code, and data architecture diagrams, complete with design principles, classification, and practical tips.

Technical architecturearchitecture diagramsbusiness architecture
0 likes · 21 min read
Mastering Architecture Diagrams: When, Why, and How to Build Clear System Blueprints
Model Perspective
Model Perspective
Apr 21, 2024 · Fundamentals

Unlocking Grey Theory: Predicting with Incomplete Data

Grey Theory, introduced by Deng Julong in 1982, offers a mathematical framework for analyzing systems with incomplete or uncertain data, using techniques like generated series and the GM(1,1) model to enable reliable forecasting and decision‑making across fields such as economics, environment, and product lifecycle analysis.

Grey TheoryLimited Datadata modeling
0 likes · 8 min read
Unlocking Grey Theory: Predicting with Incomplete Data
vivo Internet Technology
vivo Internet Technology
Apr 17, 2024 · Big Data

Retention Analysis Model Practice Based on ClickHouse

The article explains retention analysis models, their importance for user loyalty, outlines offline Hive architecture, then shows how ClickHouse’s retention() function and columnar storage dramatically speed up multi‑day retention calculations, providing SQL examples and practical guidance for product analytics.

ClickHouseHiveRetention Analysis
0 likes · 17 min read
Retention Analysis Model Practice Based on ClickHouse
Architect
Architect
Mar 25, 2024 · Backend Development

Designing Payment Business Architecture: Process Decomposition, Sequence Diagram, and Structural Design

The article explains how to analyze, decompose, and design a payment system by breaking the workflow into modules, illustrating pre‑payment, third‑party integration, and post‑payment stages with sequence diagrams, and proposing a data‑structure model that covers account management, transaction records, order handling, and related business components.

Sequence Diagramarchitecturedata modeling
0 likes · 10 min read
Designing Payment Business Architecture: Process Decomposition, Sequence Diagram, and Structural Design
Yum! Tech Team
Yum! Tech Team
Mar 1, 2024 · Operations

Building an Observability System Traffic Distribution Diagram

This article explains how to design and implement a traffic distribution diagram for an observability system, covering current cloud‑native tooling, data standardization, transformation, traffic‑flow modeling, aggregation, storage with ClickHouse, and visualisation techniques such as Sankey diagrams.

Cloud NativeObservabilitydata modeling
0 likes · 7 min read
Building an Observability System Traffic Distribution Diagram
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 26, 2024 · Big Data

How Heartbeat Game Built a Cloud‑Native Big Data Platform with Alibaba DataWorks

This article explains how Heartbeat Game created a cloud‑native big data platform on Alibaba Cloud using DataWorks, detailing the end‑to‑end data pipeline, a universal logical data model for games, and the advantages of the GS‑LBDM architecture for AI, risk control, and analytics scenarios.

Alibaba Clouddata modelinggaming analytics
0 likes · 12 min read
How Heartbeat Game Built a Cloud‑Native Big Data Platform with Alibaba DataWorks
DataFunTalk
DataFunTalk
Feb 8, 2024 · Big Data

Design and Practice of Ant Group's Metric System

This talk by Ant Group’s senior technical expert Wang Gaohang details the definition, design, mechanism, productization, and future outlook of the company’s metric system, covering concept consensus, semantic layers, workflow, AI assistance, performance optimization, and practical case studies.

AIBig DataData Platform
0 likes · 28 min read
Design and Practice of Ant Group's Metric System
DevOps
DevOps
Jan 17, 2024 · Operations

Agile Data Management: Principles, Practices, and Implementation Guide

This article explains how agile methodologies can be applied to data management, covering the need for agile data practices, core principles, iterative modeling, governance, CI/CD pipelines, tooling, metrics, security, case studies, challenges, and future outlooks in a comprehensive, step‑by‑step guide.

Data GovernanceData ManagementDataOps
0 likes · 13 min read
Agile Data Management: Principles, Practices, and Implementation Guide
DataFunSummit
DataFunSummit
Dec 25, 2023 · Big Data

Xiaomi Sales Data Warehouse: Architecture, Construction Theory, and Practices

This article presents a comprehensive overview of Xiaomi's sales data warehouse, covering its evolution, dimensional modeling and layer theory, Lambda architecture with batch and streaming processing, capability layers, security measures, and future trends toward real‑time metricization and data value creation.

data modeling
0 likes · 14 min read
Xiaomi Sales Data Warehouse: Architecture, Construction Theory, and Practices
Zhuanzhuan Tech
Zhuanzhuan Tech
Dec 14, 2023 · Big Data

Design and Implementation of a Data Service Platform for New Media Business

This article details the background, challenges, design principles, and implementation of a unified data service platform—including data modeling, multi-source governance, real-time processing, and a Doris-based storage solution—to support large‑scale video data for a new media operation.

Apache DorisData GovernanceData Platform
0 likes · 7 min read
Design and Implementation of a Data Service Platform for New Media Business
Open Source Linux
Open Source Linux
Dec 8, 2023 · Databases

What Is NoSQL? Uses, Architecture, and How It Differs from Relational Databases

This article explains what NoSQL databases are, outlines their typical use cases and architectural components, compares them with traditional relational databases across storage, scalability, query, and transaction aspects, and highlights the advantages and trade‑offs to consider when choosing a data solution.

NoSQLScalabilityarchitecture
0 likes · 7 min read
What Is NoSQL? Uses, Architecture, and How It Differs from Relational Databases
Big Data Technology & Architecture
Big Data Technology & Architecture
Dec 5, 2023 · Big Data

NetEase EasyData Metric Middle Platform: Architecture, Core Technologies, and Future Plans

This article details NetEase EasyData's evolution and product matrix, explains why a metric middle platform is needed, describes its core technical architecture—including a unified logical semantic model, a custom metric query language, and engine decoupling—and outlines future development directions.

AnalyticsBig DataData Governance
0 likes · 12 min read
NetEase EasyData Metric Middle Platform: Architecture, Core Technologies, and Future Plans
DataFunTalk
DataFunTalk
Dec 4, 2023 · Artificial Intelligence

OPPO’s Unified Modeling and Smart Power Strategies for App Distribution and User Value

The article details OPPO’s approach to balancing cost reduction and user value in app distribution through unified cross‑scenario modeling, sparse‑data solutions, oCPX advertising optimization, and a multi‑stage smart‑power system that improves efficiency, scalability, and revenue while preserving user experience.

App DistributionOPPOSmart Power
0 likes · 12 min read
OPPO’s Unified Modeling and Smart Power Strategies for App Distribution and User Value
Architects Research Society
Architects Research Society
Nov 27, 2023 · Databases

Formal Naming of Data Schemas, Structures, and Models: Distinctions and Methodology

The article explains the differences between data schemas, data structures, and data models, proposes a systematic naming approach, and outlines a five‑schema architecture—including business, view, logical, deployment, and physical schemas—while addressing terminology challenges and normalization processes.

Data StructureDatabase designdata modeling
0 likes · 12 min read
Formal Naming of Data Schemas, Structures, and Models: Distinctions and Methodology
JD Tech
JD Tech
Oct 25, 2023 · Backend Development

Design and Implementation of JD Logistics Order System Architecture for High Scalability and Availability

The article details JD Logistics' order system redesign using a four‑layer transaction architecture, describing its decoupled backend, unified data model, high‑availability components such as CQRS, Redis, JMQ, HBase, and Elasticsearch, and outlines design advantages, extensible data modeling, future challenges, and overall performance outcomes.

Backend ArchitectureDistributed SystemsOrder Management
0 likes · 10 min read
Design and Implementation of JD Logistics Order System Architecture for High Scalability and Availability
DataFunTalk
DataFunTalk
Oct 23, 2023 · Big Data

Alibaba Cloud DataWorks Intelligent Data Modeling: Practices, Challenges, and Solutions

This article introduces Alibaba Cloud DataWorks' intelligent data modeling tool, outlines the data demand flow, shares best practices and hands‑on demonstrations for data warehouse modeling, discusses common challenges and their solutions, and provides Q&A and product details for developers and data engineers.

Alibaba CloudBig DataData Warehouse
0 likes · 12 min read
Alibaba Cloud DataWorks Intelligent Data Modeling: Practices, Challenges, and Solutions
dbaplus Community
dbaplus Community
Oct 14, 2023 · Big Data

What Is a Data Warehouse? From Basics to Modern Practices

This article explains what a data warehouse is, contrasts it with traditional databases, outlines the evolution from classic to internet‑scale warehouses, details modeling approaches and layered architectures, discusses KPI dictionaries, date dimensions, naming standards, data governance, incremental loading techniques, and upstream/downstream coordination.

Big DataData GovernanceETL
0 likes · 25 min read
What Is a Data Warehouse? From Basics to Modern Practices
Big Data Technology & Architecture
Big Data Technology & Architecture
Oct 7, 2023 · Big Data

Comprehensive Guide to OLAP Optimization and ClickHouse Performance Tuning

This article explains how to optimize OLAP workloads by balancing normalization and denormalization, applying data sharding, replication, indexing, partitioning, materialized views, columnar storage, compression, and lifecycle management, and provides practical ClickHouse SQL examples for index creation, partitioning, and query plan analysis.

ClickHouseOLAPPartitioning
0 likes · 15 min read
Comprehensive Guide to OLAP Optimization and ClickHouse Performance Tuning
DataFunTalk
DataFunTalk
Sep 30, 2023 · Big Data

Building a Marketing‑Oriented Data Middle Platform: Concepts and Practices

This article outlines how a marketing‑focused data middle platform can be constructed by integrating online and offline behavior data, business data, and third‑party sources, then applying data integration, modeling, processing, and application capabilities to enable data‑driven user journeys and personalized marketing strategies.

Big DataData Integrationdata modeling
0 likes · 13 min read
Building a Marketing‑Oriented Data Middle Platform: Concepts and Practices
Architect
Architect
Sep 28, 2023 · Databases

How to Pick the Best Storage Engine for High‑Throughput Browsing Records: Redis, MySQL or Tair?

This article walks through a real‑world e‑commerce scenario where billions of daily browsing events generate over 100K TPS writes, evaluates storage options on reliability, cost, read/write performance and implementation difficulty, and ultimately recommends Tair after detailed analysis of List, Sorted‑Set and Hash structures, code examples, and concurrency controls.

Tairconcurrencydata modeling
0 likes · 15 min read
How to Pick the Best Storage Engine for High‑Throughput Browsing Records: Redis, MySQL or Tair?
JD Cloud Developers
JD Cloud Developers
Sep 28, 2023 · Backend Development

Designing a Scalable, High‑Availability Order System: Architecture Insights

This article details the design of a decoupled, high‑availability order system, covering business scope, value propositions, layered architecture, real‑time data layer, read/write separation, caching, messaging, search, multi‑tenant support, data security, and future challenges such as personalized queries and cost‑effective scaling.

Backend ArchitectureScalabilitydata modeling
0 likes · 12 min read
Designing a Scalable, High‑Availability Order System: Architecture Insights
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 13, 2023 · Big Data

How to Quickly Land as a Data Engineer in a New Company

This guide explains how data engineers can rapidly adapt to a new workplace by mastering business context, data domains, and system architecture, using structured learning, practical case studies, and continuous reflection to earn trust and deliver value efficiently.

OnboardingSystem Architecturebusiness knowledge
0 likes · 15 min read
How to Quickly Land as a Data Engineer in a New Company
dbaplus Community
dbaplus Community
Aug 21, 2023 · Databases

MySQL vs Elasticsearch: Choosing the Right Database for Your Needs

This article compares MySQL and Elasticsearch across data models, query languages, indexing, distributed architecture, performance, scalability, and typical use cases, highlighting their distinct strengths and trade‑offs to help developers decide which system—or combination—best fits specific application requirements.

data modelingdatabase comparisonmysql
0 likes · 12 min read
MySQL vs Elasticsearch: Choosing the Right Database for Your Needs