Tagged articles
193 articles
Geek Labs
Apr 3, 2026 · Product Management

Essential AI Skills Every Product Manager Needs: From Ideation to Data Tracking

This guide lists six open‑source AI skills that help product managers turn vague ideas into concrete designs, write structured PRDs, break requirements into executable plans, set up rigorous A/B experiments, implement analytics tracking, and optimize new‑user onboarding, each with install commands and usage examples.

A/B testing · AI · Analytics
0 likes · 9 min read
Alibaba Cloud Developer
Jan 14, 2026 · Artificial Intelligence

How DataAgent Turns AI into a Virtual Data Analyst for Enterprise Insights

DataAgent, built on Spring AI Alibaba, tackles the "last mile" of AI data analysis by combining deterministic workflow orchestration with large‑model reasoning, offering human‑in‑the‑loop feedback, dynamic prompt configuration, hybrid retrieval, containerized Python execution, streaming SSE, multi‑model scheduling, multi‑source connectivity, and secure API‑key management to deliver instant, insight‑rich reports for business users.

AI · Analytics · Automation
0 likes · 11 min read
Big Data Tech Team
Jan 12, 2026 · Fundamentals

Why Wide Tables Are Essential in DWS Layer: 10 Real-World Modeling Scenarios

This article explains the purpose of the DWS (Data Warehouse Service) layer, why wide‑table modeling is crucial for performance and service‑oriented interfaces, and provides ten practical wide‑table designs with core field definitions, CREATE TABLE statements, and sample INSERT queries for common business domains such as products, users, orders, regions, channels, suppliers, services, finance, logistics, and data quality monitoring.

Analytics · ETL · SQL
0 likes · 34 min read
DataFunSummit
Nov 27, 2025 · Big Data

How BMW Turned Data Into Growth: A Sensors Data Case Study

This article details BMW's digital transformation journey using Sensors Data, covering the background of rapid app growth, the cross‑regional data collection challenges, the systematic solution architecture—including mapping, preprocessing, and historical data migration—and the resulting business impact and future AI‑driven roadmap.

Analytics · Big Data · Digital Transformation
0 likes · 13 min read
High Availability Architecture
Nov 14, 2025 · Artificial Intelligence

Quantifying AI Programming Efficiency: A Traceable and Measurable System

This article outlines the challenges of tracking AI‑generated code and measuring AI contribution, reviews earlier ad‑hoc methods, and presents a comprehensive solution featuring a VSCode plugin for unified AI dialogue management and a cloud service that quantifies AI impact across projects, teams, and individual developers.

AI · Analytics · Metrics
0 likes · 9 min read
DataFunSummit
Sep 22, 2025 · Artificial Intelligence

Explore Cutting-Edge AI-Driven Data Governance: A Comprehensive Resource Guide

This article presents a curated list of cutting‑edge topics covering AI‑powered data governance, large‑model applications, intelligent operations, and advanced analytics, offering readers a concise overview of emerging practices and case studies from industry leaders.

AI · Analytics · Intelligent Operations
0 likes · 2 min read
DataFunTalk
Sep 22, 2025 · Big Data

How Kuaishou Scales Intelligent BI: Insights from Its Data Platform

This article outlines Kuaishou's Data Platform team's mission to boost data‑driven decision making through advanced compute engines, high‑performance services, and AI‑enhanced BI, detailing its architecture, challenges, solutions, and future outlook for large‑scale intelligent analytics.

AI · Analytics · BI
0 likes · 6 min read
DataFunTalk
Sep 19, 2025 · Big Data

How Kuaishou’s Data Platform Powers Intelligent BI with AI and Big Data

This article outlines how Kuaishou’s Data Platform Department enhances decision‑making efficiency by building advanced compute engines and high‑performance services, detailing the platform’s architecture, challenges of intelligent BI, AI‑driven solutions, and the end‑to‑end BI workflow from data ingestion to analysis.

Analytics · BI · Big Data
0 likes · 5 min read
DataFunTalk
Sep 15, 2025 · Artificial Intelligence

Unlocking the Future: AI-Driven Data Governance and Large Model Innovations

This article presents a curated catalog of cutting‑edge topics covering AI‑powered data governance, large‑model applications, data cleaning, compliance, lakehouse integration, intelligent operations, and generative analytics, inviting readers to explore the latest innovations and download the full e‑book via QR code.

AI · Analytics · Data Governance
0 likes · 2 min read
DataFunSummit
Sep 6, 2025 · Artificial Intelligence

Explore Cutting-Edge AI‑Driven Data Governance: Full Topic Catalog

This article presents a comprehensive catalog of cutting‑edge AI and large‑model topics, covering financial data governance, proactive metadata systems, data cleaning compliance, lake‑warehouse integration, intelligent operations, generative analytics, and QR‑code access to the full e‑book.

AI · Analytics · Data Governance
0 likes · 2 min read
DataFunTalk
Aug 25, 2025 · Artificial Intelligence

Unlock AI-Powered Data Governance: Insights from Leading Industry Cases

This article presents a curated collection of cutting‑edge case studies and research on AI‑driven data governance, large‑model data cleaning, intelligent operations, and innovative analytics from industry leaders such as JD, Alibaba Cloud, Ping An, Didi, and Kuaishou.

AI · Analytics · Case Studies
0 likes · 2 min read
DataFunSummit
Aug 23, 2025 · Artificial Intelligence

Explore Cutting-Edge AI-Driven Data Governance and Analytics Resources

This article presents a curated list of cutting‑edge resources covering AI‑powered data governance, large‑model driven data cleaning, intelligent operations, and generative analytics techniques from industry leaders such as JD.com, Alibaba Cloud, Ping An Life, Didi, and Kuaishou.

AI · Analytics · cloud computing
0 likes · 2 min read
Alibaba Cloud Developer
Feb 27, 2025 · Databases

Boosting PostgreSQL Analytics with DuckDB: Architecture, Optimizations, and Performance Gains

This article explains how integrating DuckDB as an extension for RDS PostgreSQL creates a unified HTAP solution that dramatically accelerates complex analytical queries through columnar storage, vectorized execution, and advanced optimizer techniques, delivering up to hundreds‑fold performance improvements and superior compression.

Analytics · Columnar · Database Optimization
0 likes · 11 min read
DataFunSummit
Dec 20, 2024 · Big Data

Douyin Group's Data Management: Strategies for Metric Construction, Management, Production, and Consumption

This article outlines Douyin Group's approach to handling massive EB‑scale data, describing the challenges of metric quality and efficiency, the Volcano Engine data platform architecture, three‑layer solutions for metric production, management and consumption, and future plans for automation and governance.

Analytics · Big Data · Data Platform
0 likes · 19 min read
DevOps Engineer
Nov 28, 2024 · Backend Development

Reviving GitStats: Modernizing an Old Git History Statistics Tool

The author recounts reviving the dormant GitStats project by migrating it to Python 3, adding CI/CD pipelines, publishing it on PyPI, providing Docker images and an online preview, while outlining future improvements and inviting community contributions.

Analytics · Git · GitStats
0 likes · 5 min read
Big Data Technology & Architecture
Nov 7, 2024 · Big Data

Douyin Group's Data Management Strategies: Enhancing Metric Stability and Reusability

This article outlines Douyin Group's approach to handling petabyte‑scale data, addressing metric inconsistencies, and improving data product agility through a four‑layer Volcano Engine platform, systematic indicator production‑management‑consumption cycles, organizational design, automation, and future plans for large‑model‑driven metric splitting.

Analytics · Automation · Big Data
0 likes · 20 min read
DataFunSummit
Oct 26, 2024 · Big Data

Kuaishou Metric Middle Platform: Design, Architecture, and Practices

This article presents Kuaishou's metric middle platform, detailing its background, design principles, architecture, metric management, data modeling, unified analysis language OAX, federated query engine OCTO, acceleration strategies, and future directions, illustrating how it improves data quality, development efficiency, and analytical capabilities at scale.

Analytics · Big Data · Data Platform
0 likes · 64 min read
Software Development Quality
Oct 7, 2024 · Fundamentals

Unlocking Data Thinking: How to Turn Numbers into Actionable Insights

This article explains the concept of data thinking, its core components of data sensitivity and methodological experience, outlines a step‑by‑step data analysis process, and shows why cultivating this mindset improves decision‑making, communication efficiency, and business opportunity discovery across various domains.

Analytics · Business Intelligence · data analysis
0 likes · 16 min read
21CTO
Sep 17, 2024 · Big Data

Why AWS Donated OpenSearch to the Linux Foundation and Its Impact on Search

Amazon Web Services transferred its OpenSearch project—a fork of Elasticsearch and Kibana—to the newly formed OpenSearch Software Foundation under the Linux Foundation, gaining vendor‑neutral governance and support from members like AWS, Uber, Canonical, and Aiven, to foster broader community development of search, analytics, and vector database applications.

Analytics · Big Data · Linux Foundation
0 likes · 4 min read
StarRocks
Aug 9, 2024 · Big Data

How Pinterest Cut Query Latency by 50% with StarRocks Migration

Pinterest migrated its Partner Insights analytics from Druid to StarRocks, achieving a 50% reduction in p90 latency, a six‑fold cost‑performance improvement, and simplified data ingestion, illustrating the benefits of a modern MPP database for real‑time ad analytics.

Analytics · MPP · Pinterest
0 likes · 6 min read
DaTaobao Tech
Jul 17, 2024 · Frontend Development

Page Data Tracking and SPM for Front-End Development

Front‑end developers must embed unique page identifiers in URLs, capture incoming source parameters, and propagate them through outbound links so that traffic, source, interaction and conversion metrics can be automatically recorded by the SDK, enabling data‑driven product optimization and business insight.

Analytics · Data Tracking · Web
0 likes · 17 min read
DataFunTalk
Jul 7, 2024 · Product Management

User Growth Strategies: From Information Management to a Data‑Driven Flywheel

This article shares a data‑centric perspective on user growth, covering the evolution of information management, distribution and production, the concept of entropy reduction in products, the data‑driven flywheel model, practical AB‑testing case studies, and a Q&A on analytics tools and team collaboration.

Analytics · Data-driven · product-management
0 likes · 16 min read
DataFunTalk
Jun 27, 2024 · Big Data

Data Warehouse Construction and Data Governance Practices at Wing Payment

This presentation by senior data warehouse engineer Huang Luo details Wing Payment’s end‑to‑end data warehouse build, covering background challenges, governance framework, platform architecture, layered modeling, naming standards, asset management, monitoring, and future plans, illustrating how systematic data governance drives cost reduction, efficiency, and security.

Analytics · Big Data · Data Governance
0 likes · 14 min read
Baidu Tech Salon
Jun 12, 2024 · Big Data

Event Tracking Governance: Concepts, Challenges, and Platform Solutions

Event‑tracking governance ensures accurate, consistent user‑behavior data by managing the full lifecycle of logging points through defined quality standards, a digitized workflow, and supporting tools such as rule editors, real‑time testing, and compliance monitoring, while the platform’s page‑scene tree model and metrics improve visibility, reduce duplication, and drive business insight.

Analytics · Data Quality · Tooling
0 likes · 13 min read
Baidu Geek Talk
May 8, 2024 · Artificial Intelligence

Sugar BI: AI‑Driven Next‑Generation Business Intelligence Platform

Sugar BI, evolving from the internal ShowX platform to versions 2.0‑4.0, now offers a zero‑code, drag‑and‑drop visual editor, support for over 30 data sources, AI‑powered automatic analysis and the Sugar Bot Q&A module that transforms multi‑day data tasks into minutes, delivering containerized SaaS BI with intelligent chart recommendation and rapid, code‑free decision‑making for enterprises.

AI · Analytics · BI
0 likes · 19 min read
DataFunTalk
May 4, 2024 · Big Data

JD Retail Data Visualization Platform: Product Practice and Insights

This article presents an in‑depth overview of JD.com’s retail data visualization platform, detailing its product matrix—including EasyBI, a low‑code platform, and JDV large‑screen tool—its architectural layers, key capabilities, business case studies, challenges faced, and future development directions.

Analytics · Big Data · Data visualization
0 likes · 14 min read
DataFunSummit
May 2, 2024 · Big Data

Building an Attribution System for NetEase Cloud Music Data Warehouse: Challenges and Solutions

This article presents the problems faced by NetEase Cloud Music's data warehouse attribution system and details a comprehensive solution that includes upgrading the event‑tracking framework, redesigning the attribution model, and launching a unified management platform to improve stability, accuracy, and scalability.

Analytics · Big Data · Data Warehouse
0 likes · 13 min read
DataFunSummit
Feb 26, 2024 · Big Data

Building a New Lakehouse Analytics Paradigm with StarRocks and Paimon

This article introduces a new lakehouse analytics paradigm by combining StarRocks and Paimon, covering the evolution of data lake technologies, key integration scenarios, core technical mechanisms such as JNI connectors, materialized views, and future roadmap for enhanced lakehouse capabilities.

Analytics · Big Data · Data Lake
0 likes · 16 min read
ByteFE
Jan 26, 2024 · Frontend Development

A Comprehensive Guide to Frontend Event Tracking (埋点)

This article explains what frontend event tracking (埋点) is, why it is essential for product analytics, when and how to implement it, the different tracking models and reporting methods, as well as practical tips, iteration processes, and common pitfalls for developers and product teams.

Analytics · Web · data collection
0 likes · 18 min read
Rare Earth Juejin Tech Community
Jan 25, 2024 · Frontend Development

Front-End Event Tracking (埋点) – Fundamentals, Types, and Best Practices

This article provides a comprehensive guide to front‑end event tracking, covering its definition, motivations, scenarios, various tracking types, data models, reporting mechanisms, implementation steps, security considerations, and practical tips for ensuring accurate and non‑blocking data collection in web applications.

Analytics · data collection · event tracking
0 likes · 23 min read
政采云技术
Jan 24, 2024 · Big Data

A Business Analyst’s End‑to‑End Journey Using a Data Middle Platform: From Issue Identification to Data‑Driven Solutions

The article walks through a detailed, real‑world scenario where a business analyst leverages a data middle platform—covering metric analysis, exploratory queries, data development, visualization, and productization—to diagnose a 30% sales decline and implement data‑driven remediation, illustrating core concepts such as OneData and OneService.

Analytics · Data Development · Data Governance
0 likes · 12 min read
Architects Research Society
Jan 15, 2024 · Fundamentals

Evolution of Software Architecture Styles and Domains

This article outlines the evolution of software architecture styles, describing various architectural domains and sub‑domains—from web and mobile applications to integration, data, and analytics architectures—and their typical implementations, illustrated with a detailed classification table.

Analytics · Data Architecture · Domain Architecture
0 likes · 8 min read
Architects Research Society
Jan 2, 2024 · Big Data

Understanding Data Lakes: Concepts, Benefits, Challenges, and Comparison with Data Warehouses

This article explains what a data lake is, its origins, key characteristics such as collecting all data, enabling diverse user access, and flexible processing, compares it with traditional data warehouses, discusses cost advantages, potential pitfalls like data swamps, and outlines best‑practice considerations for enterprise adoption.

Analytics · Data Architecture · Data Lake
0 likes · 10 min read
Big Data Technology & Architecture
Dec 5, 2023 · Big Data

NetEase EasyData Metric Middle Platform: Architecture, Core Technologies, and Future Plans

This article details NetEase EasyData's evolution and product matrix, explains why a metric middle platform is needed, describes its core technical architecture—including a unified logical semantic model, a custom metric query language, and engine decoupling—and outlines future development directions.

Analytics · Big Data · Data Governance
0 likes · 12 min read
Architects Research Society
Nov 26, 2023 · Big Data

Data Lake vs Data Warehouse: Key Differences and How to Choose

Data lakes and data warehouses serve different purposes in big‑data architectures; this article explains their definitions, core attributes, five major distinctions—including data retention, type support, user coverage, adaptability, and insight speed—and offers guidance on selecting or combining the two approaches.

Analytics · Data Architecture · Data Lake
0 likes · 12 min read
Weimob Technology Center
Oct 13, 2023 · Big Data

Optimizing StarRocks Tables: Design Tips, Real‑World Cases and Monitoring Strategies

This article explains how to design efficient StarRocks tables with proper field types, partitioning and bucketing, compares update and primary‑key models, presents real‑world cases of memory and tablet issues, provides a complete table‑creation example, and outlines comprehensive monitoring metrics to keep the analytical data warehouse performant and stable.

Analytics · Partitioning · StarRocks
0 likes · 25 min read
Sohu Tech Products
Oct 11, 2023 · Industry Insights

How StarRocks Materialized Views Power Real‑Time Lakehouse Analytics

The article provides a deep technical overview of StarRocks 3.0’s data‑lake analysis capabilities, its unified Lakehouse architecture, Catalog integration, Trino compatibility, extensive I/O optimizations, materialized view features, resource isolation techniques, real‑world use cases, and future development directions.

Analytics · Data Lake · Lakehouse
0 likes · 22 min read
37 Interactive Technology Team
Sep 18, 2023 · Product Management

How to Build, Productize, and Iterate an Analytical Data Product

The guide explains how to create an analytical data product by first defining the business scenario and KPI, selecting and abstracting an analysis framework into reusable modules, visualizing core metrics across dimensions, and continuously iterating through cold‑start, promotion, and maintenance phases to keep the product aligned with evolving business needs.

Analytics · Business Intelligence · Data Product
0 likes · 18 min read
21CTO
Jul 5, 2023 · Databases

Why MariaDB Is More Than Just a MySQL Fork: Exploring Its Powerful Tools

This article explains how MariaDB evolved from a MySQL branch into a comprehensive database ecosystem, covering the Community and Enterprise servers, diverse storage engines, MaxScale proxy, ColumnStore analytics, Xpand distributed scaling, and the SkySQL fully managed cloud service.

Analytics · MariaDB · Storage Engine
0 likes · 8 min read
Architects Research Society
Jul 1, 2023 · Artificial Intelligence

What Is Data Science? Definitions, Work Processes, and Roles – Reflections on a Decade of Data Science and Future Visualization Tools

This article reviews a decade of data‑science growth, defines data science as a multidisciplinary field, outlines its four high‑level and fourteen low‑level work processes, categorises nine distinct data‑science roles, and discusses how these insights should shape the next generation of data‑visualisation and analysis tools.

AI · Analytics · Data Science
0 likes · 12 min read
DataFunSummit
Jun 4, 2023 · Databases

From Apache Doris to SelectDB: Evolution Towards the Next‑Generation Cloud‑Native Data Warehouse

This presentation introduces Apache Doris, examines changing data analysis demands in the cloud era, explains why SelectDB was created, and details SelectDB’s cloud‑native architecture, performance, unified capabilities, ease of use, cost efficiency, open‑source nature, and its application scenarios for modern data warehousing and log analytics.

Analytics · Apache Doris · Cloud-native
0 likes · 15 min read
360 Tech Engineering
May 23, 2023 · Operations

Data‑Driven Growth: Underlying Logic, Case Studies, and Essential Factors

The article explains how data‑driven thinking replaces traditional money‑burning growth tactics by establishing logical loops, experimental validation, and concrete case studies in acquisition, activation, and targeting, while outlining the essential collaborative factors needed for successful data‑powered operations.

Analytics · Data-driven · Growth
0 likes · 10 min read
Big Data Technology Architecture
Apr 19, 2023 · Big Data

Why the Big Data Era Is Over

The article argues that the era of big data is ending, showing that most organizations store only modest amounts of data, that storage costs outweigh benefits, and that modern cloud and analytics tools allow efficient processing without needing massive datasets.

Analytics · Big Data · Data Management
0 likes · 16 min read
Architects Research Society
Apr 12, 2023 · Databases

Introduction to Time Series Data and Best Practices with MongoDB

This article introduces time series data concepts, outlines the challenges of storing and analyzing high‑frequency data, and presents best‑practice guidelines for building MongoDB‑based time‑series applications, covering ingestion, read/write workloads, retention, security, and real‑world use cases.

Analytics · Database design · MongoDB
0 likes · 12 min read
DataFunSummit
Mar 24, 2023 · Big Data

Kuaishou Metric Middle Platform: Development Journey, Architecture, and Practices

This article summarizes the Kuaishou Data Platform’s metric middle‑platform sharing from the 2022 DataFun forum, detailing its three‑year evolution, key concepts, architectural design, implementation challenges, and practical lessons for building an enterprise‑grade metric platform that unifies data definition, production, and consumption across the company.

Analytics · Kuaishou · metric governance
0 likes · 20 min read
NetEase Cloud Music Tech Team
Mar 14, 2023 · Frontend Development

How Dawn’s VTree Revolutionizes Cross‑Platform Event Tracking

NetEase Cloud Music’s open‑source Dawn framework introduces a high‑performance virtual tree (VTree) to deliver a unified, low‑cost, and highly accurate cross‑platform event‑tracking solution, addressing common pain points such as development overhead, precision, model stability, link tracing, and quality control.

Analytics · VTree · cross-platform
0 likes · 11 min read
Alibaba Cloud Big Data AI Platform
Mar 13, 2023 · Big Data

Unlocking Big Data with Alibaba Cloud’s Native Data Lake Solution

Alibaba Cloud’s cloud‑native data lake analysis solution combines fully managed storage (OSS‑HDFS), a one‑stop lake management platform (Data Lake Formation), and multimodal compute capabilities, delivering high performance, massive scalability, and low cost for big‑data and AI workloads across offline, real‑time, and lake‑house scenarios.

Analytics · Big Data · Cloud Native
0 likes · 11 min read
Data Thinking Notes
Mar 8, 2023 · Fundamentals

How BI Portals Transform Enterprise Data Governance for Scalable Analytics

This whitepaper explains why effective BI governance is essential for modern enterprises, outlines the key capabilities of data‑governance tools—including data quality, certification, usage statistics, classification, lineage, glossary, and lifecycle management—and shows how BI portals and data catalogs together enable scalable, user‑centric analytics.

Analytics · BI governance · BI portal
0 likes · 12 min read
Kuaishou Big Data
Feb 14, 2023 · Big Data

How OAX Revolutionizes Open Analysis in Kuaishou’s Data Platform

This article introduces OAX (Open Analysis eXpressions), Kuaishou’s unified open‑analysis language, detailing its design background, guiding principles, five‑layer language model, syntax—including data types, compute capabilities and five analysis elements—its access protocol, runtime architecture, optimization steps, and the benefits it brings to the company’s big‑data analytics ecosystem.

Analytics · Data Platform · OAX
0 likes · 19 min read
DataFunSummit
Jan 30, 2023 · Big Data

Evolution of the Modern Data Stack: From Traditional Data Warehouses to BI+AI Self‑Service Analytics

This article traces the origins and evolution of the modern data stack, explains the shortcomings of traditional data warehouses, describes how cloud‑native ELT, modular components and analytics‑as‑software enable self‑service BI and AI, and outlines emerging trends such as DataOps, enhanced analytics and decision intelligence.

AI · Analytics · BI
0 likes · 15 min read
Data Thinking Notes
Jan 5, 2023 · Big Data

Why Data Lakes Are Outshining Traditional Data Warehouses: A Deep Dive

This comprehensive guide explains the evolution from traditional data warehouses to modern data lakes, detailing concepts, architectures, differences, implementation steps, and real‑world case studies, while also comparing major cloud providers' solutions and highlighting how data platforms support digital transformation and analytics.

Analytics · Big Data · Data Lake
0 likes · 97 min read
ITPUB
Jan 2, 2023 · Databases

Choosing the Right OLAP Engine: Druid vs ClickHouse and Optimization Tips

This article introduces OLAP concepts, compares major OLAP engines such as Druid, Kylin, Doris, and ClickHouse, outlines real‑world application scenarios, and provides detailed optimization techniques—including materialized views, caching, tiered storage, and skip‑index configurations—to improve query performance.

Analytics · ClickHouse · Data Warehouse
0 likes · 16 min read
DataFunTalk
Dec 26, 2022 · Product Management

How Product and Business Teams Should Participate in Building Data Metric Systems

The article explains how product and business teams should collaborate with data teams to build and promote data metric systems, emphasizing mutual empowerment, joint methodology, pilot testing, and scaling, while also announcing DataFun's 5‑year anniversary activities and upcoming big‑data and AI publications.

Analytics · Data Governance · business collaboration
0 likes · 3 min read
Architecture Digest
Dec 1, 2022 · Big Data

Understanding Data Warehouse Architecture and Layered Design

This article explains the concepts, architecture, and layered design of data warehouses, covering data flow, ETL processes, ODS, DWD, DWM, DWS, ADS layers, their characteristics, differences from databases, and the role of data marts in supporting OLAP and decision‑making.

Analytics · Big Data · Data Layers
0 likes · 13 min read
DataFunSummit
Nov 22, 2022 · Big Data

BI Platform Practice at Xiaomi: Evolution, Architecture, and Future Directions

This article details Xiaomi's multi‑year journey in building a group‑wide Business Intelligence platform, covering its historical evolution, technical challenges in performance, modeling, visualization and permissions, the current four‑layer architecture, and future plans to make the platform more business‑centric and simpler.

Analytics · BI · Big Data
0 likes · 15 min read
Java Captain
Oct 8, 2022 · Databases

Redefining JOIN in Business Intelligence: From Wide Tables to DQL

This article analyzes the limitations of traditional BI multi‑dimensional analysis that relies on wide tables and complex SQL JOINs, introduces a new DQL language that redefines JOIN operations into three plus one patterns, and demonstrates how DQL simplifies data modeling, reduces errors, and enables truly self‑service analytics.

Analytics · BI · DQL
0 likes · 17 min read
Tencent Cloud Developer
Sep 9, 2022 · Big Data

Data Lake, Data Warehouse, and Lakehouse: Concepts, Architectures, and Industry Practices

The article explains how data lakes excel at ingesting massive, varied data, data warehouses optimize storage and query performance, and lake‑house architectures combine both strengths—offering scalable, low‑cost storage with high‑speed analytics—highlighting industry solutions from Snowflake, Databricks, and major cloud providers.

Analytics · Big Data · Data Lake
0 likes · 8 min read
High Availability Architecture
Aug 15, 2022 · Big Data

Comprehensive Guide to Event Tracking Governance and the One‑Stop Tracking Management Platform

This article explains why event‑tracking (埋点) governance is essential, outlines the methodology and practice of full‑link tracking management, and introduces the one‑stop tracking platform with its innovative features such as standardized processes, verification tools, real‑time dashboards, cross‑platform data unification, and future roadmap.

Analytics · Big Data · Data Governance
0 likes · 15 min read
Architect's Guide
Aug 9, 2022 · Databases

Seven Key Aspects of Distributed Storage Systems: Replication, Storage Engine, Transactions, Analytics, Multi‑core, Compute, and Compilation

The article presents a comprehensive guide to distributed storage, organizing its design and implementation into seven essential dimensions—replication, storage engine, transaction processing, analytical query execution, multi‑core scaling, compute engine architecture, and compilation techniques—each explained with core concepts, challenges, and practical considerations.

Analytics · Database Architecture · Transactions
0 likes · 13 min read
Python Crawling & Data Mining
Aug 6, 2022 · Operations

Why Operations Data Quality Is the Key to Successful Digital Transformation

In the era of big data, poor operations data quality undermines analytics, decision‑making and digital transformation, so organizations must adopt a three‑dimensional governance approach—covering organization, processes and technology—to ensure completeness, consistency, accuracy, uniqueness, relevance and timeliness of their operational data.

Analytics · Data Governance · Data Quality
0 likes · 17 min read
Programmer DD
Jul 28, 2022 · Databases

Why MongoDB Is Adding Native Analytics and What It Means for Developers

MongoDB is evolving from a purely operational document store to a hybrid system that embeds native analytics, cloud‑native features, and SQL access, aiming to boost developer productivity, support real‑time insights, and complement rather than replace traditional data warehouses.

Analytics · Data Lake · MongoDB
0 likes · 12 min read
dbaplus Community
Jul 22, 2022 · Databases

Why MongoDB Is Adding Native Analytics and What It Means for Developers

The article examines MongoDB’s evolution toward built‑in analytics, detailing new features like native search, time‑series support, change streams, Atlas analytics nodes, and the upcoming Atlas SQL interface, while arguing that these capabilities aim to empower developers rather than replace dedicated data‑warehouse solutions.

Analytics · Atlas · HTAP
0 likes · 10 min read

Comprehensive Overview of Tracking System, Data Warehouse Construction, and Attribution in an E‑commerce Platform

The article presents a comprehensive end‑to‑end traffic data architecture for an e‑commerce platform, detailing hybrid frontend/backend tracking with SPM/SCM/action standards, data‑warehouse construction of fact and dimension tables, UUID i_code unification, real‑time attribution methods, and future automation of warehouse and model layers.

Analytics · Data Tracking · Data Warehouse
0 likes · 13 min read
DataFunTalk
Jul 10, 2022 · Big Data

Serverless Technologies Empowering Big Data Analytics: An Overview of Amazon EMR Serverless

This article presents a comprehensive overview of how Amazon EMR Serverless leverages serverless technology to simplify, scale, and cost‑optimize big data analytics, covering the evolution of serverless services, the intelligent lakehouse architecture, core concepts, key benefits, common use cases, and available documentation.

Amazon EMR · Analytics · Big Data
0 likes · 17 min read
LOFTER Tech Team
Jun 2, 2022 · Fundamentals

Understanding Event Tracking (埋点): Purpose, Types, and Implementation Methods

This article explains the concept of event tracking (埋点) in applications, detailing its purposes such as user conversion analysis and security monitoring, classifying manual, visual, and full tracking methods, and discussing their advantages, disadvantages, and how to choose the appropriate approach.

Analytics · event tracking · full tracking
0 likes · 6 min read
DataFunSummit
May 29, 2022 · Big Data

OPPO Commercial Data System Construction Practice: Platform, Ingestion, Development, Governance, and Analytics

This article presents OPPO's commercial data system construction practice, covering the data platform strategy, ingestion pipelines, development efficiency toolkits, data validation, visualization aids, UDF principles, warehouse architecture, metric systems, dimensional modeling, ETL optimization, governance metadata, quality management, monitoring, attribution services, analytics reporting, and a Q&A session.

Analytics · Data Platform · data engineering
0 likes · 17 min read
21CTO
May 11, 2022 · Artificial Intelligence

How SAS Now Fully Supports Python for Data Science and AI

SAS announced native Python support in its analytics platform, offering code examples, cloud‑native AI capabilities, and seamless integration with SAS Studio and other SAS products to empower data scientists to develop and deploy Python models alongside SAS code.

AI · Analytics · Cloud Native
0 likes · 5 min read
StarRocks
May 7, 2022 · Databases

How 360 Built a Lightning‑Fast Unified Analytics Platform with StarRocks

Facing massive data storage and query challenges, 360 upgraded its analytics architecture by adopting StarRocks, achieving multi‑dimensional, high‑concurrency analysis, simplified data pipelines, and significant performance and cost improvements across its radar and user‑portrait platforms.

Analytics · Big Data · OLAP
0 likes · 10 min read
Top Architect
Mar 28, 2022 · Databases

Key Aspects of Distributed Storage Systems: Replication, Engines, Transactions, Analytics, Multi‑Core, Computation, and Compilation

This article provides a comprehensive overview of distributed storage, covering seven core aspects such as replication, storage engines, transaction processing, analytical query execution, multi‑core scalability, computation models, and compilation techniques, while also highlighting practical challenges and design considerations for modern database systems.

Analytics · Compilation · Storage Engine
0 likes · 13 min read
ByteDance ADFE Team
Feb 21, 2022 · Frontend Development

Event Collector: A Markup‑Based Web Event Tracking Solution

The article introduces Event Collector, a markup‑based web event tracking framework that replaces imperative tracking with data attributes or React components, explains its technical implementation, management tools, monorepo organization, release workflow, and React coding conventions to improve development efficiency and data analysis.

Analytics · React · Web Development
0 likes · 7 min read