Tagged articles
89 articles
AI Illustrated Series
Apr 25, 2026 · Artificial Intelligence

From "Can Talk" to "Can Act": Deep Dive into Function Calling for AI Agents

The article explains how Function Calling enables large language model agents to overcome knowledge staleness and hallucination by invoking external tools—such as search, email, code execution, and databases—to fetch real‑time data, perform actions, and deliver verifiable, multi‑step responses.

AI agents, Function Calling, LLM
0 likes · 25 min read
Big Data Tech Team
Apr 20, 2026 · Artificial Intelligence

How AI is Redefining Data Workflows: 4 Game‑Changing Paradigms Explained

The article outlines four AI‑driven breakthroughs reshaping data work—AI‑for‑Data automation, generative‑AI‑enhanced governance, NoETL real‑time lake ingestion, and next‑generation SQL analysis—detailing their problems, concrete case studies, implementation steps, pitfalls, and measurable efficiency gains.

AI for Data, Data Governance, NoETL
0 likes · 12 min read
Geek Labs
Apr 7, 2026 · Artificial Intelligence

10 AI Stock‑Trading Skills Perfectly Suited for Chinese Investors

Amid the AI boom in the stock market, many retail investors turn to AI tools, but most are paid plugins or designed for U.S. markets; this article curates ten AI‑driven skills that cover real‑time data, technical analysis, quantitative backtesting, monitoring, and risk alerts specifically for Chinese investors.

AI, Chinese investors, quantitative analysis
0 likes · 11 min read
DataFunSummit
Mar 31, 2026 · Industry Insights

How SelectDB Overcomes the ‘Impossible Triangle’ in Real‑Time Automotive Data

The whitepaper explains how the explosive growth, multimodal nature, and real‑time collaboration demands of intelligent connected‑vehicle data create two “impossible triangles,” and how SelectDB’s three technical innovations—Index+Bitmap primary keys, Variant sparse columns, and hybrid full‑text/vector search—enable cost‑effective, high‑performance real‑time analytics across five automotive scenarios with proven case studies from leading OEMs.

Database Innovation, SelectDB, automotive
0 likes · 17 min read
StarRocks
Jan 7, 2026 · Big Data

How Gaode Maps Built a Real‑Time Lakehouse for Billion‑Scale Trajectory Data

This article details Gaode Maps' end‑to‑end lakehouse solution for handling high‑frequency, high‑volume trajectory data, covering the challenges of real‑time visibility, multi‑scenario queries, storage cost, and data silos, and describing the layered storage architecture, performance validation, and future expansion plans.

Apache Flink, Data Tiering, Lakehouse
0 likes · 21 min read
Alibaba Cloud Big Data AI Platform
Oct 24, 2025 · Big Data

How Leapmotor Scaled to 1M Cars with a Real‑Time Flink Data Platform

Leapmotor’s rapid growth to one million production cars drove a shift from daily batch data to minute‑level real‑time analytics, prompting the adoption of Flink as the core engine of a multi‑layered big‑data platform that handles massive IoT signals, supports fault diagnosis, and integrates batch and streaming workloads on the cloud.

Big Data, Data Platform, Flink
0 likes · 13 min read
Code Wrench
Oct 5, 2025 · Backend Development

Build a Real‑Time Go Stock Analyzer with WebSocket & Vue Frontend

This article details version 3.0 of a Go‑based stock analysis system that adds sector classification, persistent watchlists, scheduled strategy evaluation, real‑time WebSocket market feeds, K‑line charting with technical indicators, a Vue front‑end, deployment steps, future roadmap, and a public GitHub repository.

Go, Vue, WebSocket
0 likes · 7 min read
High Availability Architecture
Sep 10, 2025 · Big Data

How Ctrip Business Travel Built a Near‑Real‑Time Lakehouse with Flink CDC & Paimon

This article details Ctrip Business Travel’s implementation of a near‑real‑time data warehouse using Flink CDC and the Paimon lakehouse engine, covering order wide‑table construction, ticket refund alerts, ad attribution, batch‑stream integration, and practical lessons on Partial Update, Aggregation, and Tag‑based incremental processing.

Batch-Stream Integration, Flink
0 likes · 17 min read
Alibaba Cloud Big Data AI Platform
Aug 29, 2025 · Big Data

How MaxCompute Streaming Insert Revolutionized Real‑Time Data Migration from BigQuery

This article details how a leading Southeast Asian tech group migrated its real‑time write workloads from Google BigQuery to MaxCompute using MaxCompute Streaming Insert, covering architecture, core features, migration challenges, optimization strategies, business impact, and future enhancements.

Big Data, BigQuery Migration, MaxCompute
0 likes · 9 min read
JD Retail Technology
Jun 10, 2025 · Artificial Intelligence

How JD Builds a Scalable AI‑Powered Recommendation Data System with Flink

This article explains JD's complex recommendation system data pipeline—from indexing, sampling, and feature engineering to explainability and real‑time metrics—highlighting challenges such as data consistency, latency, and the use of Flink for massive, low‑latency processing.

Flink, explainability, feature engineering
0 likes · 23 min read
DataFunSummit
Apr 8, 2025 · Big Data

Huolala’s Real‑Time Data Synchronization with Flink CDC: Architecture, Practices, and Future Outlook

This article presents Huolala’s end‑to‑end implementation of Flink CDC for real‑time data capture, detailing the business background, reasons for selecting Flink CDC over Canal, component comparisons, production‑level platform enhancements, data‑lake integration, validation methods, and future directions for unified data ingestion.

Flink CDC, data synchronization, real-time data
0 likes · 13 min read
Airbnb Technology Team
Jan 24, 2025 · Artificial Intelligence

Chronon — An Open-Source Framework for Production-Level Feature Engineering in Machine Learning

Chronon is an open‑source framework that centralizes feature definitions to guarantee training‑inference consistency, eliminates complex ETL pipelines, and supports real‑time and batch processing across diverse data sources, cutting feature‑development cycles from months to under a week, as demonstrated by Airbnb’s 40,000‑feature deployment.

Chronon, Spark, feature engineering
0 likes · 10 min read
Rare Earth Juejin Tech Community
Nov 30, 2024 · Frontend Development

Common Data Request Methods for Large Screens and Their Implementation with SSE and WebSocket

This article compares HTTP polling, WebSocket, and Server‑Sent Events (SSE) for large‑screen data fetching, explains their advantages and drawbacks, outlines suitable business scenarios, and provides complete front‑end and back‑end code examples for implementing SSE and WebSocket connections.

HTTP polling, Node.js, SSE
0 likes · 7 min read
Huolala Tech
Nov 7, 2024 · Big Data

How HuoLaLa Scaled Real‑Time Data Capture with Flink CDC: Architecture, Challenges, and Results

This article details HuoLaLa's logistics platform challenges with petabyte‑scale data, the selection of Apache Flink CDC for stable, compatible, and low‑latency data ingestion, the construction of a multi‑layer CDC capability, migration strategies, measurable performance gains, and future open‑source contributions.

Apache Flink, Flink CDC, data ingestion
0 likes · 15 min read
DataFunSummit
Aug 7, 2024 · Big Data

Ant Group Real-Time Data Warehouse: Architecture, Solutions, and Data Lake Outlook

This article presents Ant Group's recent explorations and practices in real-time data warehousing, detailing its architecture, data quality assurance, stream‑batch integration, and future data lake implementation, while highlighting the use of Flink, ODPS, and Paimon for scalable, low‑latency analytics.

Data Quality, Flink, real-time data
0 likes · 15 min read
Zhuanzhuan Tech
Jul 2, 2024 · Mobile Development

Evolution and Design of the Lego Logging System for Mobile Applications

This article describes the four-stage evolution of the Lego client‑side logging system—covering its initial zero‑to‑one architecture, the separation of business and technical logs, real‑time reporting improvements, and the latest architecture redesign that boosts performance, reduces overhead, and provides a safe migration path.

architecture, migration strategy, mobile logging
0 likes · 14 min read
Big Data Technology & Architecture
Jun 24, 2024 · Big Data

How to Address Data Inconsistency and Validation Challenges Between Data and Algorithm Teams

This article discusses practical strategies for data and algorithm teams to handle real‑time data inconsistencies, validation difficulties, and communication gaps by emphasizing clear scope definition, realistic technical assessments, proactive risk identification, and the importance of specialized, well‑qualified talent.

algorithm collaboration, real-time data
0 likes · 6 min read
DataFunSummit
Jun 7, 2024 · Artificial Intelligence

Understanding Feature Engineering for Risk Control Systems and Building an Easy-to-Use Feature Platform

Feature engineering, the process of creating input variables for machine learning models, is crucial for banking risk control; this article explains the concepts of features, variables, and metrics, outlines challenges in real‑time feature pipelines, and proposes a practical architecture and best practices for building an efficient, low‑code feature platform.

feature engineering, machine learning, platform design
0 likes · 10 min read
DataFunTalk
Jun 1, 2024 · Big Data

Ant Group's Real-Time Data Warehouse Architecture, Solutions, and Data Lake Outlook

This article presents Ant Group's recent explorations and practices in real-time data warehousing, covering the system architecture, streaming data quality assurance, flow‑batch integrated applications, and future data lake integration, while sharing technical details and operational insights for large‑scale data processing.

Flink, data-warehouse, real-time data
0 likes · 16 min read
Alibaba Cloud Big Data AI Platform
Feb 20, 2024 · Big Data

Feishu ShenNuo's Real-Time Data Warehouse with Flink, Hudi, and Hologres

Feishu ShenNuo redesigned its data architecture by integrating Flink, Hudi, and Hologres to create a cloud‑native real‑time data warehouse that supports both millisecond‑level ad monitoring and minute‑level game operations, offering scalable storage, low‑latency queries, and comprehensive monitoring and capacity planning.

Flink, Hologres, Hudi
0 likes · 16 min read
DataFunSummit
Jan 30, 2024 · Big Data

CVTE’s Journey of Stream Computing Adoption: Architecture, Applications, and Lessons Learned

This article details CVTE’s adoption of stream computing, describing the company background, the challenges of traditional data pipelines, the design of a CDC‑Kafka integration platform, evaluations of PipelineDB, ksqlDB, Materialize and RisingWave, and the overall impact on real‑time analytics and operational efficiency.

CVTE, RisingWave, real-time data
0 likes · 9 min read
DataFunTalk
Dec 27, 2023 · Big Data

Amoro Mixed Hive: A Unified Lakehouse Solution for Real‑Time and Batch Data Processing

This article describes how NetEase Youdao replaced its Doris‑based real‑time data warehouse with Amoro Mixed Hive, detailing the architectural challenges, the Mixed Hive design, implementation steps, performance optimizations, community contributions, and future roadmap to achieve a unified lakehouse with minute‑level freshness and reduced development and operational costs.

Amoro, Big Data, Flink
0 likes · 12 min read
DataFunTalk
Nov 11, 2023 · Big Data

Streaming Graph Processing in Ant Group: Real-Time Data Architecture and Applications

This article presents Ant Group's comprehensive real-time data framework and streaming graph processing engine, detailing its architecture, unified batch‑stream capabilities, and practical applications such as traffic attribution, real‑time OLAP, and user‑behavior intent analysis, while outlining future directions.

Big Data, Graph Processing, OLAP
0 likes · 15 min read
JD Tech
Sep 11, 2023 · Big Data

Construction and High-Fidelity Load Testing of Real-Time Data Dual-Stream

This article explains how to build a dual‑stream real‑time data pipeline for big‑data applications, defines construction standards, and details a three‑step high‑fidelity load‑testing process that ensures stability and high availability during peak promotional periods.

Load Testing, dual-stream, high availability
0 likes · 10 min read
Didi Tech
Jun 14, 2023 · Big Data

Real-Time Data Development Practices and Component Selection at Didi

Didi’s unified real‑time data stack outlines best‑practice component choices for four key scenarios—metric monitoring, BI analysis, online services, and feature/tag systems—detailing pipelines from source to sink, resource‑usage guidelines, and a one‑stop development platform to build stable, high‑performance streaming solutions.

Druid, Flink, Kafka
0 likes · 17 min read
WeChat Backend Team
May 17, 2023 · Big Data

Boosting Real-Time Recommendations: Apache Pulsar Optimizations at WeChat

This article details how WeChat's Gemini‑2.0 big‑data platform leverages Apache Pulsar, outlining cloud‑native advantages, load‑balancing refinements, cache and SSD tuning, high‑availability safeguards, and cost‑saving strategies that together enable large‑scale, real‑time, deep‑learning recommendation workloads.

Apache Pulsar, Big Data, Cloud Native
0 likes · 17 min read
dbaplus Community
May 9, 2023 · Big Data

How a Bank Built a Near‑Real‑Time Data Platform with Kafka, Flink & Hudi

An in‑depth case study of a Chinese bank’s near‑real‑time data platform reveals its evolution from a monolithic CDC pipeline to a split architecture featuring a real‑time data lake and a data‑service bus, detailing component choices, schema‑registry integration, SDK development, observability, and future roadmap.

Big Data Architecture, Data Lake, Flink
0 likes · 18 min read
DataFunTalk
Mar 9, 2023 · Big Data

Real‑Time Data Platform Architecture and Cloud‑Native Flink Migration at Manbang

This article presents a comprehensive case study of Manbang's real‑time data platform, detailing its business background, cloud‑native Flink + Hologres architecture, migration from self‑built clusters, real‑time product features, decision‑making workflows, and future roadmap, highlighting performance and cost benefits.

Flink, Logistics, Streaming
0 likes · 16 min read
Alibaba Cloud Big Data AI Platform
Mar 1, 2023 · Big Data

How We Built a Scalable Real‑Time Data Architecture for a Complex Supply Chain

This article describes the challenges of a highly complex supply‑chain system, the evolution from early MySQL‑based reporting to a modern real‑time data platform using Flink, Kafka, ClickHouse, Hologres and other cloud services, and the tools and lessons learned to achieve low‑latency, high‑throughput analytics.

Flink, Kafka, Streaming
0 likes · 11 min read
DeWu Technology
Feb 24, 2023 · Big Data

Real-Time Data Architecture Evolution for a Complex Supply Chain

The article traces Dewu’s supply‑chain data platform from slow MySQL reporting through early CDC‑based wide tables to a Flink‑Kafka‑ClickHouse 1.0 design, then to a more scalable Flink‑Kafka‑Hologres 2.0 architecture that solves upsert and compute‑storage separation, while detailing key operational tricks, code‑generation tools, and future plans for lake‑house integration.

Big Data, Flink, Hologres
0 likes · 10 min read
DeWu Technology
Feb 21, 2023 · Backend Development

Design and Implementation of a Traffic Control Platform for E-commerce Search and Recommendation

The article describes a modular traffic‑control platform for e‑commerce search and recommendation that lets operators quickly adjust strategies for emergencies, cold‑start items, and experiments, replacing costly multi‑team development with a unified operation center, service center, data hub, algorithmic PID controller, real‑time metrics, independent recall chain, and cross‑scene AB testing, while outlining future extensions.

AB testing, PID controller, platform architecture
0 likes · 16 min read
DataFunSummit
Feb 16, 2023 · Big Data

JD Real-Time Data Product Practice: Overview, Low‑Code Platform, Stream‑Batch Integration, and Operations

This article summarizes JD's real‑time data product practice, covering product overview, low‑code real‑time platform construction, stream‑batch integrated architecture, and the three‑layer operational defense model, while highlighting challenges, evolution, user distribution, and future directions.

Big Data, Low‑code platform, real-time data
0 likes · 13 min read
ITPUB
Feb 7, 2023 · Big Data

How Kuaigou Built a Scalable Real‑Time Data Warehouse with Spark, Flink, and Cloud

Facing massive, multi‑source traffic and the need for instant analytics, Kuaigou’s real‑time data warehouse evolved from Spark on‑premise to a cloud‑native stack using Alibaba Blink, Flink, and layered OLAP models, streamlining development, cutting costs, and enabling diverse real‑time applications.

Flink, OLAP, Spark
0 likes · 11 min read
DataFunSummit
Jan 24, 2023 · Big Data

Building a Real-Time Data and User Profiling Architecture with Apache Doris at Zhihu

The article details Zhihu's data empowerment team's design and implementation of a low‑cost, high‑response real‑time data platform built on Apache Doris, covering real‑time business metrics, algorithm features, and user profiling, and explains the challenges, architectural choices, tooling, performance gains, and future directions.

Apache Doris, Data Integration, Data Quality
0 likes · 22 min read
Alibaba Cloud Big Data AI Platform
Dec 30, 2022 · Big Data

How Manbang Built a Cloud‑Native Real‑Time Data Platform with Flink & Hologres

Manbang's logistics platform leverages a cloud‑native architecture built on Alibaba Cloud Flink and Hologres to deliver minute‑level real‑time data, feature computation, and decision‑making that dramatically improves SLA, reduces operational costs, and powers intelligent driver‑cargo matching across the ecosystem.

Flink, Hologres, Logistics
0 likes · 16 min read
Xianyu Technology
Dec 21, 2022 · Artificial Intelligence

Xianyu Recommendation System: Architecture, Challenges, and Deployment

The Xianyu recommendation system, built by backend expert Wan Xiaoyong, evolved from offline scoring to a full‑graph, serverless recall‑ranking pipeline that tackles C2C uncertainties through centralized feature engineering, model compression, staged deployment, flexible experimentation, robust governance, and plans for automated attribution and interpretability.

AI, Big Data, Model Deployment
0 likes · 10 min read
ITPUB
Dec 18, 2022 · Big Data

How to Build a Real‑Time Data Warehouse with EasyData: A Step‑by‑Step Guide

Learn how to design and implement a real‑time data warehouse for an app’s AB‑test monitoring using EasyData, covering data flow layers, CDC task creation, stream table registration, Flink SQL processing, and BI reporting, with detailed steps, code snippets, and practical tips.

CDC, EasyData, Flink
0 likes · 13 min read
AsiaInfo Technology: New Tech Exploration
Nov 11, 2022 · Industry Insights

How Real-Time Data Middle Platforms are Transforming the Telecom Industry

This article analyzes why telecom operators need a real‑time data middle platform, outlines its layered architecture and model design, examines the shift from Lambda to Kappa and lakehouse approaches, and highlights how these innovations enable faster, scenario‑driven insights and competitive advantage.

Big Data Architecture, Data Middle Platform, Flink
0 likes · 15 min read
ITPUB
Aug 8, 2022 · Big Data

Why Real‑Time Data Warehouses Are the New Competitive Edge for Enterprises

As markets become increasingly dynamic, companies must build real‑time infrastructure to gain timely insights, and this article explains the three real‑time analytics scenarios, the limitations of traditional stream engines, and how Skylab’s integrated cloud‑native platform and Omega architecture address those challenges.

cloud-native, real-time data, stream processing
0 likes · 9 min read
dbaplus Community
Jul 6, 2022 · Big Data

Building Real‑Time User Profiles at Zhihu with Apache Doris: A Practical Guide

Zhihu's data‑empowerment team designed a low‑cost, high‑response real‑time data architecture on Apache Doris that powers business analytics, algorithm features, and user profiling, dramatically improving timeliness, reducing targeting costs, and boosting key performance metrics across multiple services.

Apache Doris, performance optimization, real-time data
0 likes · 23 min read
IT Architects Alliance
Jun 5, 2022 · Big Data

Real-Time Data and User Profiling Practices at Zhihu: Architecture, Challenges, and Solutions

This article presents a comprehensive case study of Zhihu's data empowerment team, detailing the design of a real‑time data platform and user profiling system, the challenges faced in scalability, latency, and data quality, and the practical solutions and architectural choices implemented to drive business value.

Data Quality, Lambda architecture, data pipeline
0 likes · 22 min read

Dynamic Page Floor Sorting for Intelligent Marketing in NetEase Yanxuan

NetEase Yanxuan’s Olympus platform introduces dynamic page‑floor sorting that automatically reorders product modules in real time using a multi‑armed‑bandit algorithm, delivering faster, more accurate and stable personalized marketing, improving exposure efficiency, ROI and handling peak traffic with sub‑40 ms rendering.

Marketing Automation, algorithm, dynamic sorting
0 likes · 10 min read
Alipay Experience Technology
May 17, 2022 · Mobile Development

How Ant Group Built an Ultra‑Real‑Time Client Feature Center for Smarter AI

This article examines the challenges of traditional data feature acquisition and presents Ant Group’s ultra‑real‑time client feature center, detailing its architecture, data collection, streaming and script computation, backflow mechanisms, and monitoring to deliver rich, timely, and easy‑to‑use features for AI models.

Event-driven, Python VM, client-side features
0 likes · 11 min read
Architects Research Society
May 16, 2022 · Big Data

The Four Phases of Netflix’s Trillion‑Scale Real‑Time Data Infrastructure

This article chronicles Netflix’s evolution from a failing batch pipeline to a cloud‑native, multi‑tenant streaming platform across four phases, detailing the motivations, challenges, strategic bets, and patterns that enabled the company to scale real‑time data processing to trillions of events per day.

Netflix, cloud-native, data infrastructure
0 likes · 31 min read
dbaplus Community
Apr 13, 2022 · Big Data

How Meituan Built a Scalable Real‑Time Data Warehouse with Flink

This article explains Meituan's real‑time data warehouse architecture, covering typical business scenarios, the evolution of its streaming platform, key design challenges, solutions such as unified data models, SQL‑based development, UDF hosting, operator optimizations, and future plans for incremental processing and unified batch‑stream semantics.

Flink, Meituan, real-time data
0 likes · 18 min read
DataFunSummit
Jan 30, 2022 · Big Data

Real‑time Data Warehouse at Meituan: Architecture, Challenges, and Solutions

This article presents Meituan's real‑time data warehouse platform, describing typical streaming use cases, the evolution of its architecture from Storm and Spark Streaming to Flink, the challenges of development, operations and data quality, and the engineering solutions—including unified SQL, web IDE, UDF hosting, pipeline testing, and operator performance optimizations—implemented to support large‑scale, low‑latency analytics.

Flink, platform architecture, real-time data
0 likes · 17 min read
DataFunTalk
Jan 28, 2022 · Big Data

Real-Time Customer Data Platform (RT‑CDP) Architecture and Implementation at iFanFan

This article explains the concept, challenges, and key business goals of a real‑time Customer Data Platform, details the technology stack selection—including Nebula Graph, Apache Flink, Apache Beam, Kudu, and Doris—and describes the modular architecture, data model, identity service, streaming computation, storage layers, rule engine, operational results, and future directions.

Big Data, CDP, Data Integration
0 likes · 43 min read
DataFunTalk
Jan 10, 2022 · Big Data

Real‑Time Data Warehouse at Meituan: Architecture, Challenges, and Solutions

The talk by Tang Chuxi of Meituan explains typical real‑time data scenarios, the challenges faced when building a streaming data warehouse, and the design, development, operation, and performance‑optimisation solutions implemented on a Flink‑based platform to support massive, low‑latency business applications.

Flink, Meituan, data-warehouse
0 likes · 17 min read
Architecture & Thinking
Nov 2, 2021 · Backend Development

How to Transform a T+1 Dashboard into Real‑Time T+0 with MQ and MongoDB

This article explains how a user‑behavior data dashboard originally built on a daily T+1 batch process was redesigned to achieve real‑time T+0 updates by introducing message‑queue notifications, a dedicated aggregation service, and MongoDB storage, improving data freshness and user experience.

Backend Architecture, MongoDB, data aggregation
0 likes · 5 min read
21CTO
Oct 6, 2021 · Big Data

Building a Real-Time TB-Scale Bill Query System with Kafka, Kudu, and Presto

This article details the design and implementation of a real‑time, TB‑scale bill‑detail query platform that leverages Kafka for streaming, Debezium and Confluent Platform for change capture, Kudu for low‑latency storage, and Presto/Kylin for fast OLAP queries, while outlining deployment, integration, and future enhancements.

Kafka, Kudu, Presto
0 likes · 19 min read
Architect
Oct 6, 2021 · Big Data

Design and Implementation of a Real-time and Offline Integrated Query System

This article details the requirements, architecture, and implementation of a real-time and offline integrated query system, covering data ingestion via Debezium and Confluent Platform, storage in Kudu and HDFS, query engines Presto and Kylin, and strategies for data synchronization, partitioning, and scaling.

Big Data, Debezium, Kafka
0 likes · 19 min read
Tencent Cloud Developer
Aug 17, 2021 · Big Data

Elasticsearch Technical Event in Shenzhen

The Shenzhen Elasticsearch technical event, co‑hosted by the Elastic Chinese community and Tencent Cloud, presented practical sessions on optimizing the Elastic Stack for search, real‑time analytics, logging, security and APM, featuring compression encoding, MongoDB fusion, ByteDance extensions, cost‑effective log storage, Lucene indexing, cross‑cluster replication, vector engine integration, and large‑scale case studies from Tencent, Tiptop Data and vivo.

Elasticsearch, MongoDB, Vector Computing
0 likes · 4 min read
Laravel Tech Community
Apr 22, 2021 · Big Data

Apache Kafka 2.8.0 Release Highlights and New Features

Apache Kafka 2.8.0 introduces several significant enhancements, including a new group API, mutual TLS authentication for SASL_SSL listeners, JSON request/response logging, broker connection rate limiting, topic identifiers, self‑managed quorum replacing ZooKeeper, and numerous improvements to Streams and Connect APIs for more reliable real‑time data pipelines.

Apache Kafka · Big Data · Distributed Systems
0 likes · 2 min read
DataFunTalk
Apr 3, 2021 · Big Data

Building a Real-Time Data Computing Platform for Tencent Games: Practices and Architecture

This article describes Tencent Games' end‑to‑end real‑time data platform, covering its construction background, the unified OneData development framework, the OneFun data‑service API layer, micro‑service and ServiceMesh management, and the operational benefits achieved through automation, standardization, and scalability.

Flink · Game Analytics · Microservices
0 likes · 14 min read
Big Data Technology & Architecture
Feb 24, 2021 · Big Data

Gairos: Uber’s Real‑Time Data Processing, Storage and Query Platform and Its Scalability Optimizations

The article describes Uber’s Gairos platform—a unified real‑time data ingestion, storage, and query system built on Kafka, Elasticsearch and RT‑Gairos—detailing its architecture, Uber use cases such as dynamic pricing, the scalability and reliability challenges faced, and the suite of optimization techniques (sharding, query routing, caching, index merging, template tuning, and data‑pruning) applied to achieve low latency and high throughput.

Elasticsearch · Uber · real-time data
0 likes · 26 min read
DataFunTalk
Jan 11, 2021 · Big Data

Design and Scaling of Meituan Delivery Real‑Time Feature Platform

This article details how Meituan built a minute‑level, high‑throughput real‑time feature platform for its delivery business, covering the business model, six‑layer architecture, data processing challenges, stability measures, scaling achievements, and future roadmap to support millions of orders per minute with sub‑50 ms latency.

Meituan · System Architecture · delivery optimization
0 likes · 14 min read
Youzan Coder
Dec 18, 2020 · Big Data

Design and Implementation of a Configurable Real-Time Rule Engine for Live‑Streaming Product Audits

The paper presents a configurable real‑time rule engine for live‑streaming product audits that decouples data aggregation from rule execution, uses QLExpress for dynamic conditions, supports Dubbo and HTTP sources, and enables safe gray‑release updates, cutting the rule‑change cycle from weeks to near‑real‑time.

Big Data · Configuration · QLExpress
0 likes · 8 min read
Huawei Cloud Developer Alliance
Nov 26, 2020 · Backend Development

How to Crawl Real-Time Data with Python WebSocket: A Step‑by‑Step Guide

This article explains how crawler engineers can fetch real‑time data such as sports scores, stock quotes, or cryptocurrency prices by comparing polling and WebSocket approaches, introducing the aiowebsocket library, and providing complete Python code to perform handshake, subscription, and continuous data streaming.

Python · Web Scraping · WebSocket
0 likes · 10 min read
Big Data Technology & Architecture
Sep 12, 2020 · Big Data

Designing a Real‑time Data Platform for Modern Data Warehouses

This article explores the evolution from traditional to modern data warehouses, outlines the key capabilities of real‑time data platforms (real‑time data, virtualization, democratization, and collaboration), and presents a comprehensive architecture design with unified collection, streaming, compute, and visualization layers, while discussing functional, quality, stability, cost, agility, and management considerations.

architecture · data virtualization · real-time data
0 likes · 18 min read
Alibaba Cloud Developer
Sep 7, 2020 · Big Data

How Alibaba’s ADC Project Automates Real‑Time SQL Generation with Design Patterns and Priority Queues

This article explains how the Alibaba DChain Data Converger (ADC) automatically creates wide‑table SQL for real‑time cross‑database analytics by using a pipeline architecture, priority‑queue‑driven task scheduling, and specific design patterns to handle metadata, joins, and resource management.

Big Data · SQL generation · priority-queue
0 likes · 13 min read
dbaplus Community
Jun 18, 2020 · Databases

How a Hybrid Data Warehouse Transformed Banking Data Services

This article details the 2015 hybrid data‑warehouse design implemented at Guangdong Huaxing Bank, explaining its real‑time, historical, and archival layers, the data‑bus concept, and how mixing in‑memory, relational, and Hadoop technologies addressed modern banking data‑volume, latency, and unstructured‑data challenges.

Banking · Big Data · Hadoop
0 likes · 20 min read
Big Data Technology & Architecture
May 18, 2020 · Big Data

Real‑time Data Platform (RTDP): Concepts, Architecture and Design Considerations

This article examines the design of a real‑time data platform, discussing its background concepts, modern data‑warehouse perspective, architectural layers, unified data‑collection, streaming, compute and visualization platforms, and the functional, quality, stability, cost and agility considerations required for building an end‑to‑end real‑time pipeline.

Data Democratization · Data Platform · architecture
0 likes · 17 min read
Python Programming Learning Circle
Jan 6, 2020 · Operations

Why 12306 Ticket System Crashes During Rush: Inside Its Massive Operational Challenges

The 12306 railway ticket platform faces extreme operational pressure because it must synchronize real‑time sales across online and offline channels, handle countless route and time combinations without buffering, manage complex inventory updates, enforce purchase limits, and process massive concurrent queries, making its load far greater than typical e‑commerce systems.

high concurrency · inventory management · real-time data
0 likes · 7 min read
Tencent Cloud Developer
Sep 26, 2019 · Cloud Computing

How Cloud Development Transforms Mini‑Program Architecture: From Serverless Basics to Real‑Time AI Services

The article reviews the September 21 cloud‑development salon, detailing how Mini‑Program cloud services simplify backend setup, introduce serverless cloud calls, real‑time database push, value‑added functions and network interconnect, and showcases practical use cases such as game data streaming, AI chatbots, SCRM construction and movie‑ticketing operations.

AI chatbot · Mini Program · SCRM
0 likes · 17 min read
Beike Product & Technology
Feb 21, 2019 · Big Data

DATABUS Data Integration Platform: Architecture, Capabilities, and TiDB Ecosystem

The article presents an in‑depth overview of the DATABUS data integration platform, detailing its background, current challenges, core capabilities such as data syncing, metadata automation, real‑time subscriptions, and its reliance on TiDB, TiSpark, Hudi, and related big‑data technologies to enable near‑real‑time data warehousing.

Big Data · Data Integration · Hudi
0 likes · 13 min read
DataFunTalk
Nov 7, 2018 · Artificial Intelligence

Evolution of Ele.me Recommendation Algorithms and Online Learning Practice

This article outlines the background of Ele.me's recommendation business, details the evolution of its recommendation algorithms from rule‑based models to deep learning and online learning, and explains the practical implementation of real‑time data pipelines, feature engineering, model training, and deployment.

Ele.me · Online Learning · machine learning
0 likes · 13 min read
21CTO
Sep 14, 2018 · Backend Development

How Message Queues Enable Near Real‑Time Incremental Indexing in Search Engines

This article examines the high‑real‑time requirements of incremental data ingestion for search engines, compares three update schemes, and details how adopting a Kafka subscription‑based message‑queue approach dramatically improves latency and flexibility for the Nuomi search framework.

Kafka · Message Queue · incremental indexing
0 likes · 8 min read
Meituan Technology Team
Jan 26, 2018 · Big Data

Design and Implementation of a Real-Time Data Processing System at Meituan

Meituan designed a Storm‑based real‑time data processing platform that guarantees at‑least‑once delivery and high availability, employs a custom spout, regression‑driven traffic smoothing, and a low‑latency KV store with atomic operations, persisting results in Kafka, MySQL and Cellar to power merchant dashboards and heat‑tag analytics, while planning broader real‑time analytics expansion.

Big Data · Distributed Systems · Storm
0 likes · 10 min read
21CTO
Jul 12, 2017 · Fundamentals

Why Logs Are the Hidden Backbone of Distributed Systems and Real‑Time Data

This note distills Jay Kreps' extensive blog on logs, explaining their core role in distributed databases, real‑time data pipelines, replication, and state‑machine consistency, and showing how logs unify concepts from version control to streaming architectures.

data replication · logs · real-time data
0 likes · 12 min read
GF Securities FinTech
Sep 14, 2016 · Big Data

Scaling Real-Time Stock Market Data with Redis, Lua, and Go Goroutines

Exploring how a securities firm processes billions of daily stock‑market indicators in real time, this article compares an in‑process Redis + Lua solution with an out‑of‑process Goroutine‑based architecture, detailing data flow, performance trade‑offs, and scalability considerations for high‑frequency time‑series workloads.

Lua · parallel computing · real-time data
0 likes · 12 min read
Ctrip Technology
Aug 12, 2016 · Big Data

Ctrip's Real-Time Data Platform: Architecture, Practices, and Lessons Learned

This article details Ctrip's journey building a unified real-time data platform—covering business motivations, architectural requirements, technology choices like Kafka and Storm, implementation of Avro schemas, monitoring, alerting, operational lessons, and future explorations such as Streaming CQL and JStorm.

Alerting · Big Data · Kafka
0 likes · 15 min read
21CTO
Dec 1, 2015 · Big Data

How to Build a Real‑Time Price Update System for Billion‑Item E‑Commerce

This article explains the design of a distributed, real‑time price‑update service that handles massive product data, combines query‑driven crawling, observer‑pattern notifications, and multiple data sources to keep e‑commerce price and inventory information fresh within minutes.

Big Data · Observer Pattern · distributed architecture
0 likes · 14 min read