Topic

real-time data

Collection size
45 articles
Page 1 of 3
DataFunSummit
Aug 7, 2024 · Big Data

Ant Group Real-Time Data Warehouse: Architecture, Solutions, and Data Lake Outlook

This article presents Ant Group's recent explorations and practices in real-time data warehousing, detailing its architecture, data quality assurance, stream‑batch integration, and future data lake implementation, while highlighting the use of Flink, ODPS, and Paimon for scalable, low‑latency analytics.

Data Lake · Data Warehouse · Flink
0 likes · 15 min read
DeWu Technology
Feb 24, 2023 · Big Data

Real-Time Data Architecture Evolution for a Complex Supply Chain

The article traces Dewu’s supply‑chain data platform from slow MySQL reporting through early CDC‑based wide tables to a Flink‑Kafka‑ClickHouse 1.0 design, then to a more scalable Flink‑Kafka‑Hologres 2.0 architecture that solves upsert and compute‑storage separation, while detailing key operational tricks, code‑generation tools, and future plans for lake‑house integration.

ClickHouse · Flink · Hologres
0 likes · 10 min read
DeWu Technology
Feb 21, 2023 · Backend Development

Design and Implementation of a Traffic Control Platform for E-commerce Search and Recommendation

The article describes a modular traffic‑control platform for e‑commerce search and recommendation that lets operators quickly adjust strategies for emergencies, cold‑start items, and experiments, replacing costly multi‑team development with a unified operation center, service center, data hub, algorithmic PID controller, real‑time metrics, independent recall chain, and cross‑scene AB testing, while outlining future extensions.

AB testing · PID controller · Real-time Data
0 likes · 16 min read
Xianyu Technology
Dec 21, 2022 · Artificial Intelligence

Xianyu Recommendation System: Architecture, Challenges, and Deployment

The Xianyu recommendation system, built by backend expert Wan Xiaoyong, evolved from offline scoring to a full‑graph, serverless recall‑ranking pipeline that tackles C2C uncertainties through centralized feature engineering, model compression, staged deployment, flexible experimentation, robust governance, and plans for automated attribution and interpretability.

AI · Real-time Data · big data
0 likes · 10 min read
Airbnb Technology Team
Jan 24, 2025 · Artificial Intelligence

Chronon — An Open-Source Framework for Production-Level Feature Engineering in Machine Learning

Chronon is an open‑source framework that centralizes feature definitions to guarantee training‑inference consistency, eliminates complex ETL pipelines, and supports real‑time and batch processing across diverse data sources, cutting feature‑development cycles from months to under a week, as demonstrated by Airbnb’s 40,000‑feature deployment.

Chronon · Data Pipeline · Hive
0 likes · 10 min read
Tencent Cloud Developer
Aug 17, 2021 · Big Data

Elasticsearch Technical Event in Shenzhen

The Shenzhen Elasticsearch technical event, co‑hosted by the Elastic Chinese community and Tencent Cloud, presented practical sessions on optimizing the Elastic Stack for search, real‑time analytics, logging, security and APM, featuring compression encoding, MongoDB fusion, ByteDance extensions, cost‑effective log storage, Lucene indexing, cross‑cluster replication, vector engine integration, and large‑scale case studies from Tencent, Tiptop Data and vivo.

Cloud Platform · Elasticsearch · MongoDB
0 likes · 4 min read
Youzan Coder
Dec 18, 2020 · Big Data

Design and Implementation of a Configurable Real-Time Rule Engine for Live‑Streaming Product Audits

The article presents a configurable real‑time rule engine for live‑streaming product audits that decouples data aggregation from rule execution, uses QLExpress for dynamic conditions, supports Dubbo and HTTP sources, and enables safe gray‑release updates, cutting the rule‑change cycle from weeks to near‑real‑time.

QLExpress · Real-time Data · configuration
0 likes · 8 min read
JD Tech
Sep 11, 2023 · Big Data

Construction and High-Fidelity Load Testing of Real-Time Data Dual-Stream

This article explains how to build a dual‑stream real‑time data pipeline for big‑data applications, defines construction standards, and details a three‑step high‑fidelity load‑testing process that ensures stability and high availability during peak promotional periods.

Real-time Data · big data · dual-stream
0 likes · 10 min read
Zhuanzhuan Tech
Jul 2, 2024 · Mobile Development

Evolution and Design of the Lego Logging System for Mobile Applications

This article describes the four-stage evolution of the Lego client‑side logging system—covering its initial zero‑to‑one architecture, the separation of business and technical logs, real‑time reporting improvements, and the latest architecture redesign that boosts performance, reduces overhead, and provides a safe migration path.

Real-time Data · architecture · migration strategy
0 likes · 14 min read
Aikesheng Open Source Community
Nov 25, 2024 · Big Data

Real-time Data Synchronization from OceanBase to Kafka Using ActionOMS and Flink

This article demonstrates how to use ActionOMS to capture incremental changes from OceanBase, stream them to Kafka in various formats, and employ Flink to deduplicate and aggregate transaction data into a daily summary, illustrating a complete real-time data pipeline for financial use cases.

ActionOMS · Data Synchronization · Flink
0 likes · 10 min read
Rare Earth Juejin Tech Community
Nov 30, 2024 · Frontend Development

Common Data Request Methods for Large Screens and Their Implementation with SSE and WebSocket

This article compares HTTP polling, WebSocket, and Server‑Sent Events (SSE) for large‑screen data fetching, explains their advantages and drawbacks, outlines suitable business scenarios, and provides complete front‑end and back‑end code examples for implementing SSE and WebSocket connections.

HTTP Polling · Node.js · Real-time Data
0 likes · 7 min read
Alibaba Cloud Infrastructure
Dec 29, 2015 · Databases

The Evolution and Architecture of Alibaba's Data Replication Center (DRC)

This article chronicles the development, design decisions, and technical achievements of Alibaba's Data Replication Center (DRC), a real‑time heterogeneous database synchronization platform that has become a core infrastructure for multi‑active data centers, large‑scale e‑commerce, and cloud services.

Alibaba · DRC · Real-time Data
0 likes · 19 min read
Architects Research Society
May 16, 2022 · Big Data

The Four Phases of Netflix’s Trillion‑Scale Real‑Time Data Infrastructure

This article chronicles Netflix’s evolution from a failing batch pipeline to a cloud‑native, multi‑tenant streaming platform across four phases, detailing the motivations, challenges, strategic bets, and patterns that enabled the company to scale real‑time data processing to trillions of events per day.

Cloud Native · Data Infrastructure · Netflix
0 likes · 31 min read
DataFunSummit
Apr 8, 2025 · Big Data

Huolala’s Real‑Time Data Synchronization with Flink CDC: Architecture, Practices, and Future Outlook

This article presents Huolala’s end‑to‑end implementation of Flink CDC for real‑time data capture, detailing the business background, reasons for selecting Flink CDC over Canal, component comparisons, production‑level platform enhancements, data‑lake integration, validation methods, and future directions for unified data ingestion.

Data Lake · Data Synchronization · Flink CDC
0 likes · 13 min read
DataFunSummit
Jun 7, 2024 · Artificial Intelligence

Understanding Feature Engineering for Risk Control Systems and Building an Easy-to-Use Feature Platform

Feature engineering, the process of creating input variables for machine learning models, is crucial for banking risk control; this article explains the concepts of features, variables, and metrics, outlines challenges in real‑time feature pipelines, and proposes a practical architecture and best practices for building an efficient, low‑code feature platform.

Real-time Data · feature engineering · machine learning
0 likes · 10 min read
DataFunSummit
Jan 30, 2024 · Big Data

CVTE’s Journey of Stream Computing Adoption: Architecture, Applications, and Lessons Learned

This article details CVTE’s adoption of stream computing, describing the company background, the challenges of traditional data pipelines, the design of a CDC‑Kafka integration platform, evaluations of PipelineDB, ksqlDB, Materialize and RisingWave, and the overall impact on real‑time analytics and operational efficiency.

CVTE · Real-time Data · RisingWave
0 likes · 9 min read
DataFunSummit
Feb 16, 2023 · Big Data

JD Real-Time Data Product Practice: Overview, Low‑Code Platform, Stream‑Batch Integration, and Operations

This article summarizes JD's real‑time data product practice, covering product overview, low‑code real‑time platform construction, stream‑batch integrated architecture, and the three‑layer operational defense model, while highlighting challenges, evolution, user distribution, and future directions.

Real-time Data · Stream Processing · big data
0 likes · 13 min read
DataFunSummit
Jan 24, 2023 · Big Data

Building a Real-Time Data and User Profiling Architecture with Apache Doris at Zhihu

The article details Zhihu's data empowerment team's design and implementation of a low‑cost, high‑response real‑time data platform built on Apache Doris, covering real‑time business metrics, algorithm features, and user profiling, and explains the challenges, architectural choices, tooling, performance gains, and future directions.

Apache Doris · Real-time Data · data integration
0 likes · 22 min read
DataFunTalk
Jun 1, 2024 · Big Data

Ant Group's Real-Time Data Warehouse Architecture, Solutions, and Data Lake Outlook

This article presents Ant Group's recent explorations and practices in real-time data warehousing, covering the system architecture, streaming data quality assurance, stream‑batch integrated applications, and future data lake integration, while sharing technical details and operational insights for large‑scale data processing.

Data Lake · Data Warehouse · Flink
0 likes · 16 min read
DataFunTalk
Dec 27, 2023 · Big Data

Amoro Mixed Hive: A Unified Lakehouse Solution for Real‑Time and Batch Data Processing

This article describes how NetEase Youdao replaced its Doris‑based real‑time data warehouse with Amoro Mixed Hive, detailing the architectural challenges, the Mixed Hive design, implementation steps, performance optimizations, community contributions, and future roadmap to achieve a unified lakehouse with minute‑level freshness and reduced development and operational costs.

Amoro · Flink · Hive
0 likes · 12 min read