Big Data 16 min read

Kuaishou Data Service System: Modeling, Architecture, and Future Directions

This article presents Kuaishou's comprehensive data service system, covering its domain modeling, evolution from custom to unified services, the Octo query engine and data preparation platform architecture, the dual data API and analysis services, and future plans for intelligence and serverless high‑performance capabilities.

DataFunSummit
DataFunSummit
DataFunSummit
Kuaishou Data Service System: Modeling, Architecture, and Future Directions

The presentation introduces Kuaishou's data service system, beginning with a domain modeling approach that abstracts raw data sources (e.g., Hive, MySQL) into structured tables and semantic data models to address development and operational challenges.

It then outlines the evolution of the service platform through three stages—customized services, platform‑based services, and unified services—highlighting improvements in development efficiency, reduction of duplicated effort, and the introduction of three core capabilities: data query, data preparation, and data integration.

The technical architecture is described as "1 engine + 1 platform > 2 services." The Octo query engine provides a unified federation query language (FQL), supports cross‑engine federation, and leverages Apache Arrow for efficient data exchange. The data preparation platform offers multi‑level caching, intelligent cache pre‑heating, and both visual and automated data modeling to streamline table and metric preparation.

Two primary services are built on this foundation: a Data API service that delivers high‑performance, low‑latency table queries with configurable, form‑driven development, and a Data Analysis service that enables headless BI and the Open Analysis Expression (OAX) language for semantic, multi‑dimensional analytics.

Future directions focus on increasing intelligence and performance, including serverless on‑demand scaling and continued enhancements to the Octo engine to achieve sub‑2 ms latency and support massive query volumes.

analyticsBig DataData Modelingdata-platformData ServiceQuery Engine
DataFunSummit
Written by

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.