Big Data 7 min read

Fast OLAP Forum – Latest Practices and Innovations in Real‑Time OLAP

The Fast OLAP Forum held on December 19 at DataFunCon gathers leading experts from Baidu, Tencent, JD, and FreeWheel to share cutting‑edge techniques in vectorized execution, cloud‑native ClickHouse, large‑scale OLAP architectures, and Presto optimizations, offering deep insights for practitioners dealing with massive real‑time data workloads.

DataFunSummit
DataFunSummit
DataFunSummit
Fast OLAP Forum – Latest Practices and Innovations in Real‑Time OLAP

Data warehouse/OLAP analysis is a core topic in the big‑data field, and the surge of short‑video and other real‑time services has intensified performance demands, prompting innovations in compression, vectorization, and time‑series processing.

On December 19, 9:00‑12:45, the DataFunCon conference will host the Fast OLAP Forum, presented by Baidu senior R&D engineer and Apache Doris PPMC member Chen Mingyu, to showcase the latest practices in ultra‑fast OLAP.

Speaker 1 – Chen Mingyu (Baidu) : Senior R&D engineer responsible for Apache Doris and Palo, with seven years of distributed system experience, leading the open‑source effort of Doris.

Speaker 2 – Li Haopeng (Baidu) : Topic – “Apache Doris Vectorization Technology Implementation and Future Plans”. The talk covers the vectorized execution engine introduced in version 0.15, performance gains of 3‑10× for single‑table queries, the fundamentals of vectorization, and upcoming community roadmap.

Speaker 3 – Yi Guolei (Tencent TEG) : Topic – “Cloud‑Native ClickHouse Design”. The presentation discusses ClickHouse pain points, the compute‑storage separation architecture, and the implementation of a new MPP query engine.

Speaker 4 – Liu Wang (Baidu) : Topic – “Architecture and Practice of Baidu AiFanFan Data Platform”. The session describes the construction of real‑time and offline big‑data platforms, challenges in data integration, transformation, storage, governance, and analytics, and practical techniques such as Duplicate, Aggregate, Unique models, materialized views, and precise deduplication.

Speaker 5 – Chen Hongjian (JD) : Topic – “JD Retail Big Data OLAP Application and Practice”. The talk explains the impact of massive SKU and organizational changes on data accuracy, large‑scale data refresh (trillions of rows daily), and optimizations in large‑scale data refresh, deduplication, and pre‑computation that enable second‑level OLAP queries.

Speaker 6 – Qiao Zijian (FreeWheel) : Topic – “Presto Practice and Optimization in FreeWheel Advertising Platform”. The presentation covers Presto’s integration with AWS, a custom FW connector, cluster management, Parquet column‑read optimizations, push‑down aggregation, and future plans for broader Presto adoption.

The forum will be livestreamed and free to register; interested participants are encouraged to sign up and join the live session.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Big DataClickHouseData WarehouseOLAPvectorizationPrestoApache Doris
DataFunSummit
Written by

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.