Fast OLAP Forum – Latest Practices and Innovations in Real‑Time OLAP
The Fast OLAP Forum held on December 19 at DataFunCon gathers leading experts from Baidu, Tencent, JD, and FreeWheel to share cutting‑edge techniques in vectorized execution, cloud‑native ClickHouse, large‑scale OLAP architectures, and Presto optimizations, offering deep insights for practitioners dealing with massive real‑time data workloads.
Data warehouse/OLAP analysis is a core topic in the big‑data field, and the surge of short‑video and other real‑time services has intensified performance demands, prompting innovations in compression, vectorization, and time‑series processing.
On December 19, 9:00‑12:45, the DataFunCon conference will host the Fast OLAP Forum, presented by Baidu senior R&D engineer and Apache Doris PPMC member Chen Mingyu, to showcase the latest practices in ultra‑fast OLAP.
Speaker 1 – Chen Mingyu (Baidu) : Senior R&D engineer responsible for Apache Doris and Palo, with seven years of distributed system experience, leading the open‑source effort of Doris.
Speaker 2 – Li Haopeng (Baidu) : Topic – “Apache Doris Vectorization Technology Implementation and Future Plans”. The talk covers the vectorized execution engine introduced in version 0.15, performance gains of 3‑10× for single‑table queries, the fundamentals of vectorization, and upcoming community roadmap.
Speaker 3 – Yi Guolei (Tencent TEG) : Topic – “Cloud‑Native ClickHouse Design”. The presentation discusses ClickHouse pain points, the compute‑storage separation architecture, and the implementation of a new MPP query engine.
Speaker 4 – Liu Wang (Baidu) : Topic – “Architecture and Practice of Baidu AiFanFan Data Platform”. The session describes the construction of real‑time and offline big‑data platforms, challenges in data integration, transformation, storage, governance, and analytics, and practical techniques such as Duplicate, Aggregate, Unique models, materialized views, and precise deduplication.
Speaker 5 – Chen Hongjian (JD) : Topic – “JD Retail Big Data OLAP Application and Practice”. The talk explains the impact of massive SKU and organizational changes on data accuracy, large‑scale data refresh (trillions of rows daily), and optimizations in large‑scale data refresh, deduplication, and pre‑computation that enable second‑level OLAP queries.
Speaker 6 – Qiao Zijian (FreeWheel) : Topic – “Presto Practice and Optimization in FreeWheel Advertising Platform”. The presentation covers Presto’s integration with AWS, a custom FW connector, cluster management, Parquet column‑read optimizations, push‑down aggregation, and future plans for broader Presto adoption.
The forum will be livestreamed and free to register; interested participants are encouraged to sign up and join the live session.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
DataFunSummit
Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
