DataFun Summit 2023 Real‑Time Computing Forum – Speaker Line‑up and Session Details
The DataFun Summit 2023 Real‑Time Computing Forum showcases a series of expert talks on Apache Flink, stream‑batch integration, cloud‑native streaming databases, and large‑scale real‑time data warehousing, featuring speakers from Alibaba Cloud, Taobao, Didi, Ant Group and RisingWave.
Real‑time computing delivers sub‑second data freshness, and Apache Flink remains a core technology in this space, evolving toward unified stream‑batch processing, stream data warehouses, and Kubernetes‑based scheduling.
The DataFun Summit 2023 Data Infrastructure Forum invites experts from Taobao, Ant Group, Didi, Alibaba Cloud and RisingWave to share practical experiences, new technical directions, and architectural explorations for Flink‑based real‑time data warehouses.
Speaker: Li Jinsong – Senior Technical Expert, Alibaba Cloud
Bio: Leader of Alibaba Cloud’s open‑source big‑data table storage team, developer of Apache Paimon, PMC member of Apache Flink, and contributor to Apache Iceberg & Beam. Focuses on distributed stream and batch computing, lake storage, and stream‑lake integration.
Speaker: Shao Liangkai – Data Platform Technical Expert, Taobao
Bio: Head of Taobao’s real‑time data product for major sales events, PM for Double‑11 promotion data, led migration from Blink to Flink and built stability solutions.
Talk Title: Real‑time Stability Assurance for Taobao’s Mega‑Promotion
Outline: 1) Overview of the promotion product; 2) Business challenges under extreme traffic peaks; 3) Stability solutions; 4) Overall impact; 5) Future outlook.
Audience Benefits: Real‑time stress‑testing solutions for extreme scenarios, Holo storage optimization, and special‑case stability case studies.
Speaker: Wu Chong – Flink SQL Lead, Alibaba Cloud
Bio: Senior technical expert, PMC member and committer of Apache Flink, co‑creator of Flink CDC, focuses on stream and batch processing.
Talk Title: Apache Flink – From Stream Computing to Stream Data Warehouse
Outline: Problems of current real‑time warehouses, Flink’s evolution toward stream data warehouses, progress on Paimon, more robust stream processing, batch performance gains, data management features, SQL services, and future roadmap.
Audience Benefits: Selecting a real‑time warehouse, future trends, overview of new Flink features, and upcoming development plans.
Speaker: Liu Zhuo – Real‑time Computing Engineer, Didi
Bio: Experienced in real‑time computing, currently responsible for migrating Didi’s Flink engine from Yarn to Kubernetes.
Talk Title: Flink on K8s – Didi’s Practice and Experience
Outline: Component selection, engine refactoring, optimization, and stability improvements when running Flink on Kubernetes.
Audience Benefits: Advantages of moving Flink from Yarn to K8s, how to adapt Flink for K8s, and stability‑related optimizations.
Speaker: Min Wenjun – Technical Expert, Ant Group
Bio: Graduate of Nanjing University of Science & Technology, works on real‑time computing at Ant Group, focusing on stream‑batch integration and data lake.
Talk Title: Flink Stream‑Batch Integration at Ant Group
Outline: 1) Stream‑batch scenarios; 2) Challenges; 3) Optimizations.
Audience Benefits: Understanding how stream‑batch integration improves business efficiency and the challenges faced during deployment.
Speaker: Fu Yu – Database Engineer, RisingWave Labs
Bio: Nanjing University graduate, 7‑year database systems developer, team lead for RisingWave’s streaming database kernel.
Talk Title: Inside RisingWave – Cloud‑Native Streaming Database Technology
Outline: 1) What is a streaming database; 2) RisingWave’s design philosophy; 3) Cloud‑native architecture; 4) Application cases.
Audience Benefits: Reducing development and operations costs with streaming databases, building a real‑time compute stack quickly, and leveraging cloud‑native techniques to lower streaming compute costs.
DataFunTalk
Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.