DataFunSummit
Mar 25, 2024 · Big Data
Exploring Real-Time Data Lake Practices at Kangaroo Cloud
This article describes Kangaroo Cloud's exploration and practice of real-time data lakes. It covers the background, data lake concepts, the challenges encountered, a solution architecture built on the Shuzhan platform with Iceberg/Hudi, CDC ingestion, small-file handling, cross-cluster ingestion, materialized-view acceleration, and future development plans.
CDC · Cross-Cluster Ingestion · Hudi
12 min read