Bilibili Tech
Jul 15, 2022 · Big Data
Lakehouse Architecture Practice at Bilibili: Query Acceleration and Index Enhancement
Bilibili’s lakehouse architecture merges Iceberg‑based data lake flexibility with data‑warehouse efficiency, using Kafka‑Flink real‑time ingestion, Spark offline loads, Trino queries, Alluxio caching, Z‑Order/Hilbert sorting, and enhanced BloomFilter and bitmap indexes to boost query speed up to tenfold while drastically cutting file reads.
Query OptimizationZ-Order sortingbig data architecture
0 likes · 17 min read