JD Retail Technology
Feb 11, 2022 · Big Data
Runtime Filter Join Optimization in JD Spark Using Bloom Filters
This article describes the Runtime Filter Join optimization in JD Spark, which uses Bloom filters to prune large-table data before the shuffle, reducing I/O and execution time for both batch and real-time workloads. It covers the architecture, implementation challenges, code examples, and the performance gains measured in benchmark and production environments.
Bloom Filter · Runtime Filter Join · Shuffle Reduction
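To make the core idea in the summary concrete, here is a minimal sketch in plain Python of pruning a large table with a Bloom filter built from the join keys of a small table. It is an illustration under stated assumptions, not JD Spark's implementation: the `BloomFilter` class, the `runtime_filter_join` function, and the filter sizes are all hypothetical, and an in-memory list stands in for the data that would otherwise be shuffled.

```python
import hashlib


class BloomFilter:
    """Toy Bloom filter (hypothetical; sizes chosen only for illustration)."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key):
        # Derive num_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        # False means "definitely absent"; True means "possibly present".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(key)
        )


def runtime_filter_join(small_rows, large_rows):
    """Inner-join (key, value) rows, pruning large_rows with a Bloom filter first."""
    # 1. Build the filter from the small side's join keys.
    bloom = BloomFilter()
    for key, _ in small_rows:
        bloom.add(key)

    # 2. Prune the large side; in a distributed engine this step would run
    #    before the shuffle, so fewer rows cross the network.
    pruned = [row for row in large_rows if bloom.might_contain(row[0])]

    # 3. Join the survivors. False positives from step 2 are dropped here,
    #    so the join result is still exact.
    small_by_key = {}
    for key, value in small_rows:
        small_by_key.setdefault(key, []).append(value)
    joined = [
        (key, large_value, small_value)
        for key, large_value in pruned
        for small_value in small_by_key.get(key, [])
    ]
    return pruned, joined


small = [(1, "a"), (2, "b")]
large = [(i, f"row{i}") for i in range(100)]
pruned, joined = runtime_filter_join(small, large)
print(f"large side: {len(large)} rows, after pruning: {len(pruned)} rows")
print(sorted(joined))
```

A Bloom filter never produces false negatives, so every large-table row with a matching key survives the pruning step; a few non-matching rows may also pass, which costs some extra shuffle volume but does not affect correctness.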
