NetEase Yanxuan Technology Product Team
Jul 25, 2022 · Big Data
Probability Algorithms in Big Data: BloomFilter and Count-min Sketch Applications
The article explains how space‑efficient probabilistic structures such as BloomFilter and Count‑min Sketch enable large‑scale data deduplication, join pruning, real‑time idempotent filtering, and approximate top‑K analytics by trading modest accuracy loss for dramatically reduced storage and faster computation.
Big DataBloomFilterCount-Min Sketch
0 likes · 12 min read