Tagged articles
2 articles
Page 1 of 1
Java Backend Technology
Java Backend Technology
Dec 9, 2021 · Big Data

How to Efficiently Find Common URLs in Billions of Records

This article explains how to handle the massive‑data problem of intersecting two files containing billions of URLs by using hash‑based divide‑and‑conquer techniques, file partitioning, and in‑memory hash lookups to achieve scalable performance beyond naive O(m·n) approaches.

HashURL intersectionalgorithm
0 likes · 8 min read
How to Efficiently Find Common URLs in Billions of Records