Tagged articles
1 articles
Page 1 of 1
Xianyu Technology
Xianyu Technology
Dec 12, 2018 · Big Data

Community Data Normalization Using Prefix Matching and Text Similarity

The study presents a four‑step pipeline that normalizes community data for rental platforms by clustering records using longest‑common‑prefix patterns, geographic filtering, Levenshtein similarity, and pattern‑based parent‑child assignment, achieving under 8 % false positives and 5 % false negatives.

GeospatialReal Estateclustering
0 likes · 10 min read
Community Data Normalization Using Prefix Matching and Text Similarity