JD Tech Talk
Nov 5, 2019 · Artificial Intelligence
GeoBERT: A Multi‑Task Pre‑trained Language Model for Chinese Address Text
This article introduces GeoBERT, a novel pre‑training method for Chinese address strings that leverages seven jointly constrained tasks to capture spatial semantics, administrative hierarchy, and similarity relationships, enabling downstream address classification, segmentation, POI extraction, similarity comparison, and authenticity verification with reduced annotation dependence.
Chinese languageGeoBERTNLP
0 likes · 15 min read