Tagged articles
3 articles
Architect
May 24, 2024 · Backend Development

How to Build a MySQL‑Based Fuzzy Company Name Matcher with Regex and IKAnalyzer

This article walks through a real‑world approval workflow where a business user adds a company, explains why MySQL REGEXP is chosen over Elasticsearch for fuzzy matching, and details the step‑by‑step implementation—including name preprocessing, tokenization with IKAnalyzer, and a custom SQL query that ranks results by match degree.

IKAnalyzer · backend-development · database
13 min read
MaGe Linux Operations
Apr 25, 2017 · Artificial Intelligence

Top Python and Java NLP Tools for Chinese Text Processing

This article surveys a wide range of natural language processing libraries—including Python packages like NLTK and spaCy, Java frameworks such as OpenNLP and StanfordNLP, and specialized Chinese tokenizers like IKAnalyzer, ICTCLAS, and FudanNLP—detailing their features, usage, and setup steps for Chinese text analysis.

Chinese · IKAnalyzer · NLP
14 min read