Tagged articles
5 articles
Open Source Tech Hub
Feb 20, 2025 · Backend Development

Build and Use jieba-php: Chinese Word Segmentation in PHP via Rust

This guide explains how to install jieba-php, a Rust-based Chinese word segmentation extension for PHP. It lists the required dependencies, shows the cargo build steps, demonstrates the runtime commands, describes the provided API, and ends with a complete usage example.

chinese segmentation · jieba
0 likes · 2 min read
JD Cloud Developers
Dec 24, 2021 · Databases

Boost Address Keyword Search with PostgreSQL: GIN, pg_trgm, and pg_bigm

This article explains how PostgreSQL's native full‑text search, GIN indexes, and extensions like pg_trgm, pg_bigm, and pg_jieba can dramatically improve the speed and accuracy of Chinese address keyword searches, offering a cost‑effective alternative to external search engines.

Full-Text Search · GIN Index · address keyword search
0 likes · 6 min read
Python Crawling & Data Mining
Jun 16, 2021 · Artificial Intelligence

Master Chinese Text Segmentation with jieba: Installation, Modes, and Advanced Tricks

This tutorial walks you through installing the jieba Python library, explains its three segmentation modes—precise, full, and search—demonstrates how to add or delete words, manage custom dictionaries, handle stop words, perform weight analysis, adjust word frequencies, and retrieve token positions, all with clear code examples and visual output.

NLP · Python · chinese segmentation
0 likes · 10 min read
58 Tech
Feb 26, 2020 · Artificial Intelligence

Tag Mining and Optimization Practices Using Chinese Segmentation Tools

This article presents a comprehensive overview of tag mining practices—including similarity‑based, compound‑word, topic, hot‑search, and image‑based approaches—along with detailed evaluations of Chinese segmentation tools and systematic tag optimization techniques such as synonym and negative‑word detection.

HanLP · NLP · chinese segmentation
0 likes · 15 min read
58 Tech
Jan 22, 2019 · Artificial Intelligence

Chinese Word Segmentation: Challenges, Methods, and Practical Practices

The article explains why Chinese word segmentation is essential for NLP tasks, outlines its fundamental difficulties such as ambiguity and out‑of‑vocabulary words, reviews dictionary‑based, statistical, and CRF approaches, and shares practical experiences from 58 Search’s production system.

CRF · Language Model · NLP
0 likes · 21 min read