Tag

Chinese Tokenization

0 views collected around this technical thread.

Wukong Talks Architecture
Wukong Talks Architecture
Mar 31, 2021 · Backend Development

How to Install and Use the IK Chinese Analyzer Plugin in Elasticsearch

This article explains why Elasticsearch's built‑in tokenizers struggle with Chinese text, introduces the IK analyzer plugin, provides step‑by‑step Docker and file‑based installation methods, shows how to configure custom dictionaries via Nginx, and demonstrates smart and max‑word tokenization queries.

Chinese TokenizationCustom DictionaryDocker
0 likes · 12 min read
How to Install and Use the IK Chinese Analyzer Plugin in Elasticsearch
Wukong Talks Architecture
Wukong Talks Architecture
Oct 9, 2020 · Big Data

Elasticsearch Fundamentals: Architecture, Indexing, Queries, Docker Setup, and Chinese Tokenization

This tutorial introduces Elasticsearch's core concepts, installation via Docker, index and document operations, query DSL, aggregations, and Chinese tokenization using the IK analyzer with custom dictionaries, providing step‑by‑step code examples for building a searchable log analysis stack.

Chinese TokenizationDockerElasticsearch
0 likes · 28 min read
Elasticsearch Fundamentals: Architecture, Indexing, Queries, Docker Setup, and Chinese Tokenization