Tagged articles
30 articles
JavaEdge
Mar 12, 2025 · Artificial Intelligence

How to Analyze Chinese Sentiment Text Data: From Stats to Word Clouds

This article guides Java developers through a complete Chinese sentiment‑analysis dataset exploration, covering label distribution, sentence length statistics, vocabulary counts, adjective extraction, and visual word‑cloud generation using Python libraries such as pandas, seaborn, jieba, and wordcloud.

Data visualization · NLP · Python
0 likes · 10 min read
Open Source Tech Hub
Feb 20, 2025 · Backend Development

Build and Use jieba-php: Chinese Word Segmentation in PHP via Rust

This guide explains how to install jieba-php, a Rust‑based Chinese word segmentation extension for PHP. It lists the required dependencies, shows the cargo build steps, demonstrates the runtime commands, describes the provided API, and offers a complete usage example.

chinese segmentation · jieba
0 likes · 2 min read
Infra Learning Club
Oct 31, 2024 · Artificial Intelligence

What Is a Token in Large Language Models?

The article explains that a token is the unit processed by large language models, describes three common tokenization methods (word‑level, character‑level, and sub‑word‑level) with English and Chinese examples, discusses their advantages and limitations, and shows how OpenAI’s tokenizer varies across model versions.

NLP · Token · character-level
0 likes · 5 min read
Model Perspective
Sep 11, 2023 · Artificial Intelligence

Why Chinese Word Segmentation Matters: Techniques, Challenges, and Python Demo

This article explores Chinese word segmentation, illustrates its linguistic nuances with a humorous example, explains key methods (dictionary‑based, statistical, and deep‑learning approaches), and provides Python code that demonstrates both a simple dictionary algorithm and the popular jieba library.

Chinese NLP · Python · jieba
0 likes · 6 min read
Python Crawling & Data Mining
Feb 9, 2022 · Artificial Intelligence

How to Turn Crawled CSV Data into Word Clouds and Sentiment Scores with Python

This guide walks you through extracting text from a CSV obtained via Python web scraping, cleaning it with stop‑words, generating a word‑cloud, performing jieba tokenization and frequency analysis, and finally applying SnowNLP for sentiment scoring, with all code snippets and data links provided.

Sentiment Analysis · SnowNLP · Web Scraping
0 likes · 12 min read
Python Programming Learning Circle
Dec 16, 2021 · Artificial Intelligence

Part-of-Speech Tagging with Jieba in Python

This article explains how to perform Chinese part-of-speech tagging using the jieba.posseg module in Python, including loading stop words, extracting article content via Newspaper3k, applying precise-mode segmentation, filtering the results, and presenting them in a pandas DataFrame.

NLP · POS tagging · Python
0 likes · 3 min read
Python Programming Learning Circle
Nov 15, 2021 · Backend Development

Python Web Project: Visualizing Hot Search Rankings and Domestic COVID‑19 Cases with Flask, Web Scraping, and ECharts

This report describes a Python‑based web application built with Flask that scrapes hot‑search data from Weibo, Baidu and Zhihu, processes it using jieba and other libraries, and visualizes the results together with domestic COVID‑19 statistics using ECharts on a responsive front‑end page.

data-visualization · frontend · jieba
0 likes · 7 min read
Python Crawling & Data Mining
Jun 16, 2021 · Artificial Intelligence

Master Chinese Text Segmentation with jieba: Installation, Modes, and Advanced Tricks

This tutorial walks you through installing the jieba Python library, explains its three segmentation modes (precise, full, and search), and demonstrates how to add or delete words, manage custom dictionaries, handle stop words, perform weight analysis, adjust word frequencies, and retrieve token positions, all with clear code examples and visual output.

NLP · Python · chinese segmentation
0 likes · 10 min read
MaGe Linux Operations
Jul 11, 2019 · Backend Development

How to Scrape JD Product Reviews and Create Word Clouds with Python

This tutorial walks you through analyzing JD product pages, extracting comment data via requests with proper headers, handling pagination, saving results, cleaning text using jieba, and visualizing frequent terms as a word cloud, all illustrated with step‑by‑step screenshots and code snippets.

Python · jieba · requests
0 likes · 10 min read
MaGe Linux Operations
Mar 22, 2018 · Artificial Intelligence

Mapping Character Relationships in 'Heavenly Sword and Dragon Slaying' with Jieba, Word2Vec & NetworkX

This article demonstrates how to combine Jieba segmentation, Word2Vec embeddings, and NetworkX graph visualization to extract and analyze character relationships from the Chinese novel "Heavenly Sword and Dragon Slaying," detailing data preparation, model training, entity matrix construction, and network graph generation.

Graph Visualization · NLP · Python
0 likes · 10 min read
MaGe Linux Operations
Jun 17, 2017 · Artificial Intelligence

Create a One Piece‑Inspired Word Cloud with Python, Jieba, and WordCloud

This tutorial guides readers with basic programming experience through using Python 3 in Jupyter to extract keywords from One Piece lyrics with jieba, and then generate a Chopper‑shaped word cloud using matplotlib and the wordcloud library, covering required dependencies and step‑by‑step code.

Data visualization · Jupyter · Python
0 likes · 3 min read
MaGe Linux Operations
Apr 9, 2017 · Artificial Intelligence

How to Install and Fix WordCloud in Python for Chinese Text Visualization

This guide walks you through installing the Python WordCloud library, resolving common compilation errors, handling Chinese font encoding issues, and creating basic and image‑masked word clouds, complete with code snippets and troubleshooting tips for smooth visualization of Chinese text data.

Chinese NLP · Python · jieba
0 likes · 4 min read