Tagged articles
10 articles
JavaEdge
Mar 12, 2025 · Artificial Intelligence

How to Analyze Chinese Sentiment Text Data: From Stats to Word Clouds

This article walks Java developers through exploring a Chinese sentiment‑analysis dataset: label distribution, sentence‑length statistics, vocabulary counts, adjective extraction, and word‑cloud generation, using the Python libraries pandas, seaborn, jieba, and wordcloud.

Data visualization · NLP · Python
0 likes · 10 min read
Test Development Learning Exchange
Nov 11, 2023 · Artificial Intelligence

Python Techniques for Comprehensive Text Data Analysis

This guide demonstrates how to use Python for end‑to‑end text data analysis, covering preprocessing, word‑frequency visualization, classification, sentiment detection, similarity measurement, entity recognition, keyword extraction, summarization, translation, and generation with clear code examples.

NLP · Python · Sentiment Analysis
0 likes · 6 min read
AntTech
Jan 18, 2023 · Artificial Intelligence

Ant Security's Tianjian Content Risk Control System Receives Five‑Star Rating in 2022 Content Review Service Evaluation

On January 17, the China Academy of Information and Communications Technology announced that Ant Security's self‑developed Tianjian multimodal content risk control system achieved the highest five‑star rating in both text and image assessments of the 2022 content review service evaluation, highlighting its advanced AI‑driven moderation capabilities.

Image Analysis · Multimodal AI · ant security
0 likes · 4 min read
Python Programming Learning Circle
Dec 16, 2021 · Artificial Intelligence

Part-of-Speech Tagging with Jieba in Python

This article explains how to perform Chinese part-of-speech tagging with jieba's posseg module in Python, including loading stop words, extracting article content with Newspaper3k, segmenting in precise mode, filtering the tokens, and presenting the results in a pandas DataFrame.

NLP · POS tagging · Python
0 likes · 3 min read
MaGe Linux Operations
Jun 1, 2020 · Backend Development

Mastering Elasticsearch Analyzers: A Deep Dive into Tokenizers and Filters

This article explains how Elasticsearch uses Analyzer components—character filters, tokenizers, and token filters—to perform text analysis, reviews the built‑in analyzers such as standard, simple, stop, whitespace, keyword, pattern, language, ICU and IK, and provides practical _analyze API examples with code snippets and result screenshots.

Elasticsearch · ICU Plugin · IK Analyzer
0 likes · 11 min read
DataFunTalk
Dec 24, 2018 · Artificial Intelligence

Application Scenarios and Practical Implementation of NLP in Yuewen's Content Mining Platform

This article details the business background, technical architecture, and practical deployments of natural language processing for content mining at Yuewen, covering tag construction, knowledge‑graph building, role analysis, recommendation generation, and pornography and plagiarism detection, and closes with lessons learned.

Content Mining · NLP · text analysis
0 likes · 15 min read
System Architect Go
Sep 3, 2018 · Fundamentals

Understanding Elasticsearch Analyzer, Tokenizer, and Token Filters

This article explains the core components of Elasticsearch's full‑text search analysis—Analyzers, Tokenizers, and Token Filters—detailing their roles, building blocks, built‑in types, and how they combine to customize text processing for effective indexing and querying.

Elasticsearch · Full‑Text Search · Token Filter
0 likes · 5 min read