Tag

text analysis

0 views collected around this technical thread.

Python Programming Learning Circle
Python Programming Learning Circle
Jan 9, 2025 · Fundamentals

Python Data Preprocessing and Visualization of Jay Chou Lyrics: From JSON to Word Cloud

This tutorial demonstrates how to convert a JSON lyric database into Excel, filter Jay Chou songs, perform Chinese word segmentation with Jieba, compute word frequencies, and create visualizations such as word clouds using Python code and online tools.

VisualizationWordClouddata preprocessing
0 likes · 9 min read
Python Data Preprocessing and Visualization of Jay Chou Lyrics: From JSON to Word Cloud
Python Programming Learning Circle
Python Programming Learning Circle
Dec 1, 2023 · Artificial Intelligence

Generating Word Cloud and Pie Chart from a News Article Using Python

This article demonstrates how to scrape a news webpage with Python, extract and segment its Chinese text using jieba, count word frequencies, and visualize the top ten terms as a word cloud and a pie chart with pyecharts.

Data VisualizationWeb Scrapingjieba
0 likes · 3 min read
Generating Word Cloud and Pie Chart from a News Article Using Python
Test Development Learning Exchange
Test Development Learning Exchange
Nov 11, 2023 · Artificial Intelligence

Python Techniques for Comprehensive Text Data Analysis

This guide demonstrates how to use Python for end‑to‑end text data analysis, covering preprocessing, word‑frequency visualization, classification, sentiment detection, similarity measurement, entity recognition, keyword extraction, summarization, translation, and generation with clear code examples.

NLPPythondata preprocessing
0 likes · 6 min read
Python Techniques for Comprehensive Text Data Analysis
AntTech
AntTech
Jan 18, 2023 · Artificial Intelligence

Ant Security's Tianjian Content Risk Control System Receives Five‑Star Rating in 2022 Content Review Service Evaluation

On January 17, the China Academy of Information and Communications Technology announced that Ant Security's self‑developed Tianjian multimodal content risk control system achieved the highest five‑star rating in both text and image assessments of the 2022 content review service evaluation, highlighting its advanced AI‑driven moderation capabilities.

ant securitycontent moderationimage analysis
0 likes · 4 min read
Ant Security's Tianjian Content Risk Control System Receives Five‑Star Rating in 2022 Content Review Service Evaluation
Python Programming Learning Circle
Python Programming Learning Circle
Dec 16, 2021 · Artificial Intelligence

Part-of-Speech Tagging with Jieba in Python

This article explains how to perform Chinese part-of-speech tagging using the jieba.posseg library in Python, including loading stop words, extracting article content via Newspaper3k, applying precise mode segmentation, filtering, and presenting results in a pandas DataFrame.

NLPPOS taggingPython
0 likes · 3 min read
Part-of-Speech Tagging with Jieba in Python
DataFunTalk
DataFunTalk
Dec 24, 2018 · Artificial Intelligence

Application Scenarios and Practical Implementation of NLP in Yuewen's Content Mining Platform

This article details the business background, technical architecture, and practical deployments of natural language processing for content mining at Yuewen, covering tag construction, knowledge‑graph building, role analysis, recommendation generation, porn and plagiarism detection, and summarizing lessons learned.

Content MiningNLPknowledge graph
0 likes · 15 min read
Application Scenarios and Practical Implementation of NLP in Yuewen's Content Mining Platform
System Architect Go
System Architect Go
Sep 3, 2018 · Fundamentals

Understanding Elasticsearch Analyzer, Tokenizer, and Token Filters

This article explains the core components of Elasticsearch's full‑text search analysis—Analyzers, Tokenizers, and Token Filters—detailing their roles, building blocks, built‑in types, and how they combine to customize text processing for effective indexing and querying.

AnalyzerElasticsearchFull-Text Search
0 likes · 5 min read
Understanding Elasticsearch Analyzer, Tokenizer, and Token Filters