Tagged articles
13 articles
Page 1 of 1
Model Perspective
Model Perspective
Apr 6, 2026 · Fundamentals

Why Some Chinese Math Terms Nail the Meaning While Others Miss the Mark

The article examines the history and criteria of Chinese mathematical terminology, highlighting well‑translated terms like "微积分" and "向量", exposing misleading translations such as "数学归纳法", and offering guidelines for creating clear, concise, and unambiguous mathematical names.

ChineseLanguageTerminology
0 likes · 14 min read
Why Some Chinese Math Terms Nail the Meaning While Others Miss the Mark
Python Crawling & Data Mining
Python Crawling & Data Mining
Apr 1, 2023 · Backend Development

How to Fix Chinese Garbled Text in Python Web Scraping

This article explains three practical methods—using response.content, apparent_encoding, and custom encode/decode—to resolve Chinese character garbling in Python web crawlers, includes code snippets and screenshots, and offers concise guidance for developers facing this common issue.

ChinesePythonencoding
0 likes · 5 min read
How to Fix Chinese Garbled Text in Python Web Scraping
Laravel Tech Community
Laravel Tech Community
Feb 2, 2023 · Backend Development

Using overtrue/pinyin in PHP to Convert Chinese Characters to Pinyin

This guide explains how to install the overtrue/pinyin Composer package and demonstrates various PHP methods for converting Chinese text to pinyin, including tone styles, array output, permalink generation, abbreviation extraction, handling polyphones, and using the command‑line tool.

BackendChineseovertrue
0 likes · 6 min read
Using overtrue/pinyin in PHP to Convert Chinese Characters to Pinyin
ELab Team
ELab Team
Sep 23, 2022 · Artificial Intelligence

Fine‑Tune a Chinese BERT Model for Cloze Tasks in 30 Minutes

This tutorial walks you through NLP fundamentals, the evolution of BERT, the concept of pre‑trained models, and a step‑by‑step guide to fine‑tune a Chinese BERT on a cloze‑style task, complete with code snippets and verification results.

BERTChineseCloze Task
0 likes · 13 min read
Fine‑Tune a Chinese BERT Model for Cloze Tasks in 30 Minutes
Beike Product & Technology
Beike Product & Technology
Jul 1, 2021 · Artificial Intelligence

Semantic Data Augmentation and GigaSpeech: Highlights of Two INTERSPEECH 2021 Papers from the Beike Voice Team

The article summarizes two INTERSPEECH 2021 papers from Beike's voice technology team, detailing a grammar‑based semantic data augmentation method that improves end‑to‑end Chinese speech recognition and introducing GigaSpeech, a massive 10,000‑hour multilingual English speech dataset for robust ASR research.

ChineseGigaSpeechInterspeech
0 likes · 7 min read
Semantic Data Augmentation and GigaSpeech: Highlights of Two INTERSPEECH 2021 Papers from the Beike Voice Team
58 Tech
58 Tech
Aug 19, 2020 · Artificial Intelligence

Speech Recognition in 58.com: Application Scenarios, Data Collection, Kaldi Chain Model Practice, and End‑to‑End Exploration

This article presents a comprehensive overview of how 58.com leverages large‑scale voice data from call‑center, private phone, and micro‑chat platforms, detailing data collection, annotation, Kaldi‑based chain model training, lattice‑free techniques, and end‑to‑end Transformer‑CTC models to improve Chinese speech recognition performance.

ASRChineseDeep Learning
0 likes · 16 min read
Speech Recognition in 58.com: Application Scenarios, Data Collection, Kaldi Chain Model Practice, and End‑to‑End Exploration
Java Captain
Java Captain
Jul 2, 2018 · Databases

How to Fully Resolve MySQL Chinese Character Encoding Issues

This guide explains why MySQL tables default to latin1, demonstrates how to check and change table, session, and global character set settings, and shows the necessary my.cnf configuration changes to permanently enable UTF‑8 support for storing Chinese characters.

ChineseDatabase ConfigurationUTF-8
0 likes · 6 min read
How to Fully Resolve MySQL Chinese Character Encoding Issues
MaGe Linux Operations
MaGe Linux Operations
Apr 25, 2017 · Artificial Intelligence

Top Python and Java NLP Tools for Chinese Text Processing

This article surveys a wide range of natural language processing libraries—including Python packages like NLTK and spaCy, Java frameworks such as OpenNLP and StanfordNLP, and specialized Chinese tokenizers like IKAnalyzer, ICTCLAS, and FudanNLP—detailing their features, usage, and setup steps for Chinese text analysis.

ChineseIKAnalyzerJava
0 likes · 14 min read
Top Python and Java NLP Tools for Chinese Text Processing
ITPUB
ITPUB
Mar 21, 2017 · Databases

How to Fix Chinese Character Garbling When Migrating Oracle to MySQL

When moving data from Oracle to MySQL, Chinese characters may appear as garbled text because the server uses a latin1 charset; this guide explains three levels of fixes—table, database, and server—by configuring UTF‑8 settings and verifying the changes.

Character SetChineseData Migration
0 likes · 5 min read
How to Fix Chinese Character Garbling When Migrating Oracle to MySQL