Tagged articles
8 articles
Page 1 of 1
Liangxu Linux
Liangxu Linux
Jul 14, 2024 · Fundamentals

Decoding Chinese Text: ASCII, GB2312, GBK, GB18030, and UTF‑8 Explained

This article explains how computer text is represented by assigning unique numeric codes to characters and converting those codes into binary, then compares the most common Chinese encodings—ASCII, GB2312, GBK, GB18030, and UTF‑8—detailing their compatibility, byte lengths, and practical impact on software development.

ASCIIGB18030GB2312
0 likes · 14 min read
Decoding Chinese Text: ASCII, GB2312, GBK, GB18030, and UTF‑8 Explained
Seewo Tech Circle
Seewo Tech Circle
Aug 30, 2019 · Fundamentals

Demystifying Character Encoding: From ASCII to Unicode and Beyond

This article explains the fundamentals of character encoding, covering concepts such as information, symbols, character sets, various encoding schemes like ASCII, GB2312, UTF‑8, Unicode planes, common pitfalls, and practical examples to help developers avoid garbled text.

GB2312UTF-8Unicode
0 likes · 9 min read
Demystifying Character Encoding: From ASCII to Unicode and Beyond
ITPUB
ITPUB
Sep 19, 2016 · Fundamentals

Understanding Character Encoding: From ASCII to Unicode and UTF‑8

This article explains the fundamentals of character encoding, covering the evolution from the 7‑bit ASCII standard to Chinese GB2312, the development of Unicode and UTF‑8, and provides practical guidance for handling these encodings in Windows and Linux C programs, including a sample UTF‑8 detection function.

ASCIIC programmingGB2312
0 likes · 13 min read
Understanding Character Encoding: From ASCII to Unicode and UTF‑8
21CTO
21CTO
Jan 4, 2016 · Fundamentals

Why Chinese Text Gets Garbled and How to Fix It: A Deep Dive into Encoding Standards

This article explains why Chinese characters often appear as garbled text on Windows and Linux, introduces the history and hierarchy of Chinese encoding standards such as GB2312, GBK, GB18030 and Unicode, compares ASCII, UTF‑8/16/32, shows practical command‑line experiments, and offers guidance for handling Chinese text in C and Python programs.

CGB2312Python
0 likes · 25 min read
Why Chinese Text Gets Garbled and How to Fix It: A Deep Dive into Encoding Standards
21CTO
21CTO
Sep 29, 2015 · Fundamentals

Why Chinese Text Gets Garbled and How to Fix It – A Practical Encoding Guide

This article explains why Chinese characters often appear as mojibake on Windows and Linux, introduces the history and technical details of ASCII, GB2312, GBK, GB18030 and Unicode, and provides concrete examples and command‑line tools for inspecting and converting file encodings.

C programmingGB2312UTF-8
0 likes · 26 min read
Why Chinese Text Gets Garbled and How to Fix It – A Practical Encoding Guide