Tag

character encoding

0 views collected around this technical thread.

Lobster Programming
Lobster Programming
Feb 27, 2025 · Fundamentals

Why Garbled Characters Appear: Exploring ASCII, GB2312, GBK & Unicode

This article explains how character encoding works—from ASCII and its extensions to Chinese GB2312 and GBK, through Unicode's UCS‑2, UCS‑4, and the versatile UTF‑8—showing why mismatched encodings produce garbled text and why UTF‑8 is the default in Spring Boot.

ASCIIGB2312GBK
0 likes · 9 min read
Why Garbled Characters Appear: Exploring ASCII, GB2312, GBK & Unicode
Java Tech Enthusiast
Java Tech Enthusiast
Apr 21, 2024 · Fundamentals

Decoding Binary UTF-8 Signage in a Public Restroom Using Java

The article explains how a binary message on a multilingual public‑restroom sign was decoded by identifying UTF‑8 byte patterns, extracting the first 24 bits to reveal the Chinese character “向”, and providing a Java program that parses the entire bit string into readable Chinese text.

Binary EncodingJavaUTF-8
0 likes · 4 min read
Decoding Binary UTF-8 Signage in a Public Restroom Using Java
Top Architecture Tech Stack
Top Architecture Tech Stack
Feb 23, 2024 · Fundamentals

Understanding Character Encoding: ASCII, GB2312, Unicode, and UTF-8

This article explains the history, purpose, and differences of major character encodings—including ASCII, GB2312, Unicode, and UTF-8—while showing how they are used and converted in modern computing environments.

ASCIIGB2312UTF-8
0 likes · 11 min read
Understanding Character Encoding: ASCII, GB2312, Unicode, and UTF-8
php中文网 Courses
php中文网 Courses
Dec 13, 2023 · Backend Development

Optimizing Web Page Encoding and Character Conversion with PHP

This article explains how to set the correct character encoding for web pages and demonstrates PHP functions such as meta charset, iconv, urlencode/urldecode, and mysqli_real_escape_string with practical code examples to ensure proper display and safe database insertion.

Web Developmentcharacter encodingiconv
0 likes · 5 min read
Optimizing Web Page Encoding and Character Conversion with PHP
Sohu Tech Products
Sohu Tech Products
Dec 6, 2023 · Frontend Development

The Nuances of Base64 Encoding Strings in JavaScript

The article explains that JavaScript’s native btoa() and atob() functions only handle ASCII, so to correctly base64‑encode Unicode strings you must convert them with TextEncoder to UTF‑8 bytes, use Uint8Array, and decode with TextDecoder, while checking for malformed surrogate pairs via isWellFormed or encodeURIComponent to avoid silent data loss.

Base64JavaScriptTextDecoder
0 likes · 14 min read
The Nuances of Base64 Encoding Strings in JavaScript
Architect's Guide
Architect's Guide
Sep 15, 2023 · Databases

Why You Should Never Use MySQL “utf8” and Switch to “utf8mb4”

The article explains that MySQL’s legacy “utf8” charset only supports three‑byte characters, causing errors with genuine four‑byte UTF‑8 symbols, and advises all MySQL/MariaDB users to migrate to the proper “utf8mb4” charset using available conversion guides.

DatabaseMariaDBMySQL
0 likes · 7 min read
Why You Should Never Use MySQL “utf8” and Switch to “utf8mb4”
360 Tech Engineering
360 Tech Engineering
Jul 18, 2023 · Fundamentals

Understanding Characters, Character Sets, and Encoding: From ASCII to Unicode

This article explains the concepts of characters, character sets, and character encoding, describes how computers store and render text using methods like ASCII, GB2312, Unicode, and UTF‑8/16/32, and discusses why garbled text occurs across different languages and systems.

ASCIIUTF-8Unicode
0 likes · 10 min read
Understanding Characters, Character Sets, and Encoding: From ASCII to Unicode
Sohu Tech Products
Sohu Tech Products
Jul 12, 2023 · Fundamentals

The Mystery of Character Encoding: Unicode, UTF‑8, UTF‑16, GBK and Emoji

This article explains the fundamentals of character encoding, covering Unicode’s universal character set, the structure of its planes and surrogate areas, the variable‑length UTF‑8 and UTF‑16 encodings, Chinese‑specific GBK encoding, and practical iOS code examples for handling Unicode, emojis and regular‑expression based Chinese character detection.

GBKUTF-8Unicode
0 likes · 12 min read
The Mystery of Character Encoding: Unicode, UTF‑8, UTF‑16, GBK and Emoji
Top Architect
Top Architect
Dec 22, 2022 · Databases

Understanding MySQL utf8 Limitations and Why You Should Use utf8mb4

This article explains why MySQL's traditional utf8 charset cannot store 4‑byte characters such as emojis, demonstrates the resulting insert errors, and shows how switching the database, system, and column collations to utf8mb4 resolves the issue while also providing a brief history of MySQL's charset implementation.

DatabaseMySQLSQL
0 likes · 9 min read
Understanding MySQL utf8 Limitations and Why You Should Use utf8mb4
Tencent Cloud Developer
Tencent Cloud Developer
May 17, 2022 · Fundamentals

A Comprehensive History and Overview of Character Encoding and Unicode

The article traces character encoding from early telegraph and Morse code through ASCII, ISO national sets and Chinese standards, explains Unicode’s unification and its UTF‑8/‑16/‑32 forms, and shows how modern languages—especially JavaScript—handle code points, highlighting the cultural and technical significance for developers.

ASCIIHistoryJavaScript
0 likes · 31 min read
A Comprehensive History and Overview of Character Encoding and Unicode
IT Services Circle
IT Services Circle
Mar 4, 2022 · Fundamentals

Understanding Character Encoding: From GBK and UTF-8 to Unicode

This tutorial explains the origins and evolution of character encoding, covering early ASCII, Chinese GBK/GB18030, the universal Unicode standard, UTF‑8 variable‑length encoding, and practical differences between Python 2 and Python 3 with code examples.

ASCIIGBKPython
0 likes · 9 min read
Understanding Character Encoding: From GBK and UTF-8 to Unicode
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Nov 12, 2021 · Databases

Understanding MySQL Encoding Mechanism and Solving Chinese Character Query Issues

This article explains MySQL's character encoding workflow, illustrates why queries containing Chinese characters fail without proper settings, and shows how to configure JDBC URLs, server variables, and Docker‑based MySQL instances to ensure lossless UTF‑8 handling.

Database ConfigurationDockerJDBC
0 likes · 9 min read
Understanding MySQL Encoding Mechanism and Solving Chinese Character Query Issues
Xueersi Online School Tech Team
Xueersi Online School Tech Team
Jul 9, 2021 · Fundamentals

Understanding Character Encoding and Redis SDS Dynamic String Implementation

This article explains how computers store text using binary, introduces ASCII, Unicode and UTF‑8 encoding rules, discusses the limitations of C‑style null‑terminated strings, and describes Redis's Simple Dynamic String (SDS) data structure, its old and new versions, advantages, and related APIs.

C stringsRedisSDS
0 likes · 14 min read
Understanding Character Encoding and Redis SDS Dynamic String Implementation
Python Programming Learning Circle
Python Programming Learning Circle
Mar 9, 2021 · Fundamentals

Understanding How Input Methods Encode Characters and Code Pages

This article explains the process by which keyboard input is transformed into ASCII codes, how input methods map those codes to proprietary encodings, the role of code pages and fonts in rendering characters, and the use of Unicode, BOM, and endianness when saving text files.

ASCIIUnicodecharacter encoding
0 likes · 3 min read
Understanding How Input Methods Encode Characters and Code Pages
macrozheng
macrozheng
Feb 8, 2021 · Fundamentals

Why Do You See “锟斤拷” in Text? Uncover the Encoding Mystery

This article explains how character encoding works, using ASCII, Unicode, UTF‑8 and GBK examples to reveal why the garbled string “锟斤拷” appears when mismatched encodings are processed, and shows the underlying byte‑level transformations.

ASCIIGBKUTF-8
0 likes · 4 min read
Why Do You See “锟斤拷” in Text? Uncover the Encoding Mystery
Laravel Tech Community
Laravel Tech Community
Aug 26, 2020 · Backend Development

PHP mb_http_output() – Set or Get HTTP Output Character Encoding

This article explains the PHP mb_http_output() function, detailing how it sets or retrieves the HTTP output character encoding, its parameters, return values, and provides a complete example for configuring Unicode handling in PHP scripts.

HTTPPHPbackend development
0 likes · 2 min read
PHP mb_http_output() – Set or Get HTTP Output Character Encoding
Laravel Tech Community
Laravel Tech Community
Aug 23, 2020 · Backend Development

Using mb_detect_order() to Set and Retrieve Character Encoding Detection Order in PHP

This article explains how the PHP mb_detect_order() function configures or retrieves the list of character encodings used for automatic detection, describes its parameters and return values, and provides practical code examples for both string and array inputs.

backendcharacter encodingmb_detect_order
0 likes · 2 min read
Using mb_detect_order() to Set and Retrieve Character Encoding Detection Order in PHP
Laravel Tech Community
Laravel Tech Community
Aug 21, 2020 · Backend Development

PHP mb_internal_encoding(): Set or Get Internal Character Encoding

This article explains the PHP mb_internal_encoding() function, detailing its syntax, parameter description, return values, and provides a clear example showing how to set the internal encoding to UTF-8 and retrieve the current encoding.

ExamplePHPbackend
0 likes · 2 min read
PHP mb_internal_encoding(): Set or Get Internal Character Encoding
Architecture Digest
Architecture Digest
Jun 23, 2020 · Databases

Never Use MySQL “utf8” – Switch to “utf8mb4” for Real UTF‑8 Support

The article explains why MySQL’s legacy “utf8” character set only supports three‑byte sequences, causing errors with genuine four‑byte UTF‑8 characters, and shows how the proper “utf8mb4” charset resolves the issue, including historical background, practical examples, and migration guidance.

DatabaseMariaDBMySQL
0 likes · 10 min read
Never Use MySQL “utf8” – Switch to “utf8mb4” for Real UTF‑8 Support