Tagged articles
56 articles
Page 1 of 1
Senior Brother's Insights
Senior Brother's Insights
May 13, 2025 · Databases

Debunking MySQL VARCHAR Myths: Why 255 Isn’t the Magic Number

This article examines common misconceptions about MySQL VARCHAR—why 255 was historically used, the real maximum length, performance differences between short and long definitions, character‑set impacts on storage, and why aligning sizes to powers of two offers no real benefit.

character encodingmysqlperformance
0 likes · 13 min read
Debunking MySQL VARCHAR Myths: Why 255 Isn’t the Magic Number
Liangxu Linux
Liangxu Linux
Jul 14, 2024 · Fundamentals

Decoding Chinese Text: ASCII, GB2312, GBK, GB18030, and UTF‑8 Explained

This article explains how computer text is represented by assigning unique numeric codes to characters and converting those codes into binary, then compares the most common Chinese encodings—ASCII, GB2312, GBK, GB18030, and UTF‑8—detailing their compatibility, byte lengths, and practical impact on software development.

ASCIIGB18030GB2312
0 likes · 14 min read
Decoding Chinese Text: ASCII, GB2312, GBK, GB18030, and UTF‑8 Explained
Java Tech Enthusiast
Java Tech Enthusiast
Apr 21, 2024 · Fundamentals

Decoding Binary UTF-8 Signage in a Public Restroom Using Java

The article explains how a binary message on a multilingual public‑restroom sign was decoded by identifying UTF‑8 byte patterns, extracting the first 24 bits to reveal the Chinese character “向”, and providing a Java program that parses the entire bit string into readable Chinese text.

JavaUTF-8Unicode
0 likes · 4 min read
Decoding Binary UTF-8 Signage in a Public Restroom Using Java
php Courses
php Courses
Dec 13, 2023 · Backend Development

Optimizing Web Page Encoding and Character Conversion with PHP

This article explains how to set the correct character encoding for web pages and demonstrates PHP functions such as meta charset, iconv, urlencode/urldecode, and mysqli_real_escape_string with practical code examples to ensure proper display and safe database insertion.

IConvMySQLiPHP
0 likes · 5 min read
Optimizing Web Page Encoding and Character Conversion with PHP
Sohu Tech Products
Sohu Tech Products
Dec 6, 2023 · Frontend Development

The Nuances of Base64 Encoding Strings in JavaScript

The article explains that JavaScript’s native btoa() and atob() functions only handle ASCII, so to correctly base64‑encode Unicode strings you must convert them with TextEncoder to UTF‑8 bytes, use Uint8Array, and decode with TextDecoder, while checking for malformed surrogate pairs via isWellFormed or encodeURIComponent to avoid silent data loss.

Base64JavaScriptTextDecoder
0 likes · 14 min read
The Nuances of Base64 Encoding Strings in JavaScript
Architect's Guide
Architect's Guide
Sep 15, 2023 · Databases

Why You Should Never Use MySQL “utf8” and Switch to “utf8mb4”

The article explains that MySQL’s legacy “utf8” charset only supports three‑byte characters, causing errors with genuine four‑byte UTF‑8 symbols, and advises all MySQL/MariaDB users to migrate to the proper “utf8mb4” charset using available conversion guides.

MariaDBcharacter encodingmysql
0 likes · 7 min read
Why You Should Never Use MySQL “utf8” and Switch to “utf8mb4”
Liangxu Linux
Liangxu Linux
Jul 17, 2023 · Fundamentals

Mastering Character Encodings: From ANSI to UTF‑8 and Beyond

This guide explains the essential character set encodings—ANSI, ASCII, GB2312/GBK/GB18030, Unicode planes, UTF‑16, UTF‑32, and UTF‑8—and shows how they are used in MFC and Qt, providing code examples to avoid garbled text in software.

ANSIMFCQt
0 likes · 8 min read
Mastering Character Encodings: From ANSI to UTF‑8 and Beyond
Sohu Tech Products
Sohu Tech Products
Jul 12, 2023 · Fundamentals

The Mystery of Character Encoding: Unicode, UTF‑8, UTF‑16, GBK and Emoji

This article explains the fundamentals of character encoding, covering Unicode’s universal character set, the structure of its planes and surrogate areas, the variable‑length UTF‑8 and UTF‑16 encodings, Chinese‑specific GBK encoding, and practical iOS code examples for handling Unicode, emojis and regular‑expression based Chinese character detection.

EmojiGBKUTF-8
0 likes · 12 min read
The Mystery of Character Encoding: Unicode, UTF‑8, UTF‑16, GBK and Emoji
Alipay Experience Technology
Alipay Experience Technology
Apr 23, 2023 · Fundamentals

Decoding Emoji: Unicode, Variants, and JavaScript Handling

This article explains how emojis are represented in Unicode, covering basic code points, variation selectors, skin‑tone modifiers, zero‑width joiners, flag ligatures, tag sequences and keycap symbols, and shows how JavaScript can correctly process them using grapheme‑cluster techniques.

EmojiUnicodecharacter encoding
0 likes · 16 min read
Decoding Emoji: Unicode, Variants, and JavaScript Handling
Yunxuetang Frontend Team
Yunxuetang Frontend Team
Apr 14, 2023 · Frontend Development

Top Front-End Topics: Encoding, Mobile Adaptation, ES6, Pinia, Git & Memory

This article curates essential front‑end resources covering character encoding fundamentals, mobile screen adaptation strategies, practical ES6 tricks, Pinia state‑management advantages over Vuex, classic Git operations for code merging, and crucial browser memory knowledge for stable web applications.

browser memorycharacter encodinges6
0 likes · 4 min read
Top Front-End Topics: Encoding, Mobile Adaptation, ES6, Pinia, Git & Memory
Top Architect
Top Architect
Dec 22, 2022 · Databases

Understanding MySQL utf8 Limitations and Why You Should Use utf8mb4

This article explains why MySQL's traditional utf8 charset cannot store 4‑byte characters such as emojis, demonstrates the resulting insert errors, and shows how switching the database, system, and column collations to utf8mb4 resolves the issue while also providing a brief history of MySQL's charset implementation.

SQLcharacter encodingmysql
0 likes · 9 min read
Understanding MySQL utf8 Limitations and Why You Should Use utf8mb4
Tencent Cloud Developer
Tencent Cloud Developer
May 17, 2022 · Fundamentals

A Comprehensive History and Overview of Character Encoding and Unicode

The article traces character encoding from early telegraph and Morse code through ASCII, ISO national sets and Chinese standards, explains Unicode’s unification and its UTF‑8/‑16/‑32 forms, and shows how modern languages—especially JavaScript—handle code points, highlighting the cultural and technical significance for developers.

ASCIIJavaScriptUTF-16
0 likes · 31 min read
A Comprehensive History and Overview of Character Encoding and Unicode
macrozheng
macrozheng
Feb 8, 2021 · Fundamentals

Why Do You See “锟斤拷” in Text? Uncover the Encoding Mystery

This article explains how character encoding works, using ASCII, Unicode, UTF‑8 and GBK examples to reveal why the garbled string “锟斤拷” appears when mismatched encodings are processed, and shows the underlying byte‑level transformations.

ASCIIGBKUTF-8
0 likes · 4 min read
Why Do You See “锟斤拷” in Text? Uncover the Encoding Mystery
Seewo Tech Circle
Seewo Tech Circle
Aug 30, 2019 · Fundamentals

Demystifying Character Encoding: From ASCII to Unicode and Beyond

This article explains the fundamentals of character encoding, covering concepts such as information, symbols, character sets, various encoding schemes like ASCII, GB2312, UTF‑8, Unicode planes, common pitfalls, and practical examples to help developers avoid garbled text.

GB2312UTF-8Unicode
0 likes · 9 min read
Demystifying Character Encoding: From ASCII to Unicode and Beyond
21CTO
21CTO
May 13, 2019 · Databases

Why MySQL’s ‘utf8’ Isn’t Real UTF‑8 and How utf8mb4 Fixes It

Discover why MySQL’s legacy ‘utf8’ charset only supports three‑byte characters, causing storage errors for true UTF‑8 data, and learn how switching to the proper ‘utf8mb4’ charset resolves these issues, with a brief history and practical migration guidance.

MariaDBUTF-8character encoding
0 likes · 7 min read
Why MySQL’s ‘utf8’ Isn’t Real UTF‑8 and How utf8mb4 Fixes It
JD Tech
JD Tech
Dec 18, 2018 · Fundamentals

Understanding Character Encoding: Bits, Bytes, Unicode, UTF-8, UTF-16, and UTF-32

This article explains the origins of character sets, the relationships among various encodings such as ASCII, GB2312, GBK, GB18030, Unicode, UTF-8, UTF-16, and UTF-32, and shows how JavaScript handles Unicode and emoji characters, including practical code examples and solutions for length‑limited input fields.

UTF-16UTF-8Unicode
0 likes · 11 min read
Understanding Character Encoding: Bits, Bytes, Unicode, UTF-8, UTF-16, and UTF-32
Java Captain
Java Captain
Jul 2, 2018 · Databases

How to Fully Resolve MySQL Chinese Character Encoding Issues

This guide explains why MySQL tables default to latin1, demonstrates how to check and change table, session, and global character set settings, and shows the necessary my.cnf configuration changes to permanently enable UTF‑8 support for storing Chinese characters.

ChineseDatabase ConfigurationUTF-8
0 likes · 6 min read
How to Fully Resolve MySQL Chinese Character Encoding Issues
MaGe Linux Operations
MaGe Linux Operations
Jan 15, 2018 · Fundamentals

Mastering Character Encoding in Python: From ASCII to UTF‑8

This article explains the fundamental concepts of characters, character sets, and encodings, compares common encodings such as ASCII, Unicode, and UTF‑8, and shows how Python 2 and Python 3 handle default encodings, string types, and common Unicode errors with practical code examples.

UTF-8Unicodecharacter encoding
0 likes · 14 min read
Mastering Character Encoding in Python: From ASCII to UTF‑8
Tencent IMWeb Frontend Team
Tencent IMWeb Frontend Team
Sep 13, 2017 · Backend Development

Mastering Character Encoding in Node.js: Avoid Garbled Text

This article introduces the fundamentals of character encoding and decoding, explains the difference between character sets and encodings, and demonstrates practical Node.js examples using the iconv‑lite library to correctly encode and decode Chinese text, helping developers prevent common garbled‑text issues.

Backend DevelopmentGBKNode.js
0 likes · 7 min read
Mastering Character Encoding in Node.js: Avoid Garbled Text
ITPUB
ITPUB
Sep 19, 2016 · Fundamentals

Understanding Character Encoding: From ASCII to Unicode and UTF‑8

This article explains the fundamentals of character encoding, covering the evolution from the 7‑bit ASCII standard to Chinese GB2312, the development of Unicode and UTF‑8, and provides practical guidance for handling these encodings in Windows and Linux C programs, including a sample UTF‑8 detection function.

ASCIIC programmingGB2312
0 likes · 13 min read
Understanding Character Encoding: From ASCII to Unicode and UTF‑8
21CTO
21CTO
Mar 31, 2016 · Databases

Why Does MySQL Produce Garbled Text? Causes, Prevention, and Fixes

This article explains the multiple encoding/decoding steps MySQL performs when storing and retrieving data, identifies why mismatched character sets cause mojibake, and provides practical methods to avoid and repair corrupted text in MySQL tables.

character encodingcharset conversiondata repair
0 likes · 17 min read
Why Does MySQL Produce Garbled Text? Causes, Prevention, and Fixes
dbaplus Community
dbaplus Community
Jan 7, 2016 · Databases

Mastering Character Sets: Diagnose and Fix MySQL Garbled Text

This article explains character sets and encodings, shows why MySQL data can become garbled, and provides step‑by‑step methods—including proper configuration, common pitfalls, and reliable repair techniques—to prevent and correct encoding issues such as emoji storage failures.

EmojiGBKUTF-8
0 likes · 24 min read
Mastering Character Sets: Diagnose and Fix MySQL Garbled Text
21CTO
21CTO
Sep 29, 2015 · Fundamentals

Why Chinese Text Gets Garbled and How to Fix It – A Practical Encoding Guide

This article explains why Chinese characters often appear as mojibake on Windows and Linux, introduces the history and technical details of ASCII, GB2312, GBK, GB18030 and Unicode, and provides concrete examples and command‑line tools for inspecting and converting file encodings.

C programmingGB2312UTF-8
0 likes · 26 min read
Why Chinese Text Gets Garbled and How to Fix It – A Practical Encoding Guide
21CTO
21CTO
Aug 14, 2015 · Fundamentals

Why Web Pages Show Garbled Text: Charsets, Encodings & HTTP Headers

This article explains how computers store and display characters using binary, defines character sets and encodings such as ASCII, GB2312, GBK, GB18030, BIG5 and Unicode, compares UTF‑8, UTF‑16, UTF‑32, and describes related HTTP headers like Accept‑Charset, Content‑Type, and Content‑Encoding.

ASCIICharsetGB18030
0 likes · 20 min read
Why Web Pages Show Garbled Text: Charsets, Encodings & HTTP Headers
MaGe Linux Operations
MaGe Linux Operations
Mar 17, 2015 · Databases

Why MySQL Shows Garbled Text and How to Fix It

This article explains the root causes of MySQL character‑set garbling, illustrates the encoding/decoding steps during data insertion and retrieval, and provides practical methods—including dump‑reload and binary conversion—to prevent and repair corrupted text.

SQLcharacter encodinggarbled text
0 likes · 19 min read
Why MySQL Shows Garbled Text and How to Fix It