Tagged articles
72 articles
Lin is Dream
Mar 11, 2025 · Fundamentals

Unlocking Data: How Bits, Bytes, Hex and Encoding Shape Everything

This article explains the relationships between bits, binary, hexadecimal, bytes and strings, shows how computers store and process data using binary, demonstrates hexadecimal and Base64 encodings, provides conversion tables and examples, and clarifies how UTF‑8 encodes characters.

Base64 · Binary · UTF-8
0 likes · 13 min read
ITPUB
Jan 18, 2025 · Cloud Native

Prometheus 3.0 Unveiled: New UI, Remote‑Write 2.0, and Native Histograms

Prometheus 3.0, the first major release in seven years, introduces a rebuilt UI, Remote‑Write 2.0 with richer metadata, full UTF‑8 support, native OpenTelemetry ingestion, experimental native histograms, performance gains, and a set of breaking changes that require careful migration.

Cloud Native · Native Histograms · Prometheus
0 likes · 8 min read
Java Tech Enthusiast
Jul 27, 2024 · Fundamentals

The Story Behind the Creation of UTF-8 and Its Advantages

Rob Pike and Ken Thompson devised UTF‑8 in 1992 at Bell Labs, turning a three‑day prototype into the web’s dominant Unicode encoding. Its variable‑length, ASCII‑compatible, length‑prefixed and prefix‑free scheme is efficient and robust, and it is now used by more than 96% of sites.

UTF-8 · Unicode · encoding
0 likes · 6 min read
Liangxu Linux
Jul 14, 2024 · Fundamentals

Decoding Chinese Text: ASCII, GB2312, GBK, GB18030, and UTF‑8 Explained

This article explains how computer text is represented by assigning unique numeric codes to characters and converting those codes into binary, then compares the most common Chinese encodings—ASCII, GB2312, GBK, GB18030, and UTF‑8—detailing their compatibility, byte lengths, and practical impact on software development.

ASCII · GB18030 · GB2312
0 likes · 14 min read
Java Tech Enthusiast
Apr 21, 2024 · Fundamentals

Decoding Binary UTF-8 Signage in a Public Restroom Using Java

The article explains how a binary message on a multilingual public‑restroom sign was decoded by identifying UTF‑8 byte patterns, extracting the first 24 bits to reveal the Chinese character “向”, and providing a Java program that parses the entire bit string into readable Chinese text.

Java · UTF-8 · Unicode
0 likes · 4 min read
Liangxu Linux
Jul 17, 2023 · Fundamentals

Mastering Character Encodings: From ANSI to UTF‑8 and Beyond

This guide explains the essential character set encodings—ANSI, ASCII, GB2312/GBK/GB18030, Unicode planes, UTF‑16, UTF‑32, and UTF‑8—and shows how they are used in MFC and Qt, providing code examples to avoid garbled text in software.

ANSI · MFC · Qt
0 likes · 8 min read
Sohu Tech Products
Jul 12, 2023 · Fundamentals

The Mystery of Character Encoding: Unicode, UTF‑8, UTF‑16, GBK and Emoji

This article explains the fundamentals of character encoding, covering Unicode’s universal character set, the structure of its planes and surrogate areas, the variable‑length UTF‑8 and UTF‑16 encodings, Chinese‑specific GBK encoding, and practical iOS code examples for handling Unicode, emojis and regular‑expression based Chinese character detection.

Emoji · GBK · UTF-8
0 likes · 12 min read
Laravel Tech Community
Dec 28, 2022 · Information Security

Apache SpamAssassin 4.0 – New Features and Improvements

Apache SpamAssassin 4.0 introduces comprehensive Unicode support, enhanced geolocation, improved Bayesian filtering for non‑English mail, better SSL client certificate handling, new DKIM/SPF and URL‑expansion plugins, and an ExtractText plugin for attachment analysis, representing a major upgrade over the 3.4 series.

Bayesian Filtering · SpamAssassin · UTF-8
0 likes · 3 min read
Tencent Cloud Developer
May 17, 2022 · Fundamentals

A Comprehensive History and Overview of Character Encoding and Unicode

The article traces character encoding from early telegraph and Morse code through ASCII, ISO national sets and Chinese standards, explains Unicode’s unification and its UTF‑8/‑16/‑32 forms, and shows how modern languages—especially JavaScript—handle code points, highlighting the cultural and technical significance for developers.

ASCII · JavaScript · UTF-16
0 likes · 31 min read
Programmer DD
Apr 19, 2022 · Backend Development

Why Java 8 Switched String Storage to byte[] and How It Saves Memory

The article explains how Java changed the internal representation of String from a char[] to a byte[] to reduce memory consumption (the Compact Strings change, which shipped in JDK 9 rather than Java 8), the role of Latin‑1 encoding, the impact on garbage collection, and why UTF‑16 remains the practical choice for Java strings.

Java · Memory Optimization · String
0 likes · 8 min read
21CTO
Mar 23, 2022 · Fundamentals

Java 18: New Features That Accelerate Development

Java 18, released by Oracle as a non‑LTS incremental update, introduces a simple web server, enhanced API documentation with syntax‑highlighted code snippets, and switches the default charset to UTF‑8, offering developers faster testing, clearer docs, and consistent encoding across platforms.

API documentation · Java · Java 18
0 likes · 4 min read
Liangxu Linux
May 13, 2021 · Fundamentals

Why Does Text Become Garbled? A Deep Dive into UTF‑8, GBK, and Unicode

This article explains why characters appear as garbled text when encoding and decoding methods mismatch, explores how Excel defaults to GBK, shows how to convert files with iconv, and walks through the evolution from ASCII to GB2312, GBK, GB18030, and finally Unicode's UTF‑8 encoding.

Character Set · GBK · UTF-8
0 likes · 6 min read
Huawei Cloud Developer Alliance
Feb 23, 2021 · Backend Development

How to Fix Chinese Garbled Text in IntelliJ IDEA Console: 4 Proven Steps

This guide explains four practical methods—adjusting Tomcat VM options, setting global and project encodings, customizing IDEA VM options, and editing configuration files—to resolve Chinese character garbling in the IntelliJ IDEA console, with step‑by‑step instructions and screenshots.

Chinese encoding · Console garbled text · IDE configuration
0 likes · 4 min read
ByteFE
Feb 10, 2021 · Frontend Development

Handling Unicode and Supplementary Characters in JavaScript

This article explains how JavaScript processes Unicode characters, demonstrates the limitations of legacy APIs like charCodeAt and fromCharCode with supplementary characters, and introduces modern methods such as codePointAt, fromCodePoint, Unicode escape syntax, surrogate pairs, and polyfills for full Unicode support.

JavaScript · Surrogate Pair · UTF-8
0 likes · 10 min read
macrozheng
Feb 8, 2021 · Fundamentals

Why Do You See “锟斤拷” in Text? Uncover the Encoding Mystery

This article explains how character encoding works, using ASCII, Unicode, UTF‑8 and GBK examples to reveal why the garbled string “锟斤拷” appears when mismatched encodings are processed, and shows the underlying byte‑level transformations.

ASCII · GBK · UTF-8
0 likes · 4 min read
Laravel Tech Community
Jul 7, 2020 · Backend Development

PHP rawurldecode Function and Custom UTF-8 URL Decoding

This article explains PHP's rawurldecode() function for decoding URL‑encoded strings, shows its signature, parameters, return value, provides a simple usage example, notes UTF‑8 considerations, and presents a custom utf8RawUrlDecode() function for handling non‑standard %uXXXX sequences.

Backend · PHP · URL decoding
0 likes · 3 min read
Architecture Digest
Mar 8, 2020 · Databases

MySQL Encoding Process and Character Set Handling

This article explains how MySQL’s character_set parameters such as character_set_client and character_set_results control the encoding and decoding of client commands and query results, illustrates common pitfalls with UTF‑8, GBK and Latin‑1, and provides practical commands to avoid garbled text.

Character Set · GBK · UTF-8
0 likes · 10 min read
Seewo Tech Circle
Aug 30, 2019 · Fundamentals

Demystifying Character Encoding: From ASCII to Unicode and Beyond

This article explains the fundamentals of character encoding, covering concepts such as information, symbols, character sets, various encoding schemes like ASCII, GB2312, UTF‑8, Unicode planes, common pitfalls, and practical examples to help developers avoid garbled text.

GB2312 · UTF-8 · Unicode
0 likes · 9 min read
21CTO
May 13, 2019 · Databases

Why MySQL’s ‘utf8’ Isn’t Real UTF‑8 and How utf8mb4 Fixes It

Discover why MySQL’s legacy ‘utf8’ charset stores at most three bytes per character, so four‑byte UTF‑8 characters such as emoji cause storage errors, and learn how switching to the proper ‘utf8mb4’ charset resolves these issues, with a brief history and practical migration guidance.

MariaDB · UTF-8 · character encoding
0 likes · 7 min read
JD Tech
Dec 18, 2018 · Fundamentals

Understanding Character Encoding: Bits, Bytes, Unicode, UTF-8, UTF-16, and UTF-32

This article explains the origins of character sets, the relationships among various encodings such as ASCII, GB2312, GBK, GB18030, Unicode, UTF-8, UTF-16, and UTF-32, and shows how JavaScript handles Unicode and emoji characters, including practical code examples and solutions for length‑limited input fields.

UTF-16 · UTF-8 · Unicode
0 likes · 11 min read
Java Captain
Jul 2, 2018 · Databases

How to Fully Resolve MySQL Chinese Character Encoding Issues

This guide explains why MySQL tables default to latin1, demonstrates how to check and change table, session, and global character set settings, and shows the necessary my.cnf configuration changes to permanently enable UTF‑8 support for storing Chinese characters.

Chinese · Database Configuration · UTF-8
0 likes · 6 min read
Tencent Music Tech Team
Feb 9, 2018 · Mobile Development

Understanding String Encoding in Android JNI: From Native Crash to Source Code Analysis

This article investigates an Android JNI native crash caused by misusing NewString(), examines why a custom UTF‑8‑to‑UTF‑16 conversion was used instead of NewStringUTF(), compares Dalvik and ART string encodings, reveals a Dalvik UTF‑8 conversion bug fixed in ART, and advises developers on encoding nuances across Android versions.

ART · Android · Dalvik
0 likes · 26 min read
MaGe Linux Operations
Jan 15, 2018 · Fundamentals

Mastering Character Encoding in Python: From ASCII to UTF‑8

This article explains the fundamental concepts of characters, character sets, and encodings, compares common encodings such as ASCII, Unicode, and UTF‑8, and shows how Python 2 and Python 3 handle default encodings, string types, and common Unicode errors with practical code examples.

UTF-8 · Unicode · character encoding
0 likes · 14 min read
DevOps
Jan 7, 2018 · Fundamentals

Resolving Chinese Character Encoding Issues in Git on Windows

This article explains why Git on Windows often displays garbled Chinese characters in tools like Git Bash, CMD, PowerShell, and IDEs, and provides step‑by‑step solutions—including Git config changes and environment variable adjustments—to enforce UTF‑8 encoding and eliminate the problem.

Git · UTF-8 · Windows
0 likes · 6 min read
Tencent IMWeb Frontend Team
Sep 13, 2017 · Backend Development

Mastering Character Encoding in Node.js: Avoid Garbled Text

This article introduces the fundamentals of character encoding and decoding, explains the difference between character sets and encodings, and demonstrates practical Node.js examples using the iconv‑lite library to correctly encode and decode Chinese text, helping developers prevent common garbled‑text issues.

Backend Development · GBK · Node.js
0 likes · 7 min read
Aotu Lab
Jun 30, 2017 · Fundamentals

Why Do escape and encodeURI Encode URLs Differently? Explore Percent-Encoding

This article explains the differences between JavaScript’s escape, encodeURI, and encodeURIComponent functions, detailing their encoding rules, percent‑encoding standards, reserved and unreserved characters, and how Unicode characters are transformed into UTF‑8 byte sequences, while also covering ASCII, Unicode, and UTF‑8 fundamentals.

JavaScript · URL encoding · UTF-8
0 likes · 11 min read
ITPUB
Jun 21, 2017 · Databases

Why MySQL Data Gets Garbled: A Step‑by‑Step Encoding Journey

This article explains the three‑stage encoding and decoding flow when inserting and retrieving data in MySQL, identifies why mismatched character sets cause garbled text, and shows how to ensure consistent charset settings across client, server, and tables.

Character Set · GBK · UTF-8
0 likes · 5 min read
ITPUB
Mar 21, 2017 · Databases

How to Fix Chinese Character Garbling When Migrating Oracle to MySQL

When moving data from Oracle to MySQL, Chinese characters may appear as garbled text because the server uses a latin1 charset; this guide explains three levels of fixes—table, database, and server—by configuring UTF‑8 settings and verifying the changes.

Character Set · Chinese · Data Migration
0 likes · 5 min read
ITPUB
Sep 19, 2016 · Fundamentals

Understanding Character Encoding: From ASCII to Unicode and UTF‑8

This article explains the fundamentals of character encoding, covering the evolution from the 7‑bit ASCII standard to Chinese GB2312, the development of Unicode and UTF‑8, and provides practical guidance for handling these encodings in Windows and Linux C programs, including a sample UTF‑8 detection function.

ASCII · C programming · GB2312
0 likes · 13 min read
ITPUB
Jul 11, 2016 · Backend Development

How to Fix Chinese Character Garbling in JSP, Servlets, and Java Web Apps

This guide explains five common scenarios where Chinese characters appear garbled in Java web applications—JSP pages, servlets, POST form submissions, URI parameters, and properties files—and provides step‑by‑step configurations such as setting pageEncoding, response content type, request character encoding, Tomcat URIEncoding, and using native2ascii.

JSP · Java · Servlet
0 likes · 5 min read
dbaplus Community
Jan 7, 2016 · Databases

Mastering Character Sets: Diagnose and Fix MySQL Garbled Text

This article explains character sets and encodings, shows why MySQL data can become garbled, and provides step‑by‑step methods—including proper configuration, common pitfalls, and reliable repair techniques—to prevent and correct encoding issues such as emoji storage failures.

Emoji · GBK · UTF-8
0 likes · 24 min read
21CTO
Jan 4, 2016 · Fundamentals

Why Chinese Text Gets Garbled and How to Fix It: A Deep Dive into Encoding Standards

This article explains why Chinese characters often appear as garbled text on Windows and Linux, introduces the history and hierarchy of Chinese encoding standards such as GB2312, GBK, GB18030 and Unicode, compares ASCII, UTF‑8/16/32, shows practical command‑line experiments, and offers guidance for handling Chinese text in C and Python programs.

C · GB2312 · Python
0 likes · 25 min read
21CTO
Sep 29, 2015 · Fundamentals

Why Chinese Text Gets Garbled and How to Fix It – A Practical Encoding Guide

This article explains why Chinese characters often appear as mojibake on Windows and Linux, introduces the history and technical details of ASCII, GB2312, GBK, GB18030 and Unicode, and provides concrete examples and command‑line tools for inspecting and converting file encodings.

C programming · GB2312 · UTF-8
0 likes · 26 min read
21CTO
Aug 14, 2015 · Fundamentals

Why Web Pages Show Garbled Text: Charsets, Encodings & HTTP Headers

This article explains how computers store and display characters using binary, defines character sets and encodings such as ASCII, GB2312, GBK, GB18030, BIG5 and Unicode, compares UTF‑8, UTF‑16, UTF‑32, and describes related HTTP headers like Accept‑Charset, Content‑Type, and Content‑Encoding.

ASCII · Charset · GB18030
0 likes · 20 min read
MaGe Linux Operations
Jun 25, 2014 · Databases

How to Change MySQL Character Set to UTF-8 on Linux

This guide walks through locating MySQL's configuration file on Linux, copying a sample cnf, editing it to set default-character-set=utf8 for client and server, restarting MySQL, and verifying the change with SHOW VARIABLES, plus alternative methods via SQL commands and database options.

Character Set · Linux · UTF-8
0 likes · 8 min read