Tagged articles
122 articles
Page 1 of 2
DeepHub IMBA
DeepHub IMBA
Apr 6, 2026 · Artificial Intelligence

Mastering Machine Learning Feature Engineering: Scaling, Encoding, Aggregation, Embedding, and Automation

The article explains why good features matter more than fancy algorithms and walks through practical techniques—scaling, log transforms, binning, interaction, various encoding schemes, datetime extraction, text statistics, geospatial distances, aggregation, feature selection, and automated feature generation—illustrated with concrete pandas and scikit‑learn code examples.

Automationencodingfeature engineering
0 likes · 16 min read
Mastering Machine Learning Feature Engineering: Scaling, Encoding, Aggregation, Embedding, and Automation
Lisa Notes
Lisa Notes
Mar 27, 2026 · Fundamentals

Python Learning Day 60: Mastering pass, while/for Loops, break‑continue and String Operations

This tutorial‑style note walks through Python’s pass statement, the mechanics of while and for loops (including nested loops and common pitfalls), the use of break and continue, and a comprehensive overview of string creation, slicing, case conversion, searching, replacement, and encoding, all illustrated with concrete code examples and expected outputs.

PythonSlicingString Manipulation
0 likes · 23 min read
Python Learning Day 60: Mastering pass, while/for Loops, break‑continue and String Operations
IT Services Circle
IT Services Circle
Dec 20, 2025 · Fundamentals

10 Common Python String Pitfalls Every Developer Should Avoid

This article enumerates ten classic mistakes when handling Python strings—covering immutability, identity vs equality, truthiness of empty values, misuse of strip and split, Unicode length, performance of concatenation, encoding issues, and trailing backslashes—to help developers write safer, more efficient code.

ComparisonImmutablePitfalls
0 likes · 7 min read
10 Common Python String Pitfalls Every Developer Should Avoid
PaperAgent
PaperAgent
Dec 13, 2025 · Artificial Intelligence

Why Unified Multimodal Models Are the Key to Next‑Gen AGI – A Deep Survey

This article surveys the latest research on Unified Multimodal Foundations (UFM), explaining why integrating understanding and generation across text, image, video, and audio is essential for AGI, and detailing modeling paradigms, encoding/decoding strategies, training pipelines, benchmarks, and real‑world applications.

AI researchBenchmarkTraining
0 likes · 10 min read
Why Unified Multimodal Models Are the Key to Next‑Gen AGI – A Deep Survey
Efficient Ops
Efficient Ops
Nov 30, 2025 · Databases

Why MySQL’s utf8 Isn’t True UTF‑8 and How utf8mb4 Solves It

MySQL’s original utf8 implementation was a limited, buggy version that only supported three‑byte characters, leading to data loss for emojis and rare symbols, so MySQL introduced utf8mb4 as a full UTF‑8 solution and now recommends it as the default encoding.

Character Setdatabasesencoding
0 likes · 5 min read
Why MySQL’s utf8 Isn’t True UTF‑8 and How utf8mb4 Solves It
php Courses
php Courses
Oct 13, 2025 · Backend Development

Mastering PHP’s strlen: How to Accurately Measure String Length

Learn how PHP’s built‑in strlen function works to return the byte length of a string, see a clear example with code, understand its output, and discover important considerations about character encoding and multibyte strings to ensure accurate length calculations.

PHPencodingstring length
0 likes · 3 min read
Mastering PHP’s strlen: How to Accurately Measure String Length
Lin is Dream
Lin is Dream
Oct 11, 2025 · Backend Development

Why Your Spring Boot File Upload Fails: Async Reading, Temp Files, and Encoding Gotchas

This article explains common pitfalls when handling file uploads in Spring Boot, including temporary file lifecycle causing FileNotFound errors in asynchronous processing, and character encoding mismatches that lead to garbled text, and provides practical solutions such as size limits, main‑thread parsing, and BOM‑based charset detection.

BOMMultipartFileencoding
0 likes · 7 min read
Why Your Spring Boot File Upload Fails: Async Reading, Temp Files, and Encoding Gotchas
Tencent Cloud Developer
Tencent Cloud Developer
Apr 17, 2025 · Fundamentals

Protobuf Encoding Principles and Optimization Techniques

The article explains how Protocol Buffers (proto3) encode basic and composite types using varint, zigzag, fixed-size and IEEE‑754 formats, describes tag and length field structures, and presents optimization strategies such as selecting size‑efficient types, flattening nested messages, and delta‑encoding to significantly reduce serialized byte‑stream size.

Data StructuresProtobufProtocol Buffers
0 likes · 15 min read
Protobuf Encoding Principles and Optimization Techniques
Lin is Dream
Lin is Dream
Mar 11, 2025 · Fundamentals

Unlocking Data: How Bits, Bytes, Hex and Encoding Shape Everything

This article explains the relationships between bits, binary, hexadecimal, bytes and strings, shows how computers store and process data using binary, demonstrates hexadecimal and Base64 encodings, provides conversion tables and examples, and clarifies how UTF‑8 encodes characters.

Base64BinaryUTF-8
0 likes · 13 min read
Unlocking Data: How Bits, Bytes, Hex and Encoding Shape Everything
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 23, 2024 · Databases

Why MySQL Strings Get Garbled: Mastering Charset and Collation

This article dives deep into MySQL's charset and collation system, explaining concepts, configuration levels, system variables, string literals, conversion rules, Unicode sorting algorithms, binary collations, and practical tips to avoid common encoding pitfalls and ensure correct string handling.

CharsetUnicodecollation
0 likes · 57 min read
Why MySQL Strings Get Garbled: Mastering Charset and Collation
Python Crawling & Data Mining
Python Crawling & Data Mining
Nov 1, 2024 · Backend Development

Fixing Chinese Character Garbling in Python Web Scraping

This article walks through a real‑world Python web‑scraping issue where Chinese characters appear as garbled text, explains why the default encoding fails, and shows how setting the response’s apparent encoding resolves the problem, complete with sample code and practical tips for posting questions.

PythonWeb Scrapingencoding
0 likes · 5 min read
Fixing Chinese Character Garbling in Python Web Scraping
Java Tech Enthusiast
Java Tech Enthusiast
Jul 27, 2024 · Fundamentals

The Story Behind the Creation of UTF-8 and Its Advantages

Rob Pike and Ken Thompson devised UTF‑8 in 1992 at Bell Labs, turning a three‑day prototype into the web’s dominant Unicode encoding by using a variable‑length, ASCII‑compatible, length‑prefixed and prefix‑free scheme that maximizes efficiency, robustness, and universal adoption across more than 96 % of sites.

UTF-8Unicodeencoding
0 likes · 6 min read
The Story Behind the Creation of UTF-8 and Its Advantages
dbaplus Community
dbaplus Community
Jun 2, 2024 · Databases

Why MySQL Emoji Inserts Fail and How utf8mb4 Fixes Them

This article explains why inserting emoji characters into a MySQL table defined with the default utf8 charset triggers a string‑value error, explores the underlying encoding and charset concepts, compares utf8mb3 and utf8mb4, and shows how to convert tables to avoid such failures.

Character SetEmojicollation
0 likes · 9 min read
Why MySQL Emoji Inserts Fail and How utf8mb4 Fixes Them
Liangxu Linux
Liangxu Linux
May 6, 2024 · Databases

Why MySQL’s “utf8” Isn’t Real UTF‑8 and How utf8mb4 Fixes It

Discover why MySQL’s legacy ‘utf8’ charset only supports three‑byte characters, causing storage errors for true UTF‑8 data, and learn how the ‘utf8mb4’ charset resolves these issues, with historical context, technical details, and migration guidance.

Character SetMariaDBencoding
0 likes · 7 min read
Why MySQL’s “utf8” Isn’t Real UTF‑8 and How utf8mb4 Fixes It
dbaplus Community
dbaplus Community
May 5, 2024 · Databases

Why MySQL’s utf8 Isn’t Real UTF‑8 and How utf8mb4 Solves Emoji Errors

The article explains why MySQL’s built‑in utf8 charset cannot store four‑byte characters such as emojis, demonstrates the resulting insert error, shows how switching the database, table and connection to utf8mb4 fixes the issue, and recounts the historical reasons behind MySQL’s limited utf8 implementation.

Character SetEmojiencoding
0 likes · 8 min read
Why MySQL’s utf8 Isn’t Real UTF‑8 and How utf8mb4 Solves Emoji Errors
Bilibili Tech
Bilibili Tech
Apr 26, 2024 · Artificial Intelligence

2024 Bilibili Technology Patent Awards – Highlights of Ten Winning Innovations

On World Intellectual Property Day, Bilibili honored ten breakthrough patents that together enable billion‑scale video duplicate detection, AI‑driven story generation, synchronized live rhythm‑games, automatic OTT casting, knowledge‑graph‑based content moderation, glitch‑free multi‑audio streaming, modular playback integration, neural‑network resolution encoding, AV1 reference‑frame pruning, and fine‑grained GPU isolation.

StreamingVideo processingartificial intelligence
0 likes · 6 min read
2024 Bilibili Technology Patent Awards – Highlights of Ten Winning Innovations
Open Source Tech Hub
Open Source Tech Hub
Feb 19, 2024 · Backend Development

How Sqids Generates Short, URL‑Safe IDs and When to Use Them

Sqids is an open‑source library that creates short, unique, URL‑safe identifiers from one or more non‑negative integers, offering use cases like link shortening, event IDs, and custom object IDs while outlining its limitations and practical PHP examples.

PHPURL-safeencoding
0 likes · 5 min read
How Sqids Generates Short, URL‑Safe IDs and When to Use Them
php Courses
php Courses
Jan 8, 2024 · Backend Development

Using PHP urlencode to Encode URLs and Non-ASCII Characters

This article explains how PHP's urlencode function encodes URLs containing special or non‑ASCII characters, provides example code for both English and Chinese strings, and discusses handling spaces with rawurlencode for more precise encoding.

Backendencodingurl-encoding
0 likes · 4 min read
Using PHP urlencode to Encode URLs and Non-ASCII Characters
Python Crawling & Data Mining
Python Crawling & Data Mining
Dec 19, 2023 · Fundamentals

How to Fix Matplotlib Title Encoding Errors in Python

This article walks through a common Matplotlib title garbling issue caused by encoding settings, explains why the 'encoding' argument triggers an AttributeError, and provides a concise code fix using font configuration to display Chinese characters correctly.

MatplotlibPythonencoding
0 likes · 3 min read
How to Fix Matplotlib Title Encoding Errors in Python
Architects' Tech Alliance
Architects' Tech Alliance
Dec 13, 2023 · Fundamentals

Evolution of PCIe Standards and Test Requirements

This article traces the evolution of the PCI Express (PCIe) standard from its 1.0 inception to the latest 6.0 specification, highlighting key differences in data rates, encoding schemes, equalization techniques, and test requirements that enable higher bandwidth and reliability for modern data‑center and AI workloads.

InterfacePCIeencoding
0 likes · 11 min read
Evolution of PCIe Standards and Test Requirements
DeWu Technology
DeWu Technology
Oct 25, 2023 · Mobile Development

Video Experience Optimization: Technical Improvements and Performance Analysis

The report details how targeted technical enhancements—refined encoding pipelines, playback logic tweaks, and start‑up caching—address mobile video challenges such as latency, stutter, and desynchronization, yielding up to 25 % less stutter, a 42.8 % drop in failures, and nearly 50 % faster first‑frame loading while outlining future testing and cross‑team optimization plans.

Mobile DevelopmentVideo Optimizationencoding
0 likes · 13 min read
Video Experience Optimization: Technical Improvements and Performance Analysis
php Courses
php Courses
Aug 21, 2023 · Backend Development

Using PHP mb_strlen() to Accurately Measure Multibyte String Lengths

This article explains how to use PHP's mb_strlen() function to accurately measure string lengths, handle multibyte characters, specify encodings, retrieve byte counts, count character occurrences, and work with different language encodings, providing clear code examples for each case.

Backend DevelopmentPHPencoding
0 likes · 4 min read
Using PHP mb_strlen() to Accurately Measure Multibyte String Lengths
Top Architect
Top Architect
May 17, 2023 · Databases

Redis Data Types Overview and Internal Implementations

An extensive guide to Redis data structures—including strings, hashes, lists, sets, sorted sets, streams, hyperloglog, geospatial, bitmap, and bitfield—detailing their use cases, underlying encodings, conversion thresholds, and internal implementation details with code examples.

CacheData Structuresencoding
0 likes · 30 min read
Redis Data Types Overview and Internal Implementations
NiuNiu MaTe
NiuNiu MaTe
May 17, 2023 · Databases

Mastering Redis HSET: Basics, Commands, and Internal Encoding

This article introduces Redis HSET, covering its definition, suitable use cases, common commands for creating, reading, updating, and deleting hash fields, as well as the underlying encoding mechanisms (ziplist vs hashtable) and best practices, including the deprecation of HMSET.

CacheHSETHash
0 likes · 7 min read
Mastering Redis HSET: Basics, Commands, and Internal Encoding
Python Crawling & Data Mining
Python Crawling & Data Mining
Apr 1, 2023 · Backend Development

How to Fix Chinese Garbled Text in Python Web Scraping

This article explains three practical methods—using response.content, apparent_encoding, and custom encode/decode—to resolve Chinese character garbling in Python web crawlers, includes code snippets and screenshots, and offers concise guidance for developers facing this common issue.

ChinesePythonencoding
0 likes · 5 min read
How to Fix Chinese Garbled Text in Python Web Scraping
ITPUB
ITPUB
Mar 15, 2023 · Databases

Why Is Redis So Fast? Deep Dive into Its History, Architecture, and Performance

This article explores Redis’s evolution from its 2009 inception, outlines major version releases, examines its in‑memory design, efficient data structures, encoding schemes, single‑threaded event loop with epoll, and benchmark results, illustrating why Redis achieves exceptionally high throughput and low latency in real‑world deployments.

BenchmarkThread Modelencoding
0 likes · 15 min read
Why Is Redis So Fast? Deep Dive into Its History, Architecture, and Performance
Laravel Tech Community
Laravel Tech Community
Jan 10, 2023 · Information Security

Dongle: A Lightweight, Semantic, Developer‑Friendly Go Library for Encoding, Decoding, and Cryptographic Operations

Dongle is a lightweight, Go-based library offering extensive encoding/decoding and cryptographic functions—including various hash, HMAC, and symmetric/asymmetric algorithms—along with recent updates such as tea‑mode support, empty‑string padding, and code restructuring, and is featured in the awesome‑go collection.

encodinginformation security
0 likes · 4 min read
Dongle: A Lightweight, Semantic, Developer‑Friendly Go Library for Encoding, Decoding, and Cryptographic Operations
21CTO
21CTO
Nov 1, 2022 · Cloud Computing

How Netflix Engineers Seamless Streaming with Cloud‑Based Encoding and CDN

Netflix delivers billions of hours of video by compressing and transcoding raw movies into multiple formats, splitting them into small chunks processed in parallel on AWS, storing them in S3, and distributing them via its custom Open Connect CDN to ensure low‑latency, high‑quality playback worldwide.

AWSCDNNetflix
0 likes · 10 min read
How Netflix Engineers Seamless Streaming with Cloud‑Based Encoding and CDN
转转QA
转转QA
Sep 6, 2022 · Backend Development

A Case Study of Troubleshooting Service Log Garbled Character Issues

This article details a step‑by‑step investigation of Java service log garbled‑character problems caused by incorrect LC_CTYPE and locale settings, describing how environment variable synchronization via SSH led to ASCII encoding defaults and outlining preventive configurations for both client and server.

Environment VariablesJavaLinux
0 likes · 9 min read
A Case Study of Troubleshooting Service Log Garbled Character Issues
ITPUB
ITPUB
Jul 20, 2022 · Databases

Master Redis Memory: 7 Proven Techniques to Store More Data with Less RAM

This article explains how Redis stores key‑value pairs, dives into the underlying dict, dictEntry and redisObject structures, and presents seven practical memory‑saving tricks—including key shortening, encoding tweaks, object sharing, bitmap usage, hash consolidation, fragmentation cleanup, and 32‑bit deployment—to dramatically reduce RAM consumption while preserving performance.

BitmapData StructuresHash
0 likes · 19 min read
Master Redis Memory: 7 Proven Techniques to Store More Data with Less RAM
ITPUB
ITPUB
May 31, 2022 · Backend Development

How Ctrip Optimized Hotel Cache Memory: From HashMap Overhead to Advanced Encoding

Facing billions of cached hotel records, Ctrip’s backend team analyzed Java object layout and HashMap memory costs, evaluated alternative structures, and applied bitmap, dictionary, run‑length and delta encodings to compress data, ultimately reducing cache size to under 2% of its original footprint.

Backendcachingdata-structures
0 likes · 24 min read
How Ctrip Optimized Hotel Cache Memory: From HashMap Overhead to Advanced Encoding
Ctrip Technology
Ctrip Technology
May 26, 2022 · Backend Development

Memory Structure Selection and Optimization for Hotel Query Service

This article examines how Ctrip's hotel query service selects and optimizes in‑memory cache structures—covering Java object layout, HashMap overhead, alternative collections, and various encoding techniques—to achieve high‑performance reads and updates while drastically reducing memory consumption.

CacheHashMapJava
0 likes · 23 min read
Memory Structure Selection and Optimization for Hotel Query Service
Sohu Tech Products
Sohu Tech Products
Mar 2, 2022 · Mobile Development

Overview of iOS Live Streaming Workflow

This article provides a comprehensive overview of the iOS live‑streaming workflow, detailing the six stages—capture, processing, encoding, packaging, network transmission, and playback—along with sample code for video/audio capture, encoding settings, and RTMP transmission.

AVFoundationMobile DevelopmentRTMP
0 likes · 18 min read
Overview of iOS Live Streaming Workflow
IT Architects Alliance
IT Architects Alliance
Dec 28, 2021 · Fundamentals

Mastering QR Codes: Theory, Encoding, Decoding, and Python Implementation

This comprehensive guide explains QR code fundamentals—including their structure, versions, error‑correction levels, and data capacity—details the full encoding pipeline from requirement analysis to matrix construction, outlines the decoding steps, explores commercial use cases, and provides practical Python examples using python‑qrcode, Amazing‑QR, and Zxing.

Error CorrectionPythonQR code
0 likes · 22 min read
Mastering QR Codes: Theory, Encoding, Decoding, and Python Implementation
ELab Team
ELab Team
Oct 19, 2021 · Fundamentals

Unlocking QR Code Secrets: Principles, Design, and Code Generation

This article explains QR code fundamentals, including its advantages, structural components, error‑correction levels, design principles, generation workflow, JavaScript implementation details, and extensions such as artistic QR codes and mini‑program codes, providing practical examples and visual illustrations.

Error CorrectionJavaScriptartistic QR
0 likes · 11 min read
Unlocking QR Code Secrets: Principles, Design, and Code Generation
Java Backend Technology
Java Backend Technology
Oct 14, 2021 · Databases

Why MySQL’s utf8 Fails with Emojis and How utf8mb4 Solves It

This article explains the difference between MySQL’s utf8 and utf8mb4 character sets, why utf8 cannot store emojis or complex Chinese characters, and provides step‑by‑step examples showing how to configure tables and columns with utf8mb4 to avoid encoding errors.

Character SetEmojiencoding
0 likes · 10 min read
Why MySQL’s utf8 Fails with Emojis and How utf8mb4 Solves It
Top Architect
Top Architect
Sep 3, 2021 · Databases

Why MySQL’s “utf8” Is Not Real UTF‑8 and You Should Switch to utf8mb4

The article explains that MySQL’s legacy “utf8” charset only supports three‑byte characters, causing errors when storing true four‑byte UTF‑8 symbols, and shows how the newer “utf8mb4” charset provides full Unicode support, with historical context and migration guidance.

Character SetMariaDBencoding
0 likes · 7 min read
Why MySQL’s “utf8” Is Not Real UTF‑8 and You Should Switch to utf8mb4
Liangxu Linux
Liangxu Linux
May 13, 2021 · Fundamentals

Why Does Text Become Garbled? A Deep Dive into UTF‑8, GBK, and Unicode

This article explains why characters appear as garbled text when encoding and decoding methods mismatch, explores how Excel defaults to GBK, shows how to convert files with iconv, and walks through the evolution from ASCII to GB2312, GBK, GB18030, and finally Unicode's UTF‑8 encoding.

Character SetGBKUTF-8
0 likes · 6 min read
Why Does Text Become Garbled? A Deep Dive into UTF‑8, GBK, and Unicode
Python Programming Learning Circle
Python Programming Learning Circle
Apr 9, 2021 · Fundamentals

Python File I/O: Reading and Writing Files

This article explains how Python interacts with the operating system to open, read, write, and close files using the built‑in open() function, various file modes, error handling, the with statement for automatic resource management, and considerations for binary data and character encodings.

encodingfile-ioreading
0 likes · 7 min read
Python File I/O: Reading and Writing Files
DeWu Technology
DeWu Technology
Jan 24, 2021 · Fundamentals

Overview of Video Container Formats and H.264 Encoding

The article outlines how video container formats such as AVI, MOV, MP4, WMV, RM, FLV and MKV package encoded streams, then explains H.264 encoding fundamentals—including I‑, P‑, B‑frames, macroblocks, GOP structure, and NAL units like SPS and PPS that define parameters for efficient compression and transport.

ContainerH.264NAL
0 likes · 10 min read
Overview of Video Container Formats and H.264 Encoding
DataFunTalk
DataFunTalk
Jan 23, 2021 · Artificial Intelligence

Feature Engineering: Mapping Raw Data to Machine‑Learning Features and Best Practices

This article explains how feature engineering transforms raw data into numerical representations for machine‑learning models, covering mapping of numeric and categorical values, one‑hot and multi‑hot encoding, sparse representations, scaling, handling outliers, binning, data quality checks, and feature interactions to capture non‑linear relationships.

data preprocessingencodingfeature engineering
0 likes · 20 min read
Feature Engineering: Mapping Raw Data to Machine‑Learning Features and Best Practices
php Courses
php Courses
Jan 21, 2021 · Backend Development

Understanding PHP json_encode Output: When Arrays Become Objects

This article explains why PHP's json_encode sometimes returns a JSON object instead of an array, demonstrates several common scenarios, and provides practical code solutions to ensure the desired array format in the resulting JSON.

ArrayJSONencoding
0 likes · 4 min read
Understanding PHP json_encode Output: When Arrays Become Objects
Laravel Tech Community
Laravel Tech Community
Jan 7, 2021 · Databases

Understanding MySQL utf8 vs utf8mb4 and How to Store Emoji Correctly

This article explains why MySQL's legacy utf8 charset cannot store four‑byte characters such as emoji, demonstrates the resulting insertion error, and shows how switching the database, table, and column collations to utf8mb4 resolves the issue while also covering the historical reasons behind MySQL's limited utf8 implementation.

Character SetEmojiencoding
0 likes · 7 min read
Understanding MySQL utf8 vs utf8mb4 and How to Store Emoji Correctly
Architecture Digest
Architecture Digest
Dec 23, 2020 · Databases

Understanding Redis Object Types and Their Underlying Data Structures

This article explains the five core Redis object types—strings, lists, hashes, sets, and sorted sets—their possible encodings, and the underlying data structures such as SDS, linked lists, hash tables, ziplists, intsets, and skiplists, including details of the redisObject layout and rehash mechanisms.

Data Structuresdatabasesencoding
0 likes · 22 min read
Understanding Redis Object Types and Their Underlying Data Structures
Tencent Database Technology
Tencent Database Technology
Dec 17, 2020 · Databases

Understanding MySQL Character Set Variables

This article explains the purpose and usage of MySQL character set system variables, illustrates how they affect client‑server communication and file handling, and provides practical examples and diagrams to help developers correctly configure character sets for optimal performance.

Database VariablesSQLencoding
0 likes · 9 min read
Understanding MySQL Character Set Variables
Ctrip Technology
Ctrip Technology
Nov 12, 2020 · Backend Development

Ctrip Live Streaming Technology Overview and Implementation

This article presents a comprehensive overview of Ctrip's live streaming solution, covering the evolution of live streaming, core streaming principles, encoding and protocol choices such as RTMP, HLS, WebRTC, the front‑end framework built with Android, ReactNative and native modules, encountered integration challenges, and the final architectural summary.

AndroidRTMPReactNative
0 likes · 12 min read
Ctrip Live Streaming Technology Overview and Implementation
TAL Education Technology
TAL Education Technology
Nov 5, 2020 · Fundamentals

Understanding URIs: History, Components, and Encoding/Decoding

This article provides a comprehensive overview of Uniform Resource Identifiers (URIs), covering their historical evolution, the relationship with URLs and URNs, the syntax defined by RFC standards, character sets, component breakdown, and practical encoding and decoding algorithms for web development.

URIURLURN
0 likes · 30 min read
Understanding URIs: History, Components, and Encoding/Decoding
Laravel Tech Community
Laravel Tech Community
Oct 13, 2020 · Backend Development

Using mb_strtolower() to Convert Strings to Lowercase in PHP

The article explains PHP's mb_strtolower() function, detailing its purpose to convert all alphabetic characters in a string to lowercase, describing its parameters, return value, and providing two illustrative examples—including handling of UTF‑8 encoded text.

PHPStringencoding
0 likes · 2 min read
Using mb_strtolower() to Convert Strings to Lowercase in PHP
Laravel Tech Community
Laravel Tech Community
Oct 12, 2020 · Backend Development

PHP mb_strlen() – Get String Length

The mb_strlen() function in PHP returns the number of characters in a string, correctly handling multibyte characters and allowing an optional encoding parameter, with examples showing differences from strlen() under various encodings.

encodingmb_strlenstring length
0 likes · 2 min read
PHP mb_strlen() – Get String Length
Laravel Tech Community
Laravel Tech Community
Aug 24, 2020 · Backend Development

Using mb_send_mail() to Send Encoded Emails in PHP

This article explains how the PHP mb_send_mail() function works as a wrapper for mail(), detailing its parameters, return values, and providing a complete example that demonstrates sending Japanese‑encoded email bodies, subjects, and headers with proper MIME settings.

BackendEmailencoding
0 likes · 3 min read
Using mb_send_mail() to Send Encoded Emails in PHP
Liangxu Linux
Liangxu Linux
Aug 1, 2020 · Databases

Why MySQL’s “utf8” Isn’t Real UTF‑8 and How utf8mb4 Fixes It

The article explains that MySQL’s legacy "utf8" charset only supports three‑byte sequences, causing errors with true four‑byte Unicode characters, and shows how the later "utf8mb4" charset provides proper UTF‑8 support along with historical context and migration guidance.

Character SetMariaDBencoding
0 likes · 8 min read
Why MySQL’s “utf8” Isn’t Real UTF‑8 and How utf8mb4 Fixes It
Aikesheng Open Source Community
Aikesheng Open Source Community
Jul 22, 2020 · Databases

Understanding and Resolving MySQL Character Set and Data Corruption Issues

This article explains why MySQL data can become garbled due to mismatched character sets during insertion or retrieval, explores various scenarios such as client‑side encoding errors, mixed encodings within tables, and LATIN1 storage, and provides detailed solutions including setting proper client encodings, using strict SQL_MODE, and exporting/importing data to unify character sets.

Character SetData CorruptionSQL
0 likes · 15 min read
Understanding and Resolving MySQL Character Set and Data Corruption Issues
Top Architect
Top Architect
Jul 21, 2020 · Databases

Why MySQL’s “utf8” Is Not Real UTF‑8 and You Should Use utf8mb4

The article explains that MySQL’s legacy “utf8” character set only supports three‑byte sequences and therefore cannot store true UTF‑8 characters, describes the historical reasons behind this limitation, and advises all MySQL/MariaDB users to migrate to the proper utf8mb4 charset.

Character SetMariaDBdatabase
0 likes · 12 min read
Why MySQL’s “utf8” Is Not Real UTF‑8 and You Should Use utf8mb4
FunTester
FunTester
Jul 3, 2020 · Backend Development

Handling Unicode Encoding Issues and Database Transaction Rollback in Java Services

The article explains a character‑encoding pitfall caused by a Python middle‑layer converting parameters to Unicode, provides a Java utility to decode escaped Unicode strings, and demonstrates how to use Spring's @Transactional annotation with rollbackFor to ensure database operations are rolled back on errors.

JavaUnicodeencoding
0 likes · 4 min read
Handling Unicode Encoding Issues and Database Transaction Rollback in Java Services
Architecture Digest
Architecture Digest
Mar 8, 2020 · Databases

MySQL Encoding Process and Character Set Handling

This article explains how MySQL’s character_set parameters such as character_set_client and character_set_results control the encoding and decoding of client commands and query results, illustrates common pitfalls with UTF‑8, GBK and Latin‑1, and provides practical commands to avoid garbled text.

Character SetGBKUTF-8
0 likes · 10 min read
MySQL Encoding Process and Character Set Handling
ITPUB
ITPUB
Feb 28, 2020 · Databases

Why MySQL Shows Garbled Text and How to Fix It

This article explains the root causes of MySQL character‑set garbling, illustrates the encoding/decoding steps during data insertion and retrieval, and provides practical methods—including dump‑reload and binary conversion—to prevent and repair corrupted data.

Character SetData MigrationSQL
0 likes · 13 min read
Why MySQL Shows Garbled Text and How to Fix It
360 Quality & Efficiency
360 Quality & Efficiency
Feb 21, 2020 · Fundamentals

Key Differences Between Python 2 and Python 3

This article outlines the historical development of Python, explains why Python 2 reached end‑of‑life in 2020, and details the major syntactic and functional differences between Python 2 and Python 3, including encoding defaults, exception handling, range functions, print statements, and input behavior, concluding with guidance on choosing a version.

Pythonencodingprogramming
0 likes · 8 min read
Key Differences Between Python 2 and Python 3
Sohu Tech Products
Sohu Tech Products
Dec 25, 2019 · Fundamentals

Understanding ASCII: Standards, Python and Rust Implementations

This article introduces the ASCII encoding standard defined in RFC20, explains its binary mapping and character ranges, demonstrates Python's built‑in ord/chr functions, and provides a custom Rust crate implementing equivalent ASCII conversion utilities with testing examples.

ASCIIPythonRFC20
0 likes · 9 min read
Understanding ASCII: Standards, Python and Rust Implementations
360 Quality & Efficiency
360 Quality & Efficiency
Oct 17, 2019 · Fundamentals

Why URL Encoding Is Essential for Reliable Web Requests

This article explains why URLs and their parameters must be encoded during testing, covering readability, completeness, reserved characters, non‑ASCII symbols, and unsafe characters, and provides practical examples to ensure reliable data transmission and correct parameter parsing.

HTTPURL encodingdata integrity
0 likes · 4 min read
Why URL Encoding Is Essential for Reliable Web Requests
ITPUB
ITPUB
Oct 10, 2019 · Databases

Why MySQL’s “utf8” Isn’t Real UTF‑8 and How to Switch to utf8mb4

The article explains that MySQL’s legacy “utf8” charset only supports three‑byte characters, causing errors when storing true four‑byte UTF‑8 symbols like emojis, and shows why switching to the proper “utf8mb4” charset is essential for correct Unicode handling.

Character SetMariaDBUnicode
0 likes · 8 min read
Why MySQL’s “utf8” Isn’t Real UTF‑8 and How to Switch to utf8mb4