Programmer DD
Programmer DD
Jul 22, 2020 · Fundamentals

Why Java’s char Can’t Represent All Unicode Characters – Understanding UTF‑16 and Code Points

This article explains how Java’s char type stores Unicode code units in UTF‑16, why its range of \u0000 to \uffff limits direct representation of newer Unicode characters, and how methods like String.length, getBytes, and code‑point APIs help handle multi‑byte characters such as emojis and rare Chinese glyphs.

Code PointsUTF-16Unicode
0 likes · 10 min read
Why Java’s char Can’t Represent All Unicode Characters – Understanding UTF‑16 and Code Points