
Chapter 1: CJKV Information Processing Overview
0213:2004 are examples of legacy character set standards that include two or more planes.
Obviously, Row-Cell notation must be expanded to include the notion of plane, meaning
Plane-Row-Cell. In Japanese, this is expressed as 面区点 (menkuten) and is used as the
preferred notation for referencing the characters in JIS X 0213:2004.
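To make the notation concrete, here is a minimal Python sketch that parses and formats Plane-Row-Cell strings. The names PlaneRowCell, parse_prc, and format_prc are hypothetical, introduced only for illustration. The sketch assumes that rows and cells each range from 1 to 94, and that a plain Row-Cell code refers to plane 1.

```python
from typing import NamedTuple


class PlaneRowCell(NamedTuple):
    """A character position written as Plane-Row-Cell (hypothetical helper type)."""
    plane: int
    row: int
    cell: int


def parse_prc(text: str) -> PlaneRowCell:
    """Parse "P-RR-CC" or "RR-CC" into a PlaneRowCell.

    Assumption: a two-part Row-Cell code is treated as plane 1.
    Assumption: rows and cells are limited to 1..94.
    """
    parts = [int(p) for p in text.split("-")]
    if len(parts) == 2:
        parts.insert(0, 1)
    if len(parts) != 3:
        raise ValueError(f"expected Row-Cell or Plane-Row-Cell, got {text!r}")
    plane, row, cell = parts
    if plane < 1 or not (1 <= row <= 94) or not (1 <= cell <= 94):
        raise ValueError(f"value out of range in {text!r}")
    return PlaneRowCell(plane, row, cell)


def format_prc(prc: PlaneRowCell) -> str:
    """Format as decimal Plane-Row-Cell, zero-padding row and cell to two digits."""
    return f"{prc.plane}-{prc.row:02d}-{prc.cell:02d}"


print(format_prc(parse_prc("16-01")))  # prints 1-16-01
```

Because the values are plain decimal positions, nothing in the sketch depends on how a particular encoding turns a position into bytes.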
When I provide lists of characters throughout this book, I usually include Row-Cell (or
Plane-Row-Cell) codes. These are useful for future reference of these data, and so that
you don’t have to hunt for the codes yourself. Now that Unicode plays an important
role in today’s software development efforts, I also provide Unicode scalar values when
appropriate.
What Is a Unicode Scalar Value?
Like Row-Cell notation, Unicode scalar values serve as an encoding-independent method
of referring to specific Unicode characters, or sequences of Unicode characters. It is a
notation, but instead of the decimal values used by Row-Cell notation, it uses hexadecimal
values, due to the larger 256×256 encoding space afforded by Unicode. Still, Unicode
scalar values are not tied to a specific encoding, such as UTF-8, UTF-16, or UTF-32.
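The following Python sketch illustrates this encoding independence: one character has a single scalar value but a different byte sequence in each encoding form. The function names scalar_value and encoded_forms are hypothetical, chosen for this illustration. The scalar value is written with the conventional "U+" prefix, and big-endian byte order is an arbitrary choice made here for UTF-16 and UTF-32.

```python
def scalar_value(ch: str) -> str:
    """Return the scalar value of a single character as U+ plus at least four hex digits."""
    if len(ch) != 1:
        raise ValueError("expected exactly one character")
    return f"U+{ord(ch):04X}"


def encoded_forms(ch: str) -> dict[str, str]:
    """Return the hexadecimal byte sequence of ch in three Unicode encoding forms."""
    codecs = [("UTF-8", "utf-8"), ("UTF-16BE", "utf-16-be"), ("UTF-32BE", "utf-32-be")]
    return {name: ch.encode(codec).hex().upper() for name, codec in codecs}


ch = "亜"
print(scalar_value(ch))   # prints U+4E9C
print(encoded_forms(ch))  # prints {'UTF-8': 'E4BA9C', 'UTF-16BE': '4E9C', 'UTF-32BE': '00004E9C'}
```

The scalar value stays the same regardless of which encoding form is later used to store or transmit the character.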
The syntax for Unicode scalar values is simple and consists of a prefix followed by four
to six hexadecimal digits. The prefix is “U+,” and the length of the hexadecimal digits var-