
C
APPENDIX
The Unicode Character Set
Java characters are encoded using the Unicode Character Set, which is
designed to support international alphabets, punctuation, and mathemat-
ical and technical symbols. Each character is stored as 16 bits, so as many as
65,536 characters are supported.
The American Standard Code for Information Interchange (ASCII) char-
acter set is supported by the first 128 Unicode characters from 0000 to
007F, which are called the controls and Basic Latin characters, as shown on
the next page.
Any character from the Unicode set can be specified as a char literal in a Java
program by using the following syntax: ‘\uNNNN’ where NNNN ...