Appendix B. Character Sets
The first 128 Unicode characters—that is, characters through 127—are identical to the ASCII character set. 32 is the ASCII space; therefore, 32 is the Unicode space. 33 is the ASCII exclamation point; therefore, 33 is the Unicode exclamation point, and so on. Table 2.1 lists this character set.
Table B-1. The first 128 Unicode Characters, Also Known as the ASCII Character Set
Code |
Character |
Code |
Character |
Code |
Character |
Code |
Character |
---|---|---|---|---|---|---|---|
0 |
nul (null) |
32 |
space |
64 |
@ |
96 |
` |
1 |
soh (start of header) |
33 |
! |
65 |
A |
97 |
a |
2 |
stx (start of text) |
34 |
" |
66 |
B |
98 |
b |
3 |
etx (end of text) |
35 |
# |
67 |
C |
99 |
c |
4 |
eot (end of transmission) |
36 |
$ |
68 |
D |
100 |
d |
5 |
enq (enquiry) |
37 |
% |
69 |
E |
101 |
e |
6 |
ack (acknowledge) |
38 |
& |
70 |
F |
102 |
f |
7 |
bel (bell) |
39 |
` |
71 |
G |
103 |
g |
8 |
bs (backspace) |
40 |
( |
72 |
H |
104 |
h |
9 |
tab (tab) |
41 |
) |
73 |
I |
105 |
i |
10 |
lf (linefeed) |
42 |
* |
74 |
J |
106 |
j |
11 |
vtb (vertical tab) |
43 |
+ |
75 |
K |
107 |
k |
12 |
ff (formfeed) |
44 |
, |
76 |
L |
108 |
l |
13 |
cr (carriage return) |
45 |
- |
77 |
M |
109 |
m |
14 |
so (shift out) |
46 |
. |
78 |
N |
110 |
n |
15 |
si (shift in) |
47 |
/ |
79 |
O |
111 |
o |
16 |
dle (data link escape) |
48 |
0 |
80 |
P |
112 |
p |
17 |
dc1 (device control 1, XON) |
49 |
1 |
81 |
Q |
113 |
q |
18 |
dc2 (device control 2) |
50 |
2 |
82 |
R |
114 |
r |
19 |
dc3 (device control 3, XOFF) |
51 |
3 |
83 |
S |
115 |
s |
20 |
dc4 (device control 4) |
52 |
4 |
84 |
T |
116 |
t |
21 |
nak (negative acknowledge) |
53 |
5 |
85 |
U |
117 |
u |
22 |
syn (synchronous idle) ... |
Get Java I/O now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.