Appendix B. Character Sets

The first 128 Unicode characters—that is, characters through 127—are identical to the ASCII character set. 32 is the ASCII space; therefore, 32 is the Unicode space. 33 is the ASCII exclamation point; therefore, 33 is the Unicode exclamation point, and so on. Table 2.1 lists this character set.

Table B-1. The first 128 Unicode Characters, Also Known as the ASCII Character Set

Code

Character

Code

Character

Code

Character

Code

Character

0

nul (null)

32

space

64

@

96

`

1

soh (start of header)

33

!

65

A

97

a

2

stx (start of text)

34

"

66

B

98

b

3

etx (end of text)

35

#

67

C

99

c

4

eot (end of transmission)

36

$

68

D

100

d

5

enq (enquiry)

37

%

69

E

101

e

6

ack (acknowledge)

38

&

70

F

102

f

7

bel (bell)

39

`

71

G

103

g

8

bs (backspace)

40

(

72

H

104

h

9

tab (tab)

41

)

73

I

105

i

10

lf (linefeed)

42

*

74

J

106

j

11

vtb (vertical tab)

43

+

75

K

107

k

12

ff (formfeed)

44

,

76

L

108

l

13

cr (carriage return)

45

-

77

M

109

m

14

so (shift out)

46

.

78

N

110

n

15

si (shift in)

47

/

79

O

111

o

16

dle (data link escape)

48

0

80

P

112

p

17

dc1 (device control 1, XON)

49

1

81

Q

113

q

18

dc2 (device control 2)

50

2

82

R

114

r

19

dc3 (device control 3, XOFF)

51

3

83

S

115

s

20

dc4 (device control 4)

52

4

84

T

116

t

21

nak (negative acknowledge)

53

5

85

U

117

u

22

syn (synchronous idle) ...

Get Java I/O now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.