Appendix A. Character Sets

The first 128 Unicode characters—that is, characters 0 through 127—are identical to the ASCII character set. 32 is the ASCII space; therefore, 32 is the Unicode space. 33 is the ASCII exclamation point; therefore, 33 is the Unicode exclamation point, and so on. Table A-1 lists this character set.

Table A-1. The first 128 Unicode characters and the ASCII character set

Code

Character

Code

Character

Code

Character

Code

Character

0

NUL (null)

32

space

64

@

96

`

1

SOH (start of header)

33

!

65

A

97

a

2

STX (start of text)

34

"

66

B

98

b

3

ETX (end of text)

35

#

67

C

99

c

4

EOT (end of transmission)

36

$

68

D

100

d

5

ENQ (enquiry)

37

%

69

E

101

e

6

ACK (acknowledge)

38

&

70

F

102

f

7

BEL (bell)

39

`

71

G

103

g

8

BS (backspace)

40

(

72

H

104

h

9

TAB (tab)

41

)

73

I

105

i

10

LF (linefeed)

42

*

74

J

106

j

11

VTB (vertical tab)

43

+

75

K

107

k

12

FF (formfeed)

44

,

76

L

108

l

13

CR (carriage return)

45

77

M

109

m

14

SO (shift out)

46

.

78

N

110

n

15

SI (shift in)

47

/

79

O

111

o

16

DLE (data link escape)

48

0

80

P

112

p

17

DC1 (device control 1, XON)

49

1

81

Q

113

q

18

DC2 (device control 2)

50

2

82

R

114

r

19

DC3 (device control 3, XOFF)

51

3

83

S

115

s

20

DC4 (device control 4)

52

4

84

T

116

t

21

NAK (negative acknowledge)

53

5

85

U

117

u

22

SYN (synchronous idle)

54

6

86

Get Java I/O, 2nd Edition now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.