Skip to Main Content
Unicode Explained
book

Unicode Explained

by Jukka K. Korpela
June 2006
Beginner content levelBeginner
688 pages
26h 18m
English
O'Reilly Media, Inc.
Content preview from Unicode Explained
Short Long name of property Java function
blk Block UnicodeBlock.of
gc General Category getType
IDC ID Continue isUnicodeIdentifierPart
IDS ID Start isUnicodeIdentifierStart
lc Lowercase Mapping toLowerCase
Lower Lowercase isLowerCase
nv Numeric Value getNumericValue
tc Titlecase Mapping toTitleCase
uc Uppercase Mapping toUpperCase
Upper Uppercase isUpperCase
WSpace White Space isWhitespace
The Java function isLetter doesn’t quite correspond to the Alphabetic property, since
the latter is true also for characters with General Category value of Nl (Number, letter)
and for characters with the OAlpha (Other, Alphabetic) property. For most practical
purposes, isLetter is adequate for testing whether a character is alphabetic. In some
cases, isUnicodeIdentifierStart is better, since it includes Nl.
In addition to functions like isUnicodeIdentifierStart, there are functions like isJa
vaIdentifierStart, which are quite similar but allow $ and _, too.
In Java 5.0 and later, most of the functions that correspond to Unicode properties are
defined both for character (char) and integer (int) arguments. In the latter case, the
argument is treated as a code point, which may refer outside the BMP. Thus, you can
relatively conveniently work with non-BMP characters, too.
The return values of functions that correspond to Unicode properties with enumerated
values are technically of type byte or int. The values, encoded ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Unicode Demystified

Unicode Demystified

Richard Gillam
Fonts & Encodings

Fonts & Encodings

Yannis Haralambous
Core Java™ Data Objects

Core Java™ Data Objects

Sameer Tyagi, Keiron McCammon, Michael Vorburger, Heiko Bobzin

Publisher Resources

ISBN: 059610121XCatalog PageErrata