O'Reilly logo

Unicode Demystified by Richard Gillam

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Other Categories

Over time, it's become necessary to draw finer distinctions between characters than the general categories let you do. It's also been noticed that some overlap exists between the general categories. Another set of categories, defined mostly in PropList.txt, has been created to capture these distinctions.

  • Whitespace. Many processes that operate on text treat various characters as “whitespace,” important only insofar as it separates groups of other characters from one another. In Unicode, “whitespace” can be thought of mainly as consisting of the characters in the Z (separator) categories. This is one case in which the ISO control characters have real meaning—most processes want the code points corresponding to the old ASCII and ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required