Special Characters

Not every Unicode code point value represents something most people would actually call a “character.” Some Unicode code point values have no glyph shape at all, but exist solely to influence the behavior of some process operating on the surrounding text. In addition to the “control characters” with which we're all familiar from ASCII, Unicode includes “characters” whose purpose is to influence line breaking, glyph selection, bidirectional text layout, and various other text processes.

Line and Paragraph Separators

Arguably the most important special characters in Unicode are those used to mark a line or paragraph division. Unfortunately, the situation in Unicode with regard to line and paragraph divisions is rather complicated. ...

Get Unicode Demystified now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.