Unicode Explained
by Jukka K. Korpela
June 2006
Beginner
688 pages
26h 18m
English
O'Reilly Media, Inc.
Character Requirements of Languages
Although Unicode contains almost all characters used in languages in current use, it
remains relevant, and always will, to consider the character requirements that different
languages impose. Here we first list some of the reasons for this, then analyze
the concept of “character requirements,” and finally study some specific languages.
The Impact of Character Repertoire
As mentioned in the section “Definitions of Character Repertoires” in Chapter 1, there
are good reasons to try to estimate the repertoire of characters that will appear in a
document or in an application. In more detail, the reasons include the following:
• A font typically supports only a limited character repertoire. Full Unicode fonts
  are rare, and usually not suitable for copy text.
• In particular, artistic or otherwise special fonts, such as those used for headings
  and buttons, often have a very limited character repertoire.
• A program that will be used for processing your document in some way might be
  prepared to handle only a limited repertoire.
• Special characters in normal text often result from mistyping or other errors. When
  checking input data, it is often useful to detect any “unusual” characters and issue
  warnings about them.
• In particular, character recognition (in scanning text or in processing handwritten
  characters) works best if the assumed repertoire is small. ...
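
As a rough illustration of the input-checking idea in the list above, the following Python sketch computes the repertoire of a text and reports every character that falls outside an expected repertoire. The names EXPECTED, repertoire, and unusual_characters are hypothetical, and the expected repertoire (basic Latin letters, digits, some punctuation, and a few letters used in Finnish) is only an assumption chosen for the example; a real application would define its own.

```python
import unicodedata

# Hypothetical expected repertoire (an assumption for this sketch):
# basic Latin letters, digits, common punctuation, and the letters
# a-diaeresis, o-diaeresis, a-ring in lowercase and uppercase.
EXPECTED = set(
    "abcdefghijklmnopqrstuvwxyz"
    "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    "0123456789"
    " .,;:!?'\"()-\n"
    "\u00e4\u00f6\u00e5\u00c4\u00d6\u00c5"
)


def repertoire(text):
    """Return the set of distinct characters that occur in text."""
    return set(text)


def unusual_characters(text, expected=EXPECTED):
    """Return (index, character, code point, name) for each character
    of text that is not in the expected repertoire."""
    findings = []
    for index, ch in enumerate(text):
        if ch not in expected:
            name = unicodedata.name(ch, "<unnamed>")
            findings.append((index, ch, f"U+{ord(ch):04X}", name))
    return findings


# Sample text written with escapes so that every letter is precomposed:
# "Hyvaa paivaa" with a-diaeresis, then "naive cafe" with i-diaeresis
# and e-acute, which are outside the expected repertoire above.
sample = "Hyv\u00e4\u00e4 p\u00e4iv\u00e4\u00e4, na\u00efve caf\u00e9!"

for index, ch, code, name in unusual_characters(sample):
    print(f"warning: position {index}: {code} {name}")
```

The check works on individual code points, so a letter typed as a base character followed by a combining mark would be reported even if its precomposed form is in the expected set; normalizing the input first (for example with unicodedata.normalize) is one way to handle that.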
ISBN: 059610121X