Skip to Content
Unicode Explained
book

Unicode Explained

by Jukka K. Korpela
June 2006
Beginner
688 pages
26h 18m
English
O'Reilly Media, Inc.
Content preview from Unicode Explained
characters may be lost. However, an implementation of Unicode is required to preserve
characters, instead of, for example, dropping out characters that it does not recognize.
It may well fail to display them, but they should be available in the data by other means.
For example, for the expression m², the first two methods just discussed imply that in
cut and paste, the result preserves the information: m². (For method 2, we assume that
you cut from the formatted document, not from XML or HTML source.) For methods
3 and 4, cut and paste normally converts the text to “m2,” unless the operation takes
place inside a program or between programs that recognize the method used. Thus, if
you copy and paste the string “m
2
” where “2” is formatted as a superscript, the for-
matting is preserved when working inside a word processor, but not when copying
from it into a plain text editor like Notepad. When method 5 is used, the data copied
is of course “m2.”
Similarly, when data is read by a program, information expressed at the character level
is always available to the program, though it may not make use of it. Information ex-
pressed in markup is normally available, too, since programs normally read the markup
source, but they would need to recognize the markup—at least to the extent that it can
skip it, instead of treating markup as data! Reading data in a word processor’s internal
format is possible, ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Unicode Demystified

Unicode Demystified

Richard Gillam
Fonts & Encodings

Fonts & Encodings

Yannis Haralambous
The Java® Language Specification, Java SE 8 Edition

The Java® Language Specification, Java SE 8 Edition

James Gosling, Bill Joy, Guy L. Steele Jr., Gilad Bracha, Alex Buckley

Publisher Resources

ISBN: 059610121XCatalog PageErrata