
Basic Concepts and Terminology FAQ
|
19
as used by soware, typically consists of a language identier and a country or region
identier.
What Is Unicode?
Unicode is the rst truly successful multilingual character set standard, and it is sup-
ported by three primary encoding forms, UTF-8, UTF-16, and UTF-32. Unicode is also a
major focus of this book.
Conceived 20 years ago by my friend Joe Becker, Unicode has become the preferred char-
acter set and has been successful in enabling a higher level of internationalization. In
other words, Unicode has trivialized many aspects of soware internationalization.
How Are Unicode and ISO 10646 Related?
Make no mistake, Unicode and ISO 10646 are dierent standards.
*
e development of
Unicode is managed by e Unicode Consortium, and that of ISO 10646 is managed by the
International Organization for Standardization (ISO). But, what is important is that they
are equivalent, or rather kept equivalent, through a process that keeps them in sync.
ISO 10646 increases its character repertoire through new versions of the standard, ad-
ditionally designated by year, along with amendments. Unicode, on the other hand, does
the same through new versions. It is possible to correlate Unicode and ISO 10646 by indi-
cating the version of the former, and the year and amendments of the latter.
More detailed coverage of Unicode can be found in Chapter 3, and ...