Unicode Demystified by Richard Gillam

Multiple-Byte Encoding Systems

These distinctions between layers become more interesting when you start talking about East Asian languages such as Chinese, Japanese, and Korean. These languages all make use of Chinese characters. No one is really sure how many Chinese characters exist, although the total probably exceeds 100,000. Most Chinese speakers have a working written vocabulary of about 5,000 characters. Japanese and Korean speakers, who rely more on auxiliary writing systems to supplement the Chinese characters, have a somewhat smaller working vocabulary of them.
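To see why repertoires of this size push an encoding past a single byte, here is a rough back-of-the-envelope sketch. It assumes an idealized fixed-width code in which every byte value is usable, which real East Asian encodings do not do, so the results are lower bounds only. The function name `min_bytes_per_char` is hypothetical and introduced purely for illustration.

```python
def min_bytes_per_char(repertoire_size: int) -> int:
    """Smallest whole number of bytes that can give each of
    `repertoire_size` characters a distinct fixed-width code.

    Assumption (not from the text): all 256 values of every byte
    are available, so this is a lower bound, not a description of
    any real encoding.
    """
    if repertoire_size < 1:
        raise ValueError("repertoire_size must be at least 1")
    # Bits needed to number the characters 0 .. repertoire_size - 1.
    bits = (repertoire_size - 1).bit_length()
    # Round up to whole bytes; even a one-character set takes one byte.
    return max(1, (bits + 7) // 8)


if __name__ == "__main__":
    # Figures taken from the paragraph above, plus the single-byte limit.
    for label, size in [
        ("single-byte limit", 256),
        ("working vocabulary", 5_000),
        ("estimated total", 100_000),
    ]:
        print(f"{label}: {size} characters -> "
              f"{min_bytes_per_char(size)} byte(s) minimum")
```

Under these assumptions, a working vocabulary of about 5,000 characters already needs two bytes per character, and a repertoire of 100,000 would need three.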

East Asian Coded Character Sets

With that many characters, you must start by officially defining particular sets of Chinese characters in which you're interested. The Japanese government, ...
