
In most cases, EUC encoding is simply ISO-2022 encoding with the high bits set and
without escape or shift sequences of any kind.* Algorithms for code conversion are
discussed in Chapter 9.
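The relationship between the two encodings can be sketched in a few lines of Python. This is a minimal illustration, not the conversion algorithm from Chapter 9: the function names are hypothetical, and it assumes a two-byte character from the main code set (code set 1), where no single-shift bytes are involved. The sample value 0x3441 is the JIS X 0208 code for 漢.

```python
def iso2022_to_euc(code: bytes) -> bytes:
    """Turn 7-bit ISO-2022 character bytes into EUC by setting the high bit.

    Hypothetical helper: expects only the character bytes (0x21-0x7E),
    with any escape or shift sequences already removed.
    """
    if any(not 0x21 <= b <= 0x7E for b in code):
        raise ValueError("expected 7-bit graphic bytes in the range 0x21-0x7E")
    return bytes(b | 0x80 for b in code)


def euc_to_iso2022(code: bytes) -> bytes:
    """Reverse direction: clear the high bit of each EUC byte (0xA1-0xFE)."""
    if any(not 0xA1 <= b <= 0xFE for b in code):
        raise ValueError("expected 8-bit bytes in the range 0xA1-0xFE")
    return bytes(b & 0x7F for b in code)


jis = bytes([0x34, 0x41])          # JIS X 0208 code for 漢, as used in ISO-2022-JP
euc = iso2022_to_euc(jis)
print(euc.hex())                   # b4c1
print(euc == "漢".encode("euc_jp"))  # True
```

The round trip through `euc_to_iso2022` recovers the original 7-bit bytes, which is why EUC needs no escape sequences: the high bit alone distinguishes these characters from ASCII.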
Locale-Specific Legacy Encoding Methods
Included here are descriptions of legacy encoding methods used for specific locales. I
find it intriguing that each of these five CJKV locales has at least one locale-specific
encoding method. Table 4-61 lists some CJKV character set standards, along with their
locale-specific encoding methods.
Table 4-61. Locale-specific legacy encoding methods

Character set          Encoding method    Locale
GBK                    GBK                China
GB 18030-2005          GB 18030           China
Big Five [a]           Big Five           Taiwan
Big Five Plus          Big Five Plus      Taiwan
Hong Kong SCS-2008     Big Five           Hong Kong
JIS X 0208:1997 [b]    Shift-JIS          Japan
KS X 1001:2004         Johab              Korea

a. Keep in mind that CNS 11643-2007 Planes 1 and 2 are equivalent to Big Five.
b. JIS X 0213:2004 is explicitly excluded from this table, because the Shift-JIS encoding for the characters specific to JIS X 0213:2004, meaning
those characters beyond JIS X 0208:1997, is Informative, not Normative. JIS X 0213:2004 is expected to be implemented through the use of
Unicode and its encodings.
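As a quick way to experiment with most of the encoding methods in this table, the following sketch encodes one sample character per method using Python's standard codecs. The codec names (`gbk`, `gb18030`, `big5`, `big5hkscs`, `shift_jis`, `johab`) are Python's own labels, not names taken from the standards, and the sample characters are arbitrary choices. Big Five Plus is left out because, as far as I know, the Python standard library does not ship a codec for it.

```python
# Map each encoding method to (Python codec name, sample character).
# The pairing of codec names with the table's rows is an assumption of this sketch.
SAMPLES = {
    "GBK": ("gbk", "中"),
    "GB 18030": ("gb18030", "中"),
    "Big Five": ("big5", "中"),
    "Big Five (Hong Kong SCS)": ("big5hkscs", "中"),
    "Shift-JIS": ("shift_jis", "漢"),
    "Johab": ("johab", "한"),
}


def encode_samples() -> dict[str, bytes]:
    """Encode each sample character with its locale-specific codec."""
    return {name: text.encode(codec) for name, (codec, text) in SAMPLES.items()}


for name, encoded in encode_samples().items():
    # Print the raw bytes in hexadecimal so the encoded values can be compared.
    print(f"{name:26} {encoded.hex().upper()}")
```

Each of these sample characters encodes to two bytes, but the byte values differ from one method to the next, even for the same character.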
The following sections cover each of these encoding methods in significant detail, and
some also provide encoding examples similar to those found in earlier sections.
Comparisons ...