
288
|
Chapter 4: Encoding Methods
Round-trip Unicode mapping for Microso Windows-J character setTable 4-87.
Character Shift-JIS code points To Unicode Back to Shift-JIS
3, 44 2174 44
4, 45 2175 45
5, 46 2176 46
6, 47 2177 47
7, 48 2178 48
8, 49 2179 49
, 55 4 55
, 56 7 56
, 57 2 57
Korean Code Conversion
Korean code conversion, when dealing with only EUC-KR and ISO-2022-KR encodings,
is trivial. However, when Johab encoding is added to the mix, things become somewhat
less trivial. Converting EUC-KR and ISO-2022-KR encodings to Johab encoding results
in no loss of data, because all of their characters can be represented in Johab encoding.
However, the 8,822 additional hangul made available in Johab encoding cannot convert to
EUC-KR and ISO-2022-KR encodings, except as strings of individual jamo, all of which
are encoded in EUC-KR and ISO-2022-KR encodings. Hcode is a very popular Korean
code conversion program that supports a wide variety of Korean encodings. Its URL can
be found in Table 4-85.
Perl subroutines that illustrate algorithmic conversion to and from Johab encoding—they
apply to all KS X 1001:2004 characters, except to the 51 modern jamo and hangul sylla-
bles—are provided in Appendix C.
Code Conversion Across CJKV Locales
All of the dicult, yet very interes ...