
290
|
Chapter 4: Encoding Methods
Korean-enabled OS and receive an ISO-2022-KR–encoded le, it may be dicult to de-
termine whether code conversion is necessary. Even well-formed ISO-2022-KR–encoded
les may not display on a Korean-enabled OS, because it may instead expect data to be
encoded according to EUC-KR or Johab encodings.
Consider the following past scenario. You were using a Japanese-capable operating sys-
tem, such as Mac OS-J. You received two les. One le was EUC-JP–encoded, and the
other was ISO-2022-JP–encoded, but it was damaged because its escape characters had
been stripped away. Both les displayed equally un-well when opened in a typical text
editor or other text-processing application. Until the EUC-JP–encoded le was converted
to Shi-JIS encoding, it could not be used on Mac OS-J. Likewise, until the ISO-2022-JP–
encoded le was repaired and converted to Shi-JIS encoding, it also could not be used
on Mac OS-J.
e subject of handling damaged or otherwise unreadable CJKV text, which is closely
related to code conversion, is covered in the next section.
Repairing Damaged or Unreadable CJKV Text
CJKV text les can be damaged in dierent ways depending on which encoding was used
in the le. ISO-2022–encoded text is usually damaged by unfriendly email clients (and
news readers) that remove control characters, including the all-important escape and shi
characters. ...