
618
|
Chapter 9: Information Processing Techniques
I have written a tool, called JChar, that generates these problematic character sets (and
nonproblematic ones, too!). Listings of these noncoded Japanese character set standards,
as generated by JChar, are in Appendix J.
e JChar tool has many features and options that you will nd useful at some time or
another—the main ones are as follows:
Supports the JIS X 0208:1997, ASCII/JIS-Roman, half-width katakana, Jōyō Kanji, •
Gakushū Kanji, and Jinmei-yō Kanji character sets
Outputs data in ISO-2022-JP, Shi-JIS, or EUC-JP encoding•
Wraps output lines at • n columns
Can suppress header information•
Algorithms used in this tool are primarily loops (for the coded character sets) and data
structures (for the noncoded character sets). Encoding range bounds are used, though, to
generate the whole character encoding space, and not just the code positions that contain
characters. For example, when choosing to generate the JIS X 0208:1997 list, it does not
generate 6,879 code positions, but does 8,836 code positions, which is what you get from
a complete 94×94 matrix.
Here is this tool’s help page:
** jchar v3.0 (July 1, 1993) **
Written by Ken R. Lunde, Adobe Systems Incorporated
lunde@adobe.com
Usage: jchar [-options] [outfile]
Tool description: This tool is a utility for generating various Japanese
character sets in any code. This includes all ...