
study to determine the actual encoding used, since, for example, Swedish, German,
and Finnish ASCII variants all have ä as a replacement for {, but there are differ-
ences in other replacements.
Ã
¤
The data is evidently in UTF-8 encoding. Notice that the characters à and ¤ stand
here for octets C3 and A4, which might be displayed differently depending on the
program and device used. Generally, the frequent appearance of uppercase à is a
strong indication of the problem that UTF-8 encoded data is being interpreted as
ISO-8859-1 encoded.
+AOQ-
The data is in UTF-7 encoding.
Ì
The data is most probably in Roman-8 encoding (defined by Hewlett-Packard).
=E4
The data is in Quoted-Printable encoding. The original encoding, upon which the
QP encoding was applied, might be ISO-8859-1, or any other encoding that rep-
resents character ä in the same way as ISO-8859-1 (i.e., as octet E4 hexadecimal).
ä
The data is in HTML format; the encoding may vary. See Chapter 9.
ä
The data is in HTML or XML format; the encoding may vary.
‰
This character occupies code position E4 in the old Macintosh character code.
Thus, what has probably happened is that some program received ISO-8859-1
encoded data and interpreted it as if it were in Mac encoding, and then performed
a conversion based on that interpretation. It apparently turned E4 into 89, which
is the code position of the per mille sign in the windows-1252 code. ...