Partial consistency in 8859-x?

From: John Cowan (john_cowan@hotmail.com)
Date: Fri Jul 18 1997 - 13:20:56 EDT


Due to network problems, I can read mail at cowan@ccil.org, but
can't post/reply/send from there. Please direct all private replies
to cowan@ccil.org , not the HotMail address. Thanks.

When preparing a comparison table of the various 8859-x parts,
I noticed an odd property of partial consistency between 8859-1-2-3-4.
Any character encoded by more than one coded character set is
always encoded at the same codepoint. (8859-9 does not have
this property; its Turkish letters don't agree with the 8859-3
encoding.)

For example:

        DEGREE SIGN is always encoded at B0;
        CENT SIGN is at A2 in 8859-1, doesn't appear in 8859-2-3-4;
        LATIN SMALL LETTER KRA is at A2 in 8859-4, doesn't appear
           in 8859-1-2-3;
        LATIN CAPITAL LETTER C WITH CEDILLA is at C7 in 8859-1-2-3,
           doesn't appear in 8859-4.
        LATIN SMALL LETTER A WITH OGONEK is at B1 in 8859-2-4,
           doesn't appear in 8859-1-3;
        LATIN SMALL LETTER S WITH CEDILLA is at BA in 8859-2-3,
           doesn't appear in 8859-1-4.

What I'm wondering is whether this property was carefully designed
into 8859-1-2-3-4 when they were specified, or whether it is more or
less an accident of copying.

Does anyone know?
John Cowan cowan@ccil.org
        Please do not use "Reply"
        e'osai ko sarji la lojban.
______________________________________________________
Get Your Private, Free Email at http://www.hotmail.com



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:36 EDT