From: John Cowan (jcowan@reutershealth.com)
Date: Tue Oct 08 2002 - 11:01:12 EDT
Marco Cimarosti scripsit:
> All 8859 tables would be more succint.
Well, I checked the 8859-2 mapping table, and the only contiguous ranges
are of length 2, namely 0xA7-0xA8, 0xC1-0xC2, 0xCD-0xCE, 0xD3-0xD4,
0xD6-0xD7, 0xDC-0xDD, 0xE1-0xE2, 0xF3-0xF4, 0xF6-0xF7, 0xFC-0xFD.
All of these are places where Latin-1 and Latin-2 coincide.
> Latin sections are a worse case, but they still benefit slightly, because
> characters shared with Latin-in stay the same positions.
This is strongly true of Latin-1, -2, -3, -4: a character appears in the
same codepoint or not at all. Unfortunately, Latin-3 is used only for
Esperanto and Maltese, and Latin-4 is dead. The later Latins share only
with Latin-1.
-- A mosquito cried out in his pain, John Cowan "A chemist has poisoned my brain!" http://www.ccil.org/~cowan The cause of his sorrow http://www.reutershealth.com Was para-dichloro- jcowan@reutershealth.com Diphenyltrichloroethane. (aka DDT)
This archive was generated by hypermail 2.1.5 : Tue Oct 08 2002 - 11:51:03 EDT