Re: Unicode Latin combining diacritics - Looking for real-world example documents

From: Frank da Cruz (fdc@columbia.edu)
Date: Tue Apr 02 2002 - 09:33:43 EST


> We're doing some testing of Latin Diacritic support for IPA and African
> languages, romanizations, etc., and it is (understandably) very hard to
> find any "real" text in languages that require this support...
>
Well, so far we have "I can eat glass" in Yoruba and Twi:

  http://www.columbia.edu/kermit/utf8.html

For those who haven't looked lately, this little sample page has been
collecting entries at a slow but steady clip -- we're up to 109 now, nearly
1% of the world's languages (send more!). Of course half of them are German
and Italian dialects...

I recently installed Windows XP, and with its standard complement of fonts,
this page doesn't look half bad -- the only entries that don't show up are
the Runic ones, CJK, and the Middle English letter Yogh. More of the
entries are coming through in monospace too -- even Armenian, Georgian, and
Hindi.

Progress!

- Frank


