Grumping about Greek (was: Re: UTF-8 can be used for more ... )

From: Kenneth Whistler
Date: Fri Jun 09 2006 - 21:44:03 CDT

    > So maybe the best solution is to ...

    The best solution is to stop expecting out-of-the-box
    casing implementations to do a hunky-dory, perfect, nobody-
    could-complain-about-anything job of casing and folding
    for modern *and* ancient Greek, monotonic *and* polytonic,
    precomposed *and* decomposed, based on a cobbled-together,
    mixed-up encoding of Greek that was not engineered in the
    first place to facilitate casing operations on Greek.

    The Greek encoding in Unicode is a mess. Well, not quite
    as big a mess as the Hangul encoding in Unicode, but close.

    If people are expecting perfection in the handling of
    Greek data, then they need to pre-process their Greek
    data and then write specialized, perfect algorithms to
    produce the results they want.
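
    As a rough illustration of that kind of pre-processing, here is a
    minimal Python sketch that builds an accent- and case-insensitive
    comparison key for Greek text. The helper name fold_greek is
    hypothetical, and the policy it applies (discard every combining
    mark, including the iota subscript, before folding case) is an
    assumption chosen for illustration, not a recommendation for all
    Greek data. It uses only the standard unicodedata module.

```python
import unicodedata


def fold_greek(text: str) -> str:
    """Hypothetical helper: accent- and case-insensitive key for Greek."""
    # Decompose first, so precomposed and decomposed input look alike.
    decomposed = unicodedata.normalize("NFD", text)
    # Assumed policy: drop all combining marks (tonos, breathings,
    # iota subscript, ...). This must happen before case folding,
    # because folding turns U+0345 into a full iota letter.
    stripped = "".join(
        ch for ch in decomposed if unicodedata.combining(ch) == 0
    )
    # casefold() maps final sigma (U+03C2) and sigma (U+03C3) together.
    return unicodedata.normalize("NFC", stripped.casefold())


# Monotonic lower case with tonos vs. unaccented upper case.
print(fold_greek("οδός") == fold_greek("ΟΔΟΣ"))
# Precomposed vs. decomposed alpha with tonos.
print(fold_greek("\u03ac") == fold_greek("\u03b1\u0301"))
```

    Whether throwing away the diacritics is acceptable depends entirely
    on the data; polytonic text in particular may need a different
    policy for the iota subscript.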


    *grump* *grump* *grump*
