RE: Case mapping of dotless lowercase letters

From: Michael Everson (everson@evertype.com)
Date: Tue Dec 16 2003 - 06:27:41 EST

  • Next message: Michael Everson: "Re: Stability of WG2"

    At 11:03 +0100 2003-12-16, Philippe Verdy wrote:
    >Doug Ewell <dewell@adelphia.net> writes:
    >> > Wrong here: I have found occurences of dotless lowercase i, used
    >> > instead of soft-dotted lowercase i, as base letters for diacritics
    >> > added above it (it was an accute accent...)
    >>
    >> Don't do that.
    >
    >What? This is VALID UNICODE to have texts coded like this.

    In Irish, it is INCORRECT to spell "físeán"
    'video' with a DOTLESS I + COMBINING ACUTE. It is
    a spelling error, and will fail in
    spell-checking. The correct spelling is either I
    + COMBINING ACUTE or precomposed I WITH ACUTE.

    It is VALID UNICODE to follow LATIN CAPITAL
    LETTER Q with DEVANAGARI VOWEL SIGN E but that
    doesn't mean it's the right way to write anything.

    >For whatever reason, encoded texts exist before correct fonts are used to
    >render them. So there does exist texts which use dotless lowercase i before
    >a diacritic above, simply because the author of the text did not want it to
    >be rendered with a superposed dot.

    Texts which contain spelling errors. Or old IPA
    texts using any number of ad-hoc IPA font
    solutions. Those texts have to be transcoded to
    proper Unicode at some stage. What you suggest is
    Not Recommended.

    -- 
    Michael Everson * * Everson Typography *  * http://www.evertype.com
    


    This archive was generated by hypermail 2.1.5 : Tue Dec 16 2003 - 07:09:53 EST