Re: U+2047 double question mark collation

From: Ake Persson (ake.persson@mimer.se)
Date: Wed Jan 15 2003 - 07:28:44 EST

  • Next message: Christoph Päper: "Small Latin Letter m with Macron"

    Vadim Godunko wrote:
    > Hi,
    >
    > I have a problem with creating collation key for U+2047 (double question
    mark).
    >
    > Explicit collation keys for this symbol is absent in allkeys.txt.
    >
    > In UnicodeData.txt this symbol have compatibility decomposition map.
    >
    > 2047: ... :<compat> 003F 003F: ...
    >
    > Based on this and as defined in UTR #10 Unicode Collation Algoriphm this
    > symbol must have these collation keys:
    >
    > 003F [*024E.0020.0004]
    > 003F [*024E.0020.0004]
    >
    > But in CollationTest_NON_IGNORABLE.txt assumes that symbol have implicit
    > collation key [FBC0.0020.0002] [A047.0000.0000].
    >
    > What is incorrect?

    The header in allkeys.txt states: version 3.1.1. There are 1016 characters
    defined in Unicode 3.2, which are not mapped (yet). See:
    http://www.unicode.org/Public/3.2-Update/DerivedAge-3.2.0.txt

    I believe the version and repertoire of the Unicode Collation Algorithm
    (UCA) will be synchronized with versions of the Unicode Standard starting
    with Version 4.0.

    Best regards,
    Åke Persson



    This archive was generated by hypermail 2.1.5 : Wed Jan 15 2003 - 08:13:15 EST