RE: Conflicting principles

From: Kent Karlsson (kentk@cs.chalmers.se)
Date: Thu Aug 07 2003 - 17:52:18 EDT

  • Next message: Karljürgen Feuerherm: "Re: Colourful scripts and Aramaic"

    > > Collation isn't really based on combining sequences (even though UTS
    10
    > > specifies a certain "spanning" over non-blocking (combining)
    >
    > This is a very ignorant question: where in your public documentation
    > are these issues discussed?
    ...
    > I still don't understand even what happens with basic
    > collation in Hebrew, what
    > effect the shin / sin dots have.

    Ignored at level 1, considered at level 2. From the 14651 data file:

    <U05C1> IGNORE;<SHINP>;<MIN>;<U05C1> % HEBREW POINT SHIN DOT
    <U05C2> IGNORE;<SINPT>;<MIN>;<U05C2> % HEBREW POINT SIN DOT

    > And, of course, I don't
    > understand any of the
    > more complicated issues either, such as what will happen when
    > your database
    > sorts un-pointed Hebrew epigraphy (just the consonants) and
    > pointed medieval
    > Hebrew (all the jots and tittles added).

    Re. collation, see UTS 10, and associated data files, and if you're
    really interested, see ISO/IEC 14651 (sort of a parallel to UTS 10,
    but different), and its data file.

                    /kent k



    This archive was generated by hypermail 2.1.5 : Thu Aug 07 2003 - 18:37:00 EDT