Re: Collated lists of code points

From: Markus Scherer (markus.scherer@jtcsv.com)
Date: Fri Aug 23 2002 - 17:02:07 EDT

Previous message: Peter_Constable@sil.org: "Re: Recent changes to i18n standards"
In reply to: J M Craig: "Collated lists of code points"
Next in thread: J M Craig: "Re: Collated lists of code points"
Next in thread: rick@unicode.org: "Re: Collated lists of code points"
Reply: J M Craig: "Re: Collated lists of code points"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

For example, ICU (http://oss.software.ibm.com/icu/) comes with collation.
The default sort order for all of Unicode is from the UCA (http://www.unicode.org/reports/tr10/).
For some languages we have tailorings to collate certain code points or sequences differently.

The source files for those are in our locale resource bundles, http://oss.software.ibm.com/cvs/icu/icu/source/data/locales/
See files ja.txt, ko.txt, and variations of zh*.txt.
For the format of the collation tailorings see http://oss.software.ibm.com/icu/userguide/Collate_Customization.html

There are charts that show the tailorings in a more graphic way, although they cannot convey all the details (like contractions):
http://oss.software.ibm.com/icu/charts/collation/index.html

Best regards,
markus

J M Craig wrote:

> ... Does anyone know
> where I can get ( free :-) or not ) lists of Unicode code points
> collated according to some particular approach. ...

Previous message: Peter_Constable@sil.org: "Re: Recent changes to i18n standards"
In reply to: J M Craig: "Collated lists of code points"
Next in thread: J M Craig: "Re: Collated lists of code points"
Next in thread: rick@unicode.org: "Re: Collated lists of code points"
Reply: J M Craig: "Re: Collated lists of code points"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.2 : Fri Aug 23 2002 - 15:17:49 EDT