For example, ICU (http://oss.software.ibm.com/icu/) comes with collation.
The default sort order for all of Unicode is from the UCA (http://www.unicode.org/reports/tr10/).
For some languages we have tailorings to collate certain code points or sequences differently.
The source files for those are in our locale resource bundles, http://oss.software.ibm.com/cvs/icu/icu/source/data/locales/
See files ja.txt, ko.txt, and variations of zh*.txt.
For the format of the collation tailorings see http://oss.software.ibm.com/icu/userguide/Collate_Customization.html
There are charts that show the tailorings in a more graphic way, although they cannot convey all the details (like contractions):
http://oss.software.ibm.com/icu/charts/collation/index.html
Best regards,
markus
J M Craig wrote:
> ... Does anyone know
> where I can get ( free :-) or not ) lists of Unicode code points
> collated according to some particular approach. ...
This archive was generated by hypermail 2.1.2 : Fri Aug 23 2002 - 15:17:49 EDT