Re: Problems/Issues with CJK and Unicode

From: John Cowan (jcowan@reutershealth.com)
Date: Fri Apr 07 2000 - 14:35:22 EDT


> Hoon Kim wrote:
>
> "Sort" would be one of those problems.
> (For Korean and Japanese, you would expect to sort by pronunciation, which
> differs from the order in which the Unihan characters are arranged.)

Sorting can't be done by Unicode code points for any language; the right order
depends on the language of the reader, not the language of the text. For
example, an index of Swedish names for an English reader would sort ö with o,
not after z as a Swedish reader would expect.
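
To make the point concrete, here is a minimal Python sketch that sorts the same
list three ways. The names, the two key functions, and the hand-written Swedish
alphabet table are all illustrative assumptions, not a real collation
implementation; production code would normally use a proper collation library
(for example one based on the Unicode Collation Algorithm) instead.

```python
import unicodedata

# Hypothetical sample data, precomposed (NFC) characters.
names = ["Öberg", "Olsson", "Zetterberg", "Åkesson", "Ärlund"]

def english_key(s):
    # Assumed rule for an English reader: ignore diacritics, so ö files
    # with o and å/ä file with a. Decompose, then drop combining marks.
    decomposed = unicodedata.normalize("NFD", s)
    base = "".join(c for c in decomposed if not unicodedata.combining(c))
    return (base.casefold(), s)

# Assumed rule for a Swedish reader: å, ä, ö are separate letters that
# come after z, in that order.
SWEDISH_ALPHABET = "abcdefghijklmnopqrstuvwxyzåäö"
SWEDISH_RANK = {c: i for i, c in enumerate(SWEDISH_ALPHABET)}

def swedish_key(s):
    text = unicodedata.normalize("NFC", s).casefold()
    # Characters outside the table sort after everything else.
    return [SWEDISH_RANK.get(c, len(SWEDISH_ALPHABET)) for c in text]

by_codepoint = sorted(names)                   # raw code point order
for_english = sorted(names, key=english_key)   # ö with o
for_swedish = sorted(names, key=swedish_key)   # ö after z

print(by_codepoint)
print(for_english)
print(for_swedish)
```

Code point order happens to put Ä (U+00C4) before Å (U+00C5), which is wrong
for the Swedish reader as well as the English one, so neither audience is
served by sorting on the raw values.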

-- 

Schlingt dreifach einen Kreis um dies! || John Cowan <jcowan@reutershealth.com>
Schliesst euer Aug vor heiliger Schau, || http://www.reutershealth.com
Denn er genoss vom Honig-Tau,          || http://www.ccil.org/~cowan
Und trank die Milch vom Paradies.            -- Coleridge (tr. Politzer)


