Re: Problems/Issues with CJK and Unicode

From: John Jenkins
Date: Fri Apr 07 2000 - 14:28:12 EDT

on 4/7/00 10:58 AM, Hoon Kim wrote:

> "Sort" would be one of those problems.
> (For Korean and Japanese, you would expect to sort by pronunciation, which
> would differ from the order in which the Unihan characters were placed.)

Unihan characters are placed in a pseudo-radical/stroke order which is
algorithmically derivable but guaranteed to be culturally incorrect in any
real-world situation. That is, it's always wrong to sort Unihan in code
point order. This is particularly true now that there are official
non-compatibility ideographs outside of the U+4E00 - U+9FFF range.
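
To make the difference concrete, here is a rough Python sketch that sorts four ideographs by code point and then by a pronunciation key. The reading tables (JA_READING, KO_READING) and the helper names are made up for illustration, with one common reading per character. Comparing the readings themselves by code point is also a simplification; a real application would use a proper collation library with a locale-specific tailoring.

```python
# Illustrative readings only (an assumption for this sketch): one common
# reading per character. Real data would come from a dictionary or a
# collation library, and many characters have several readings.
JA_READING = {"一": "いち", "山": "やま", "川": "かわ", "愛": "あい"}
KO_READING = {"一": "일", "山": "산", "川": "천", "愛": "애"}


def sort_by_code_point(chars):
    """Sort the way a naive binary comparison would: by code point."""
    return sorted(chars)  # Python compares strings code point by code point


def sort_by_reading(chars, readings):
    """Sort by a pronunciation key looked up in the given table."""
    return sorted(chars, key=lambda ch: readings[ch])


chars = ["愛", "川", "山", "一"]

by_code_point = sort_by_code_point(chars)
by_japanese = sort_by_reading(chars, JA_READING)
by_korean = sort_by_reading(chars, KO_READING)

print("code point:", by_code_point)
print("Japanese  :", by_japanese)
print("Korean    :", by_korean)
```

All three orders come out different for the same four characters, so no single fixed order, code point or otherwise, can satisfy both a Japanese and a Korean reader.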

John H. Jenkins
