Character classification and casing and locales

From: Bhupinder Singh (Bhupinder.Singh@Bentley.COM)
Date: Tue Nov 26 1996 - 12:12:00 EST


     

Is anyone working on Locale support for Unicode encoded strings ? Unless I
misunderstand the issue, if I have a string of 16bit characters encoded in
Unicode, and I wish to uppercase them according to the say, Czech locale, I now
have to convert the string to Latin-2, (or whichever charset the particular OS
supports for the Czech language), use the OS, uppercase them, and covert it
back.
Are there any better ways to do this, and other locale dependent character
operations ? I guess this is a platform specific question, however we are trying
to provide a platform independent solution, using the OS services whenever
possible (Unix, NT etc.) All the definitions for locale that I have found have
language, Region, character set. I havent found on any OS I have looked, support
for Unicode locales (would that be iso10646 ???) I did find UTF8 on the HP for
the C locale.

Thanks for any insight. I am a fairly new person to the world of Unicode.

Bhupinder Singh
Bentley Systems



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:32 EDT