This talk discusses some problems one encounters in manipulating multilingual Unicode text. The problems include using multiple fonts to display Unicode plain text, dealing with neutral characters, deunifying characters that look different in different scripts, dealing with complex scripts, using keyboards to enter Unicode characters conveniently, keeping compatible with previous character sets, glyph variants, and navigating through text that includes "multicharacters" such as Unicode-surrogate pairs, combining-mark sequences, and variable length end-of-paragraph marks. Solutions to some of the problems will be demonstrated using the RichEdit 3.0 controls shipped with Office 2000 and to be shipped with Windows NT 5.0. Glyph variants and surrogate pairs comprise a couple of ongoing topics that lend themselves to several possible approaches. |
When the world wants to talk, it speaks Unicode |
International Unicode Conferences are organized by Global Meeting Services, Inc., (GMS).
GMS is pleased to be able to offer the International Unicode Conferences under an exclusive
license granted by the Unicode Consortium. All responsibility for conference finances and
operations is borne by GMS. The independent conference board serves solely at the pleasure
of GMS and is composed of volunteers active in Unicode and in international software
development. All inquiries regarding International Unicode Conferences should be addressed
to info@global-conference.com. Unicode and the Unicode logo are registered trademarks of Unicode, Inc. Used with permission. 23 January 1999, Webmaster |