From: Tay, William (William.Tay@xerox.com)
Date: Mon Aug 23 2004 - 13:33:30 CDT
Hi,
Can anyone explain why an accented character is sometimes represented as a base character plus its accent? For example, the utf-8 representation for é is 65 CC 81, which is the utf-8 representation for e and the accent, instead of C3 A9? I find that this is how MacOS X represents accented characters.
How can a C application that receives such utf-8 encoded characters handle them correctly? Appreciate your comments.
Thanks.
Will
This archive was generated by hypermail 2.1.5 : Mon Aug 23 2004 - 13:35:40 CDT