From: Roozbeh Pournader (roozbeh@htpassport.com)
Date: Tue May 24 2011 - 18:28:24 CDT
On Mon, 2011-05-23 at 08:17 -0700, Phillips, Addison wrote:
> [...] you generally should not emit NFD "into the wild"
In the real world, of course, you should actually not emit NFC either.
A famous case that comes to bite me again and again, is that some
XP-era Microsoft applications don't render canonically equivalent
strings the same way, so if you normalize something, you lose its
preferred display and semantics. For example, the sequence <ARABIC
LETTER SEEN, ARABIC SHADDA, ARABIC FATHA>, which is a kind of very
normal and rather common sequence in Arabic, will be displayed wrongly
in Windows XP's Uniscribe if one actually normalizes it (to either NFC
or NFD), becoming <SEEN, FATHA, SHADDA>, which is displayed wrongly in
both Notepad and Word 2003 under Windows XP.
Roozbeh
This archive was generated by hypermail 2.1.5 : Tue May 24 2011 - 18:31:27 CDT