From: Avraham Shapiro (asha@loc.gov)
Date: Tue Jul 12 2005 - 12:44:24 CDT
** Low Priority **
We have an XML based application that specifies UTF-8 files as input. Occasionally users will
include numeric character entites, for example é for e acute instead of the UTF-8
equivalent of C3 A9. My question is: Is this legal UTF-8? And are numeric or symbolic character
entites valid for Ascii-7 characters such as "<"? My guess is the first one is not legal,
and the second one is application defined, i.e. Unicode says nothing about it. Am I
right?
This archive was generated by hypermail 2.1.5 : Tue Jul 12 2005 - 12:47:32 CDT