character entities in UTF-8 files

From: Avraham Shapiro (asha@loc.gov)
Date: Tue Jul 12 2005 - 12:44:24 CDT

  • Next message: John Hudson: "Re: Missing capital H from Unicode range (see 1E96)"

    ** Low Priority **

    We have an XML based application that specifies UTF-8 files as input. Occasionally users will
    include numeric character entites, for example é for e acute instead of the UTF-8
    equivalent of C3 A9. My question is: Is this legal UTF-8? And are numeric or symbolic character
    entites valid for Ascii-7 characters such as "<"? My guess is the first one is not legal,
    and the second one is application defined, i.e. Unicode says nothing about it. Am I
    right?



    This archive was generated by hypermail 2.1.5 : Tue Jul 12 2005 - 12:47:32 CDT