>A sublety: the i18n spec refers to UCS, which has a consquence
>when going beyond BMP. There UCS has well defined numbers, while I
>do not know whether Unicode has this.
True. The numbers would correspond to UCS-4 past the BMP, as the i18n
draft says. Unicode would represent these codes as UTF-16.
This raised a question in my mind about the i18n draft and surrogates,
and I discovered that it says nothing. Since numeric character references
are to the UCS-4 form, it probably would have been better if the
surrogate range had been excluded.
David
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:33 EDT