Re: Displaying Plane 1 characters

From: Michael Everson (everson@indigo.ie)
Date: Thu Nov 12 1998 - 05:51:40 EST


Ar 18:02 -0800 1998-11-11, scríobh Keld J|rn Simonsen:
>> >Java is also going to get problems: "\u10208" would be mistaken as
>> >U+1020 <undefined Mongolian character> U+0038 DIGIT EIGHT instead
>> >of U-00010208 ETRUSCAN LETTER TH.
>>
>> \uD800\uDE08 is an obvious answer for Java, since Java's 16-bit data
>> type implies its use of UTF-16.
>
>Yoou should not use \uxxxx nothation for surrogates,
>as surrogates are not charcters in neither Unicode nor 10646,
>and thus the short identifiers cannot be used.

WG2 has provisionally accepted and provisionally allocated Etruscan,
Gothic, Western Musical Symbols, and Byzantine Musical Symbols to Plane 1.
Yes, it hasn't been published or ballotted or anything, but one has to have
a way of referring to those (provisional) code positions.

--
Michael Everson, Everson Gunn Teoranta ** http://www.indigo.ie/egt
15 Port Chaeimhghein Íochtarach; Baile Átha Cliath 2; Éire/Ireland
Guthán: +353 1 478-2597 ** Facsa: +353 1 478-2597 (by arrangement)
27 Páirc an Fhéithlinn;  Baile an Bhóthair;  Co. Átha Cliath; Éire



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:42 EDT