Private Use Surrogate Pairs

From: James E. Agenbroad (jage@loc.gov)
Date: Wed May 08 2002 - 15:03:07 EDT


                                         Wednesday, May 8, 2002
On page 322 of version 3.0 of the Unicode Standard, the description of the
Private-Use High Surrogates says: "This mechanism allows for a total of 131,068
(= 128 x 1024 - 4) private-use characters representable by means of
surrogate pairs." I understand that the 128 refers to the high-surrogate
codes U+DB80 to U+DBFF. It would be helpful if the text also stated that the
1,024 refers to the low-surrogate codes U+DC00 to U+DFFF, and which four
codes are excluded. (For a while I had thought that the low-surrogate
code could be any of the private-use codes from U+E000 to U+F8FF,
but 128 x 6,400 = 819,200, so that couldn't be right.)
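
The arithmetic in the quoted passage can be checked with a short Python
sketch. It assumes the standard UTF-16 surrogate decoding formula, and it
assumes that the four excluded codes are the noncharacters that end planes
15 and 16 (the last two code points of each plane). The names decode_pair,
is_noncharacter, HIGH_PRIVATE, LOW and BMP_PRIVATE are illustrative only and
do not come from the Standard.

```python
# Private-use high surrogates: U+DB80..U+DBFF (128 codes).
HIGH_PRIVATE = range(0xDB80, 0xDC00)
# Low surrogates: U+DC00..U+DFFF (1,024 codes).
LOW = range(0xDC00, 0xE000)
# BMP Private Use Area: U+E000..U+F8FF (not used in surrogate pairs).
BMP_PRIVATE = range(0xE000, 0xF900)


def decode_pair(high, low):
    """Combine a UTF-16 high/low surrogate pair into one code point."""
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


def is_noncharacter(cp):
    """True for the last two code points of any plane (xxFFFE, xxFFFF)."""
    return (cp & 0xFFFE) == 0xFFFE


# Every code point reachable from a private-use high surrogate.
all_cps = [decode_pair(h, l) for h in HIGH_PRIVATE for l in LOW]
excluded = [cp for cp in all_cps if is_noncharacter(cp)]
usable = len(all_cps) - len(excluded)

print(len(HIGH_PRIVATE), "x", len(LOW), "=", len(all_cps))
print("excluded:", [f"U+{cp:X}" for cp in excluded])
print("usable private-use code points:", usable)
print("size of BMP Private Use Area:", len(BMP_PRIVATE))
```

Under these assumptions the pairs cover U+F0000 to U+10FFFF, and removing
the four noncharacters from the 128 x 1024 combinations leaves the 131,068
quoted from the Standard.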

     Regards from the mathematically challenged,
          Jim Agenbroad ( jage@LOC.gov )
     "It is not true that people stop pursuing their dreams because they
grow old, they grow old because they stop pursuing their dreams." Adapted
from a letter by Gabriel Garcia Marquez.
     The above are purely personal opinions, not necessarily the official
views of any government or any agency of any.
     Addresses: Office: Phone: 202 707-9612; Fax: 202 707-0955; US
mail: I.T.S. Sys.Dev.Gp.4, Library of Congress, 101 Independence Ave. SE,
Washington, D.C. 20540-9334 U.S.A.
Home: Phone: 301 946-7326; US mail: Box 291, Garrett Park, MD 20896.


