Michael Kaplan wrote,
>
> The algorithm is described in the standard, a list of the code points would
> not only be huge but it would also be a waste....
>
There's no doubt that it's huge.
For some text based apps, having all this kind of data already
parsed and in a look-up table or database can be quite handy,
though. Saves pasting code into various and sundry apps, for
one thing, and for another, makes look-ups almost trivial.
If anyone would benefit from a plain text listing of Unicodes
and their corresponding surrogate pairs, please contact me
off list and I'll be pleased to send it along.
Format is (space) Unicode (space) SurrogatePair... on each line,
the lines separated by 0D0A. Let me know if you need it with
a different line separator, or comma between the Unicode and
the pair, or whatever. File size (*.ZIP) is 603358 bytes. Covers
Planes 1 & 2.
Best regards,
James Kass.
This archive was generated by hypermail 2.1.2 : Thu Nov 22 2001 - 21:04:03 EST