RE: How will software source code represent 21 bit unicode characters?

From: Martin Duerst (duerst@w3.org)
Date: Tue Apr 17 2001 - 12:14:25 EDT


At 09:29 01/04/17 -0500, jarkko.hietaniemi@nokia.com wrote:
> > In a perfect world, we would probably have an enclosing symbol (e.g.
> > '\<4E00>') so that the number can be variable length.
>
><tuning into another language channel>
>In Perl the notation is \x{...}, where ... is hexdigit sequence:
>\x{41} is LATIN CAPITAL LETTER A while \x{263a} is WHITE SMILING FACE,
>and \x{1D400} is MATHEMATICAL BOLD CAPITAL A.

And in XML it's &#x4E00; which can also be variable length.
There is also &#ddddd; where ddddd is a decimal number, but
that's not that useful anymore now that standards use hexadecimal.

Regards, Martin.



This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:17:16 EDT