RE: Unicode to UTF-8

From: Marco.Cimarosti@icl.com
Date: Wed Mar 15 2000 - 13:02:25 EST


James Agenbroad wrote:
> Could someone more numerate than I confirm two of my attempts
> to convert
> from standard 16-bit Unicode to UTF-8 (or if I'm wrong tell
> me what they should be)?

I am not sure whether I am more numerate, but thanks anyway for the new (to
me) English adjective. I'll use it on the next occasion.

> Unicode UTF-8 Name
> hex = binary binary = hex

Change "Unicode" in, e.g., "UTF-16". UTF-8 is as standard Unicode as 16-bit
representations.

> 00B1 0000 0000 1011 0001 1100 0010 1011 0001 C2B1
> Plus/minus

Change "Plus/minus" in "PLUS-MINUS SIGN": Unicode names are normative
properties.

> 26D6 0010 0110 0110 1101 1110 0010 1001 1001 1010 1101 E299AD Flat

Change "26D6" in "266D".
Change "flat" in "MUSIC FLAT SIGN".

Better using the U+ prefix presenting Unicode scalar values (e.g. "U+00B1"),
and showing octet boundaries in UTF-8 (e.g. "C2,B1"). These conventions
enhance readability.

The UTF-8 values are all correct, as far as I can see.

_ Marco



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:00 EDT