Re: 'code unit' and 'code point' meaning check

From: Philippe Verdy (verdy_p@wanadoo.fr)
Date: Thu May 15 2003 - 13:56:44 EDT

Next message: Michael \(michka\) Kaplan: "Re: 'code unit' and 'code point' meaning check"

Previous message: Philippe Verdy: "Re: 8-bit encodings and ASCII (was: Unicode conformant character encodings and us-ascii)"
In reply to: Otto Stolz: "Re: 'code unit' and 'code point' meaning check"
Next in thread: Michael \(michka\) Kaplan: "Re: 'code unit' and 'code point' meaning check"
Reply: Michael \(michka\) Kaplan: "Re: 'code unit' and 'code point' meaning check"
Reply: Doug Ewell: "Re: 'code unit' and 'code point' meaning check"
Reply: Peter_Constable@sil.org: "Re: 'code unit' and 'code point' meaning check"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

From: "Otto Stolz" <Otto.Stolz@uni-konstanz.de>
> Ben Dougall wrote:
>
> > <guess> that area is full of surrogates. so they need another code point
> > to make up a single character. on their own 0xd800-0xdfff are 1/2
> > characters :) </guess>
>
> Oh, no! Again, you are confusing code-points and code-units
> (in other words: Unicode and its UTFs).
>[snip]
> In a nutshell: Unicode is not UTF-16.

Or in other words, Unicode defines *code points* only, not code units (this is left to specific encodings used to serialize it, including UTF-*, and "compressed" BOCU and CESU encodings, which can all be computed algorithmically from Unicode code points).

Note that some UTF-* encodings are now described by Unicode.org as standards, but is technically an annex to the standard, and not necessary to its definition.

Next message: Michael \(michka\) Kaplan: "Re: 'code unit' and 'code point' meaning check"
Previous message: Philippe Verdy: "Re: 8-bit encodings and ASCII (was: Unicode conformant character encodings and us-ascii)"
In reply to: Otto Stolz: "Re: 'code unit' and 'code point' meaning check"
Next in thread: Michael \(michka\) Kaplan: "Re: 'code unit' and 'code point' meaning check"
Reply: Michael \(michka\) Kaplan: "Re: 'code unit' and 'code point' meaning check"
Reply: Doug Ewell: "Re: 'code unit' and 'code point' meaning check"
Reply: Peter_Constable@sil.org: "Re: 'code unit' and 'code point' meaning check"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Thu May 15 2003 - 14:59:41 EDT