RE: PDUTR #26 posted

From: Marco Cimarosti (marco.cimarosti@essetre.it)
Date: Mon Sep 17 2001 - 05:51:44 EDT


Julie Doll Allen wrote:
> Proposed Draft Unicode Technical Report #26: Compatibility Encoding
> Scheme for UTF-16: 8-Bit (CESU-8) is now available at:
> http://www.unicode.org/unicode/reports/tr26/

Does renaming "UTF-8S" to "CESU-8" fix all the issues that were discussed on
this mailing list at the beginning of last spring?

Specifically:

- How will it be ensured that UTF-8 and CESU-8 (former UTF-8S) will not be
mixed up in the same environment? How should an UTF-8 application behave if
it accidentally receives a CESU-8 surrogate sequence? How does an
application which relies on CESU-8 binary sorting behave if it accidentally
receives an UTF-8 4-byte sequence?

- What is the need for an official document that describes "an alternate
encoding to UTF-8 for internal use"? Lots of applications implement some
sort of internal hacks, but they don't issue UTF's to tell the world about
it.

_ Marco



This archive was generated by hypermail 2.1.2 : Mon Sep 17 2001 - 04:50:54 EDT