RE: GB18030

From: Carl W. Brown (cbrown@xnetinc.com)
Date: Fri Sep 21 2001 - 10:56:25 EDT


Charlie,

GB18030 is designed to support all Unicode characters. It has the capacity
to also encode additional characters. I know of no plans to do so.

I don't think it will have much affect on Unicode. Most systems that handle
GB18030 will want to convert it to Unicode first to reduce processing
overhead. With most of the common MBCS code pages you can determine the
length of the character from the first byte. With GB18030 you some times
have to check the first two characters. UTF-8 for example is an MBCS
character set but if I am going backwards through a string I can do so.
With GB18030 I must start over from the beginning of the string to find the
start of the previous character.

It is smaller that UTF-8 for Chinese and larger for anyone else.

Carl

> -----Original Message-----
> From: unicode-bounce@unicode.org
> [mailto:unicode-bounce@unicode.org]On Behalf Of Charlie Jolly
> Sent: Friday, September 21, 2001 1:42 AM
> To: unicode@unicode.org
> Subject: GB18030
>
>
> GB18030
>
> In what ways will this effect Unicode?
>
> Does it contain anything that Unicode doesn't?
>
>
>
>



This archive was generated by hypermail 2.1.2 : Fri Sep 21 2001 - 09:34:18 EDT