Re: ASCII control codes in sequences of multibyte character sets from Doug Ewell on 2013-09-02 (Unicode Mail List Archive)

From: Doug Ewell <doug_at_ewellic.org>
Date: Mon, 2 Sep 2013 18:08:41 -0600

I asked because, as Philippe said, an octet is the same as an 8-bit
byte.

--
Doug Ewell | Thornton, CO, USA
http://ewellic.org | @DougEwell 
-----Original Message----- 
From: SteffenDaodeNurpmeso
Sent: Monday, September 2, 2013 12:45
To: Doug Ewell
Cc: unicode_at_unicode.org
Subject: Re: ASCII control codes in sequences of multibyte character 
sets
"Doug Ewell" <doug_at_ewellic.org> wrote:
|How would you define the difference between multi-octet and
|multi-byte?
hm, to me multi-octet is an encoding which uses a fixed amount of
octets (8-bit bytes) per character, e.g., UCS-2, UCS-4 etc.,
whereas a multi[-]byte character set is designed as a 8-bit
character set, but which may use multiple 8-bit bytes per
character, possibly even fixed.
I.e., in the end i think it comes out as "are embedded NUL octets
a regular part of the character set".
You're asking ... i'm sure there is an officially accepted
definition somewhere?
--steffen

Received on Mon Sep 02 2013 - 19:11:12 CDT

This archive was generated by hypermail 2.2.0 : Mon Sep 02 2013 - 19:11:13 CDT