Re: Unicode String Models

This message: [ Message body ] [ Respond ] [ More options ]
Related messages: [ Next message ] [ Previous message ] [ In reply to ] [ Next in thread ] [ Replies ]

From: Hans Åberg via Unicode <unicode_at_unicode.org>
Date: Tue, 11 Sep 2018 19:13:28 +0200

> On 11 Sep 2018, at 13:13, Eli Zaretskii via Unicode <unicode_at_unicode.org> wrote:
>
> In Emacs, each raw byte belonging
> to a byte sequence which is invalid under UTF-8 is represented as a
> special multibyte sequence. IOW, Emacs's internal representation
> extends UTF-8 with multibyte sequences it uses to represent raw bytes.
> This allows mixing stray bytes and valid text in the same buffer,
> without risking lossy conversions (such as those one gets under model
> 2 above).

Can you give a reference detailing this format?
Received on Tue Sep 11 2018 - 12:13:54 CDT

This archive was generated by hypermail 2.2.0 : Tue Sep 11 2018 - 12:13:54 CDT