RE: UTF-8 text samples

From: Kevin Bracey (
Date: Fri Oct 16 1998 - 04:41:05 EDT

In message <>
          Murray Sargent <> wrote:

> Donald's UTF-8 file should begin with a UTF-8 BOM in order to identify it
> as a UTF-8 encoded file. The starting bytes should be 0xEF 0xBB 0xBF.
> These bytes are discarded when reading the file in and added when writing
> the file out.

Don't be silly. It might be helpful to do so for some applications that
attempt to autodetect encodings, but it's not necessary.

What Donald did do wrong (or rather his software/OS did) was to label the
attachment as:

Content-Type: TEXT/PLAIN; charset=ISO-8859-1; name=MES

That totally scuppers display of the attachment in a charset-aware mail

Kevin Bracey, Senior Software Engineer
Acorn Computers Ltd                           Tel: +44 (0) 1223 725228
Acorn House, 645 Newmarket Road               Fax: +44 (0) 1223 725328
Cambridge, CB5 8PB, United Kingdom            WWW:

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:42 EDT