RE: UTF-8 text samples

From: Kevin Bracey (kbracey@acorn.com)
Date: Fri Oct 16 1998 - 04:41:05 EDT


In message <9810160027.AA10060@unicode.org>
          Murray Sargent <murrays@microsoft.com> wrote:

> Donald's UTF-8 file should begin with a UTF-8 BOM in order to identify it
> as a UTF-8 encoded file. The starting bytes should be 0xEF 0xBB 0xBF.
> These bytes are discarded when reading the file in and added when writing
> the file out.
>

Don't be silly. It might be helpful to do so for some applications that
attempt to autodetect encodings, but it's not necessary.
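
A minimal sketch of that point in Python, assuming a hypothetical helper
named decode_utf8 (not from any software discussed in this thread): the
0xEF 0xBB 0xBF signature is treated as optional, so it is dropped when
present and the data decodes as UTF-8 either way.

```python
import codecs


def decode_utf8(data: bytes) -> str:
    """Decode UTF-8 bytes, dropping a leading BOM signature if present.

    Hypothetical helper for illustration only.
    """
    # codecs.BOM_UTF8 is the three bytes 0xEF 0xBB 0xBF.
    if data.startswith(codecs.BOM_UTF8):
        data = data[len(codecs.BOM_UTF8):]
    return data.decode("utf-8")
```

A reader written this way accepts files with or without the signature,
which is why the signature is a convenience rather than a requirement.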

What Donald did get wrong (or rather, what his software/OS did) was to label
the attachment as:

Content-Type: TEXT/PLAIN; charset=ISO-8859-1; name=MES

That totally scuppers display of the attachment in a charset-aware mail
reader, which will trust the label and decode the UTF-8 bytes as ISO-8859-1.
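
To illustrate the effect, here is a rough sketch in Python of what a
charset-aware reader might do with that header. The function names
declared_charset and decode_as_labelled are assumptions made up for this
example; only the standard library email.message.Message class is real.

```python
from email.message import Message


def declared_charset(content_type: str) -> str:
    """Return the charset parameter of a Content-Type value, lowercased."""
    msg = Message()
    msg["Content-Type"] = content_type
    # Fall back to us-ascii when no charset parameter is given.
    return msg.get_content_charset("us-ascii")


def decode_as_labelled(payload: bytes, content_type: str) -> str:
    """Decode the payload the way the label says, right or wrong."""
    return payload.decode(declared_charset(content_type))


label = "TEXT/PLAIN; charset=ISO-8859-1; name=MES"
utf8_bytes = "\u00e9".encode("utf-8")  # e-acute, two bytes: 0xC3 0xA9

# Trusting the wrong label turns one character into two.
garbled = decode_as_labelled(utf8_bytes, label)
```

With the label shown, the two UTF-8 bytes of a single accented letter come
out as two separate Latin-1 characters; labelling the part charset=UTF-8
would give the reader what it needs, BOM or no BOM.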

-- 
Kevin Bracey, Senior Software Engineer
Acorn Computers Ltd                           Tel: +44 (0) 1223 725228
Acorn House, 645 Newmarket Road               Fax: +44 (0) 1223 725328
Cambridge, CB5 8PB, United Kingdom            WWW: http://www.acorn.co.uk/


