Byte Order Marks

From: Tomas McGuinness (tomas.mcguinness@cmg.nl)
Date: Tue Apr 10 2001 - 05:49:58 EDT

Next message: Tomas McGuinness: "gb2312"
Previous message: Antoine Leca: "Re: Digits in Unicode Names"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

Hi,

When looking at a document would it be safe to assume that if you found any
of the following Byte Order Marks
* 0xFFFE (UCS-2 Little Endian)
* 0xFEFE (UCS-2 Big Endian)
* 0xEFBBBF (UTF-8)
That the document is encoded with that encoding format. That means that if I
found the first 3 octets to be EF BB EF could I assume I am dealing with a
UTF-8 Document.

Apart from UTF and Unicode/UCS encoding formats do any other "legacy"
character sets use Byte Order Marks?

Regrads,

Tom.

Tomas McGuinness Consultant
> --------------------------------------------------------------------------
> ----------------
> University Technology Park * +353 21 4933 277
> Curraheen Rd, Cork * +353 21 4933 201
> * tomas.mcguinness@cmg.nl
> --------------------------------------------------------------------------
> ----------------
> CMG Telecom Products Division
> Product Development, Cork
> --------------------------------------------------------------------------
> ----------------
>
>
>

Next message: Tomas McGuinness: "gb2312"
Previous message: Antoine Leca: "Re: Digits in Unicode Names"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:17:15 EDT