The Unicode Consortium Discussion Forum

The Unicode Consortium Discussion Forum

 Forum Home  Unicode Home Page Code Charts Technical Reports FAQ Pages 
 
It is currently Wed Oct 22, 2014 11:30 pm

All times are UTC - 6 hours [ DST ]




Post new topic Reply to topic  [ 3 posts ] 
Author Message
 Post subject: U+FEFF as a zero width no-break space char is deprecated
PostPosted: Mon Oct 15, 2012 9:54 am 
Offline

Joined: Sat Aug 06, 2011 9:02 am
Posts: 43
I believe there is an error in the following statement in page 63 of Chapter3: Conformance

    • For example, when using UTF-16LE, pairs of bytes are interpreted as UTF-16 code units using the little-endian byte order convention, and any initial <FF FE> sequence is interpreted as U+FEFF zero width no-break space (part of the text), rather than as a byte order mark (not part of the text). (See D97.)

as the use of FEFF is deprecated as a zero width nobreak space character, even when it's not at the beginning of the file. Clearly when it is a the beginning of the file FEFF should be interpreted as the BOM character.


Top
 Profile  
 
 Post subject: Re: U+FEFF as a zero width no-break space char is deprecated
PostPosted: Mon Oct 15, 2012 10:07 pm 
Offline

Joined: Sat Dec 04, 2010 10:25 pm
Posts: 4
A) It may be deprecated, but that doesn't change its meaning when it does occur.

B) In the given example, the point is that you already know the text is little-endian (it was declared UTF-16LE), therefore there is no BOM in the text. If the initial sequence is <FF FE>, this is, indeed, interpreted as U+FEFF zero width no-break space (part of the text), rather than as a byte order mark (not part of the text).

The statement you quoted from the spec is correct as written.


Top
 Profile  
 
 Post subject: Re: U+FEFF as a zero width no-break space char is deprecated
PostPosted: Tue Oct 16, 2012 7:30 am 
Offline

Joined: Sat Aug 06, 2011 9:02 am
Posts: 43
You're right. I should have read Chapter 3, up to pages 97 and 98, where this is clearly explained.

Thanks


Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 3 posts ] 

All times are UTC - 6 hours [ DST ]


Who is online

Users browsing this forum: No registered users and 1 guest


Quick-mod tools:
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Jump to:  
cron
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group
Template made by DEVPPL.com