Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)

From: Peter Kirk (peter.r.kirk@ntlworld.com)
Date: Tue Aug 05 2003 - 18:31:03 EDT

Next message: Kenneth Whistler: "Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)"

Previous message: Michael Everson: "Re: Handwritten EURO sign"
In reply to: Mark Davis: "Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)"
Next in thread: Curtis Clark: "Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)"
Reply: Curtis Clark: "Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

On 05/08/2003 15:09, Mark Davis wrote:

>><< Zs, Zl, and Zp are considered format characters, but their
>>membership in the Z (separator) class takes precedence over their
>>membership in the Cf class, because the General Category assigns
>>
>>
>only
>
>
>>a single value to each character. >>
>>
>>
>
>Whenever you have a question about the status of a character, you need
>to look it up in the UCD. You can either do that by going through the
>unicode website, or if you want a more readable interface, use the ICU
>character browser, which formats that data.
>
>Look at space, U+0020.
>
>http://oss.software.ibm.com/cgi-bin/icu/ub/utf-8/?go=0020&ch.x=4&ch.y=7�O 
>
>The general category is Space_Separator, *not* a format character.
>
>Now wording there could definitely be clearer, but the operant phrase
>is:
>
>
>
>>...but their
>>membership in the Z (separator) class *takes precedence* over their
>>membership in the Cf class...
>>
>>
>
>So it would be cleared to say something like:
>
>In many ways the characters, Zs, Zl, and Zp, are similar to format
>characters, but because their general usage is significantly different
>they are broken out into a separate General Category, as Separator
>characters.
>
>Mark
>__________________________________
>http://www.macchiato.com
>► “Eppur si muove” ◄
>
>
>
>
Thank you, Mark. This helps to clarify things, but still doesn't
explicitly answer my question of how to encode "a sentence like "In this
language the diacritic ^ may appear above the letters ...", but instead
of ^ I want to use a combining character" and want to display exactly
one space before the combining character - do I encode two spaces or one?

-- 
Peter Kirk
peter.r.kirk@ntlworld.com
http://web.onetel.net.uk/~peterkirk/

Next message: Kenneth Whistler: "Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)"
Previous message: Michael Everson: "Re: Handwritten EURO sign"
In reply to: Mark Davis: "Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)"
Next in thread: Curtis Clark: "Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)"
Reply: Curtis Clark: "Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Tue Aug 05 2003 - 19:23:24 EDT