[unicode] Re: Unicode editing

From: Roozbeh Pournader (roozbeh@sharif.edu)
Date: Thu Mar 22 2001 - 16:45:52 EST


On Wed, 21 Mar 2001, Marco Cimarosti wrote:

> struct MyWysiwygGlyph
> {
> wchar_t GlyphCode;
> int EmbeddingLevel;
> };
>
> I think that Roozbeh had something quite similar in mind.

Yes. I was not sure that if that's enough, but after this discussion, I
believe them to be enough. I will try to implement that as part of
fribidi, the first GPL-compatible Unicode-conformant bidi engine in the
wild.

> 3) The lowest level in each paragraph *must* be either 0 (for a LTR
> paragraph) or 1 (for a RTL paragraph).

I have read UAX #9 many times; where have you concluded that from?

> 4) Embedding levels are defined implicitly (e.g. a number in Arabic has an
> embedding level higher that the surrounding text) or by means of explicit
> bidi controls. In any case, they are *orthogonal* to markup tags. So, if you
> have a tagging scheme that imposes that tags are nested into each other
> (e.g. XML), embedding levels do not necessarily follow the rule. E.g., see
> how tagging and Unicode embedding overlap in: "<BOLD> abc &RLE; def </BOLD>
> ghi &PDF; ijk".

Oh, oh! That kind of thing is illegal in XML. Just take a look at "Bidi
embedding controls" section in UTR #20 at:

        http://www.unicode.org/unicode/reports/tr20/#Bidi

--roozbeh



This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:17:15 EDT