From: Philippe Verdy (verdy_p@wanadoo.fr)
Date: Mon Oct 22 2007 - 15:32:13 CDT
> De : Hans Aberg [mailto:haberg@math.su.se]
> Envoyé : lundi 22 octobre 2007 22:24
> À : verdy_p@wanadoo.fr
> Cc : Unicode List
> Objet : Re: FYI: Regex paper for UTC
>
> On 22 Oct 2007, at 22:16, Philippe Verdy wrote:
>
> > Note that L may contain strings containing strings like a base
> > letter followed by a diacritic, which is canonically equivalent to
> > its precomposed form. Would only the precomposed form would be
> > allowed in [L] ? The definition of "length" is not precise enough.
> > Forme the composed nas precomposed letters should behave
> > identically, ans so their "length" should be 1 in both case. If so,
> > then [L] will contain BOTH the precomposed letter and the sequence
> > of a letter and a diacritic.
>
> Read all the stuff. There are different constructions.
No, I've read the "stuff". You simply forget a formal definition of
"length".
> The main point is that the operations you seek are restrictions of
> the language set operations.
No. I read your text the way it is : ambiguous.
This archive was generated by hypermail 2.1.5 : Mon Oct 22 2007 - 15:33:54 CDT