Re: Regular expressions in Unicode (Was: Ethiopic text)

From: Kolbjørn Aambø (k.h.aambo@ub.uio.no)
Date: Fri Mar 13 1998 - 03:15:38 EST


After reading this discussion through for the last few minutes I just
wander if there with UNICODE characters is any alternative to spesifying a
sequence of interesting characters and their relations like this
"aboriginal character spesification":

Aa:á:Àà:â:Ãã,Bb,Cc:Çç,Dd,Ee:Ééèêë,Ff,Gg,Hh,I:¡iíìîï,Jj,Kk,Ll,Mm,Nn:Ññ,Oo:óòô:Õõ:
‘¦,Pp,Qq,Rr,Ss,Tt,Uu:úùû,Vv,Ww,Xx,Yy:Üü,Zz,Ææ:Ää,Øø:Öö,Åå.

call it a local collating sequence if you wish...

THEN the Regualar expresision [A..Å] will at least mean all characters in
the above sequence as I see it. All characters that are NOT mentioned in
the aboriginal character spesification will then be deamed outside by the
regular expression...

only my 2 øre worth...

Kolbjørn



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:39 EDT