On the subject of Ethiopic text, I hope that you know about this work:
@String{j-TUGboat = "TUGboat"}

@Article{Andulem:TB10-3-352,
  author =       "Abass Andulem",
  title =        "{{The road to Ethiopic {\TeX}}}",
  journal =      j-TUGboat,
  year =         "1989",
  volume =       "10",
  number =       "3",
  pages =        "352--354",
  month =        nov,
}
At ftp://ctan.tug.org/language/ethiopia/ there is a collection of TeX
styles, TeX extensions, and TeX fonts that build on this work by Abass
and on later work by others.
On the subject of regular-expression support for Unicode, the POSIX
definition of regexps includes character classes such as [:alpha:]
and [:digit:]. I believe that the regexp package in GNU gawk,
available at
ftp://prep.ai.mit.edu/pub/gnu/gawk-3.0.3.tar.gz ,
implements the POSIX definition. While it is still based on 8-bit
characters, it might prove a suitable starting point for Unicode
support.
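To illustrate the gap between 8-bit and Unicode-aware class matching,
here is a minimal sketch in Python rather than gawk. It rests on some
assumptions: Python's re module does not support POSIX bracket classes
such as [[:alpha:]], so \w stands in as a rough analogue, and the helper
names is_word_text and is_word_bytes are hypothetical. It shows only the
general behaviour, not how gawk's regexp package works internally.

```python
import re

# Hypothetical helpers for illustration only. Python's re module has no
# POSIX bracket classes ([[:alpha:]] etc.), so \w is a rough stand-in.
WORD_TEXT = re.compile(r"\w+")    # str pattern: \w is Unicode-aware
WORD_BYTES = re.compile(rb"\w+")  # bytes pattern: \w is ASCII-only


def is_word_text(text: str) -> bool:
    """True if every character of text is a Unicode word character."""
    return WORD_TEXT.fullmatch(text) is not None


def is_word_bytes(data: bytes) -> bool:
    """True if every byte of data is an ASCII word character."""
    return WORD_BYTES.fullmatch(data) is not None


# Three Ethiopic syllables (U+1230, U+120B, U+121D).
sample = "\u1230\u120b\u121d"

print(is_word_text(sample))                   # True
print(is_word_bytes(sample.encode("utf-8")))  # False
print(is_word_bytes(b"gawk"))                 # True
```

The byte-oriented matcher rejects the UTF-8 encoding of the Ethiopic
text because each byte is examined on its own, which is the limitation
that an 8-bit regexp package would need to overcome.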
----------------------------------------------------------------------------
- Nelson H. F. Beebe Tel: +1 801 581 5254 -
- Center for Scientific Computing FAX: +1 801 581 4148 -
- University of Utah Internet e-mail: beebe@math.utah.edu -
- Department of Mathematics, 105 JWB beebe@acm.org -
- 155 S 1400 E RM 233 beebe@ieee.org -
- Salt Lake City, UT 84112-0090, USA URL: http://www.math.utah.edu/~beebe -
----------------------------------------------------------------------------