From: Doug Ewell (dewell@adelphia.net)
Date: Tue Sep 19 2006 - 21:14:34 CDT
Hans Aberg <haberg at math dot su dot se> wrote:
> It is probably more efficient to translate the stream into code points
> and then use a compression technique on that, because then the full
> character structure is taken into account. Then it does not matter
> which character encoding is used.
If you have not yet read Unicode Technical Note #14, particularly the
sections on "general-purpose compression" and "two-layer compression,"
you might wish to do so.
-- Doug Ewell Fullerton, California, USA http://users.adelphia.net/~dewell/ RFC 4645 * UTN #14
This archive was generated by hypermail 2.1.5 : Tue Sep 19 2006 - 21:20:04 CDT