Re: A UTF-8 based News Service

From: David Starner (dstarner98@aasaa.ofe.org)
Date: Thu Jul 12 2001 - 21:14:25 EDT


> As someone involved in the service I often wish there was some
> form of "compressed" Unicode encoding. The 3-byte penalty that
> Ethiopic bears under UTF-8 turns into higher bandwidth that web
> hosting services meter and charge for by the megabyte. For a
> popular site this soon makes UTF-8 a costly option to support.
>
> A system analogous to iso-8859-x whereby Ethiopic and other scripts
> in the 3 byte range could be shifted back into the 2 byte range
> might help (generally only English and Ethiopic are desired together).
>
> Fortunately there is mod_gzip for Apache. I would appreciate any
> information about other options.

What about UTF-16? It encodes every character in the Basic Multilingual Plane,
Ethiopic included, as 2 bytes, so the 3-byte penalty goes away. UTF-16 should
be supported by all recent Unicode-supporting web browsers.
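
A quick way to check the size difference is a minimal sketch like the one
below. The helper name encoded_sizes and the sample strings are my own
assumptions for illustration, not anything from the service in question.

```python
def encoded_sizes(text):
    """Return the byte length of text under UTF-8 and UTF-16 (no BOM)."""
    return {
        "utf-8": len(text.encode("utf-8")),
        # "utf-16-be" omits the 2-byte byte order mark that "utf-16" would add
        "utf-16": len(text.encode("utf-16-be")),
    }

# Hypothetical sample: three syllables from the Ethiopic block (U+1200..U+137F)
ethiopic = "\u1230\u120b\u121d"

print(encoded_sizes(ethiopic))  # {'utf-8': 9, 'utf-16': 6}
print(encoded_sizes("hello"))   # {'utf-8': 5, 'utf-16': 10}
```

Note that ASCII text doubles in size under UTF-16, so for pages that mix
English and Ethiopic the actual saving depends on the proportion of each.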

--
David Starner - dstarner98@aasaa.ofe.org


