Re: A UTF-8 based News Service

From: David Starner (dstarner98@aasaa.ofe.org)
Date: Fri Jul 13 2001 - 14:15:29 EDT


From: Keld Jørn Simonsen <keld@dkuug.dk>
> UTF-16 is not just 2 bytes, it is sometimes 2 and sometimes 4 bytes.
> IETF is recommending UTF-8 as the prime charset in all Internet protocols.

Blah. For his purposes, UTF-16 is 2 bytes. The odds his newspaper will have
significant quantities of non-BMP in the near future seems very unlikely.
Yes, UTF-8 is the prime charset in all Internet protocols; that doesn't mean
they can't use others, it just means that the default should be UTF-8, and
in some situations, like the dict protocol, where you don't want to mess
with charset negotiation, you can hardwire it to UTF-8. In web and email,
you have the choice of charsets, and if a non-UTF-8 charset will make your
life and the life of your readers easier, go for it.

--
David Starner - dstarner98@aasaa.ofe.org



This archive was generated by hypermail 2.1.2 : Fri Jul 13 2001 - 16:03:30 EDT