Re: utf-8 != latin-1

From: Doug Ewell (dewell@compuserve.com)
Date: Sat Oct 14 2000 - 01:13:58 EDT

Next message: Doug Ewell: "Re: "Giga Character Set": Nothing but noise"
Previous message: Steven R. Loomis: "utf-8 != latin-1"
Maybe in reply to: Steven R. Loomis: "utf-8 != latin-1"
Next in thread: George Zeigler: "Re: utf-8 != latin-1"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

"Steven R. Loomis" <srl@jtcsv.com> wrote:

> What happened was that the sequence AD 63 61 73 was
> interpreted as U+E54E U+DC73..

Why? As an illegal UTF-8 sequence, it shouldn't be interpreted as
anything.

John Cowan's "utf" perl script (which carries the appropriate
disclaimers about no error checking) converts that sequence to U+D94E
U+DC73, which seems a bit more reasonable -- at least it's a complete
surrogate pair.

-Doug Ewell
Fullerton, California

Next message: Doug Ewell: "Re: "Giga Character Set": Nothing but noise"
Previous message: Steven R. Loomis: "utf-8 != latin-1"
Maybe in reply to: Steven R. Loomis: "utf-8 != latin-1"
Next in thread: George Zeigler: "Re: utf-8 != latin-1"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:14 EDT