From: Markus Scherer (markus.icu@gmail.com)
Date: Fri Mar 17 2006 - 12:20:54 CST
Just a guess: U+20AC (Euro) got converted to windows-1252 0x80 (Euro),
mis-interpreted as U+0080 (C1 control), and re-encoded in UTF-8.
windows-1252 is a sort of superset of ISO 8859-1 in that it replaces
most C1 control codes with various useful graphic characters like the
Euro sign and curly quotes.
markus
On 3/17/06, Kornkreismuster@web.de <Kornkreismuster@web.de> wrote:
> In the Unicode-Charts a € (Euro-sign) has the hex20 AC.
> If I convert this into UTF-8 I get a hexE2 82 AC.
>
> Now the receiving System gets this, handles this as a UTF-8 encoding, but can't display the character.
> What the receiving System can display is a hexC2 80, which is the UTF-8 encoded form of hex80.
This archive was generated by hypermail 2.1.5 : Fri Mar 17 2006 - 12:26:19 CST