RE: Normalization and the sample code

From: Addison Phillips [wM] (aphillips@webmethods.com)
Date: Thu Jun 14 2001 - 00:36:50 EDT


"Never Mind"

I found that recompiling the data tables for ICU4J fixed the problem. It wasn't very helpful that I had a slightly older ICU4J. The one they released tonight is fine.

This does mean that the sample Java applet on the Unicode website has this problem and ought to be corrected. The tables and charts, of course, are correct, but it took a lot of spelunking to confirm that I wasn't crazy and that #2 below (a bug in the code) was the answer.
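
For anyone who wants to check their own build, here is a minimal sketch of the test I was effectively running by hand. It is an assumption-laden stand-in: it uses the java.text.Normalizer class found in current JDKs rather than the ICU4J classes or the UAX15 sample code discussed in this thread, and the class name NfkcCheck and its helper methods are hypothetical names of my own.

```java
import java.text.Normalizer;

// Hypothetical helper class, not part of ICU4J or the UAX15 sample code.
public class NfkcCheck {

    // Normalize to NFKC using the JDK's built-in normalizer
    // (an assumption: the thread itself used ICU4J).
    static String nfkc(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFKC);
    }

    // Render a string as space-separated U+XXXX code points for easy comparison.
    static String hex(String s) {
        StringBuilder sb = new StringBuilder();
        s.codePoints().forEach(cp -> {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("U+%04X", cp));
        });
        return sb.toString();
    }

    public static void main(String[] args) {
        // The two ASCII characters and their wide equivalents from the original question.
        String[] inputs = {"\u0060", "\u005F", "\uFF40", "\uFF3F"};
        for (String in : inputs) {
            System.out.println(hex(in) + " -> " + hex(nfkc(in)));
        }
    }
}
```

With correct data, the wide forms should come back as U+0060 and U+005F, matching the conformance test file, and the ASCII forms should come back unchanged. If any of the four yields U+0020 followed by a combining mark, the normalizer is working from stale data or has the same bug as the applet.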

Best Regards,

Addison

Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA
+1 408.962.5487 (phone) +1 408.210.3659 (mobile)
-------------------------------------------------
"Our opportunity is to integrate up to 75% of the world's economy. Is anyone excited yet?"
--- Phillip Merrick, CEO webMethods.

> -----Original Message-----
> From: unicode-bounce@unicode.org [mailto:unicode-bounce@unicode.org] On
> Behalf Of Addison Phillips [wM]
> Sent: Wednesday, June 13, 2001 3:15 PM
> To: unicode@unicode.org
> Cc: mark.davis@us.ibm.com
> Subject: Normalization and the sample code
>
>
> All,
>
> I have been playing with the sample code for normalization in
> UAX15 and the ICU4J classes that are, shall we say, "closely
> related" to the sample code.
>
> If I ask for NFKC of the string U+0060 or U+005F (or of
> U+FF40 and U+FF3F, which are the wide equivalents and the
> initial source of my woes), I get the sequence U+0020 U+0300
> (or U+0020 U+0332). The wording of the UAX implies that this
> is the "correct" behavior, as long as you don't consider the
> non-spacing marks to be a "combination" of space and the
> non-spacing version of the character.
>
> The conformance test file says that FF40 and FF3F should
> become 0060 and 005F, but nothing about 0060 and 005F
> ultimately. Neither does it handle 0020 + 03xx in any way.
>
> So, what's right?
>
> 1. I should get the sequence I get; or
> 2. There is a bug in the code; or
> 3. There is an omission in the tables.
>
> Best Regards,
>
> Addison
>
> Addison P. Phillips
> Globalization Architect / Manager, Globalization Engineering
> webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA
> +1 408.962.5487 (phone) +1 408.210.3659 (mobile)
> -------------------------------------------------
> Internationalization is an architecture. It is not a feature.
>
>
>



This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:17:18 EDT