Re: NFD -> NFC

From: Mark Davis ☕ <mark_at_macchiato.com>
Date: Tue, 11 Mar 2014 17:19:06 +0100

Not sure about your exact case, but ICU's normalization does handle those
characters.

http://unicode.org/cldr/utility/transform.jsp?a=nfc%3Bhex&b=%5Cu30B9%5Cu3099

(That tool uses ICU for NFC).
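
The same composition can be checked outside ICU. The following is a minimal sketch that assumes Python's standard unicodedata module, which is not what the thread uses (the thread uses uconv), and the helper name roundtrips is made up for illustration. It shows U+30B9 U+3099 composing to U+30BA under NFC, and it tests whether a string survives an NFD -> NFC round trip.

```python
import unicodedata

# U+30B9 KATAKANA LETTER SU followed by
# U+3099 COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK
decomposed = "\u30B9\u3099"

# NFC should compose the pair into U+30BA KATAKANA LETTER ZU.
composed = unicodedata.normalize("NFC", decomposed)
print([f"{ord(c):04X}" for c in composed])  # prints ['30BA']


def roundtrips(text: str) -> bool:
    """Hypothetical helper: True if NFD followed by NFC returns the input unchanged."""
    nfd = unicodedata.normalize("NFD", text)
    return unicodedata.normalize("NFC", nfd) == text


print(roundtrips("\u30BA"))      # already NFC, so the round trip is lossless
print(roundtrips(decomposed))    # not NFC to begin with, so the result differs
```

Note that the round trip only reproduces the input when the input is already in NFC, so a diff against the original file can show differences even when normalization is working correctly.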

Mark <https://google.com/+MarkDavis>

*— Il meglio è l’inimico del bene —*

On Tue, Mar 11, 2014 at 4:50 PM, Markus Doppelbauer <doppelbauer_at_gmx.net> wrote:

> Hello,
>
> I have another problem making the normalization process binary
> compatible with ICU.
> Why does "30B9 3099" not combine to "30BA"?
>
> Steps to reproduce:
> wget http://doppelbauer.name/katakana.txt
> uconv -f utf8 -t utf8 -x nfd <katakana.txt >ndf.txt
> uconv -f utf8 -t utf8 -x nfc <ndf.txt >nfc.txt
> diff katakana.txt nfc.txt
>
> Expected result: "katakana.txt" == "nfc.txt"
>
> uconv v2.1 ICU 4.8.1.1
>
> Thanks a lot
> Markus
>
>
>
> _______________________________________________
> Unicode mailing list
> Unicode_at_unicode.org
> http://unicode.org/mailman/listinfo/unicode
>
>

Received on Tue Mar 11 2014 - 11:20:29 CDT
