I have code a HankakuToZenkaku (Unicode to Unicode) text transformation in
mozilla source code from Ken Lunde's non Unicode base algorithm in Understanding
CJKV Information Porcessing. I am not sure it is bug free. It should be easy to
conert to Java....
see
http://lxr.mozilla.org/seamonkey/source/intl/unicharutil/src/nsHankakuToZenkaku.cpp
"Peck, Jon" wrote:
> We need to convert Japanese Windows (cp932) encoded Java resource bundles
> into UTF-8. The Java nativetoascii converter seems not to allow us to
> preserve the half-width katakana characters, mapping them to their
> full-width forms instead of using the characters in the surrogate area.
> Since the half-width form is what our folks want, we need to do this for the
> user interface materials of a Java app.
>
> Surprisingly, I haven't been able quickly to locate a (preferably batch)
> converter that will do this, but surely there must be many. Can anyone
> point me to a tool with this capability? I'd certainly prefer not to write
> one.
>
> Thanks in advance.
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:56 EDT