Re: Ternary search trees for Unicode dictionaries

From: John Cowan (cowan@mercury.ccil.org)
Date: Sun Nov 23 2003 - 01:04:25 EST

  • Next message: Doug Ewell: "Korean compression (was: Re: Ternary search trees for Unicode dictionaries)"

    Jungshik Shin scripsit:

    > Now that you told me you used NFC, isn't this condition similar to
    > Chinese text? How does BOCU and SCSU work for Chinese text? Japanese text
    > might do slightly better with Kana, but isn't likely to be much better.

    The SCSU paper claims that Japanese does *much* better in SCSU than
    UTF-16, thanks to the kana.

    -- 
    Andrew Watt on Microsoft:                       John Cowan
    "Never in the field of human computing          jcowan@reutershealth.com
    has so much been paid by so many                http://www.ccil.org/~cowan
    to so few!" (pace Winston Churchill)            http://www.reutershealth.com
    


    This archive was generated by hypermail 2.1.5 : Sun Nov 23 2003 - 01:50:56 EST