Re: Thai word list

From: Markus Scherer (markus.scherer@jtcsv.com)
Date: Thu Apr 18 2002 - 12:42:45 EDT


Doug Ewell wrote:

> The ICU package includes a sorted Thai word list in a UTF-8 file called
> th18057.txt. Since you may not wish to download the whole package and I
> don't know if the Thai file is available separately, I have uploaded it
> (for a limited time only) to:

Note that ICU has CVS and WebCVS, so you can get any of our files separately.
For this one: http://oss.software.ibm.com/cvs/icu/~checkout~/icu/source/test/testdata/th18057.txt

(ICU uses the X license. See http://oss.software.ibm.com/icu/)

We use this word list for word break iteration, for which we have APIs.

markus

PS: For details about CVS for ICU see http://oss.software.ibm.com/icu/develop/cvs.html



This archive was generated by hypermail 2.1.2 : Thu Apr 18 2002 - 13:25:09 EDT