From: Peter Kirk (peterkirk@qaya.org)
Date: Wed Jan 14 2004 - 11:12:13 EST
On 14/01/2004 07:16, John Burger wrote:
> ...
> By the way, I still don't quite understand what's special about Thai.
> Could someone elaborate?
>
I mentioned Thai because it is the only language I know of which does
not used SPACE, U+0020. It also has at least some of its own
punctuation. So a Thai text need not include any characters U+00xx -
which rules out one suggested heuristic method.
-- Peter Kirk peter@qaya.org (personal) peterkirk@qaya.org (work) http://www.qaya.org/
This archive was generated by hypermail 2.1.5 : Wed Jan 14 2004 - 11:43:37 EST