RE: transliterations (was Compelling Unicode demo)

From: Lars Kristan (lars.kristan@hermes.si)
Date: Mon Nov 19 2001 - 16:35:35 EST


Mark Davis wrote:

>
> In the second row of the Output 1 column, write "nfd; [:nonspacing mark:]
> remove; nfc" as the name of the transliterator (without the quotes). That
> produces:
>

OK, a clear case of RTFM on my side: I failed to look at it the first time
Mark gave me the link. My point is that this transliteration deserves to be
in the list, along with (or within) the "Accents" category.
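
For readers without the demo page at hand, the three steps named in the
quoted rule (decompose, drop the nonspacing marks, recompose) can be
approximated with Python's standard unicodedata module. This is only a
minimal sketch, not the demo's own implementation: the function name is
hypothetical, and it assumes that [:nonspacing mark:] corresponds to the
Unicode general category Mn.

```python
import unicodedata


def strip_nonspacing_marks(text: str) -> str:
    """Approximate the rule "nfd; [:nonspacing mark:] remove; nfc"."""
    # Step 1 (nfd): split precomposed letters into base letter + combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Step 2 (remove): drop every character whose general category is Mn
    # (assumed here to be what [:nonspacing mark:] selects).
    kept = "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")
    # Step 3 (nfc): recompose whatever is left into canonical composed form.
    return unicodedata.normalize("NFC", kept)


if __name__ == "__main__":
    print(strip_nonspacing_marks("Résumé"))  # prints "Resume"
```

Letters that have no canonical decomposition, such as "ø", pass through
unchanged, so this removes accents but is not a general "to ASCII"
conversion.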

I suppose I still have a lot to read on this subject... For now, I will just
ask one more question, more in line with what I had in mind when I first
mentioned transliteration: do any of the (major) web search engines provide
a search that is "accent-insensitive"?

Hmmm, it is surprising how much one learns simply by typing a question. I
thought I had just invented the term "accent-insensitive", but a search for
it revealed that it is all over the web already. Good! Hope to see more of it.

But then soon we might be complaining about it - I just stumbled upon an
article about case sensitivity in web search engines, complaining that the
option to make a query case-sensitive is more or less gone from all the
major search engines...

Lars

-----------------------------------------------------
Lars Kristan (lars@hermes.si)
Storage & Data Management Lab
HERMES SoftLab, Litijska 51, 1000 Ljubljana, Slovenia
phone: +386 (1) 586-5-306 www.hermes-softlab.com
-----------------------------------------------------


