Re: UCA and Russian letter Ё

From: Leo Broukhis <leob_at_mailcom.com>
Date: Sun, 30 Dec 2012 17:52:55 -0800

On Wed, Dec 26, 2012 at 11:18 AM, Whistler, Ken <ken.whistler_at_sap.com> wrote:
> Leo asked:
>
>> My question was narrower: assuming that the strings being compared are
>> words, could it be supported without any markup?
>
> ... where "it" refers to conditional weighting based on the (identified) word boundary. And the answer to that is no, unless the word boundary was explicitly indicated with some kind of a markup character, and then the sequence of that markup character plus the target character of interest (in this case Russian Yo) was given a tailored contraction in the weight table which weighted it differently from any Russian Yo not in that particular contraction sequence.

I see your point: if something can be trivially emulated with a markup
character, there is no need to augment the algorithm (emulating
backward accents with markup is possible but much more cumbersome).

What characters should be used for such markup, if need be?

Thanks,
Leo
Received on Sun Dec 30 2012 - 19:55:36 CST

This archive was generated by hypermail 2.2.0 : Sun Dec 30 2012 - 19:55:37 CST