From: Mark Davis ⌛ (mark@macchiato.com)
Date: Fri Aug 07 2009 - 11:40:39 CDT
The Unicode Standard itself has limited support for language-specific casing
rules, those in Special Casing. The decision was made to make any further
special casing rules available via Unicode Locales (CLDR). There is a bug
open for this (http://unicode.org/cldr/bugs/locale-bugs?findid=1493), but
its resolution would need an authoritative source to submit the rules in the
form specified by http://unicode.org/reports/tr35/#Transform_Rules .
Here is an example that encodes the rules you just gave:
The rules would need to apply to the characters in
http://unicode.org/cldr/utility/list-unicodeset.jsp?a=
\p{greek}%26\p{lowercase}-\p{block=phoneticextensions}
See also
- http://cldr.unicode.org/index/cldr-spec/transliteration-guidelines
- http://unicode.org/cldr/data/common/transforms/
Mark
On Fri, Aug 7, 2009 at 07:50, Andreas Prilop <andreasprilopwww@trashmail.net
> wrote:
> On Thu, 6 Aug 2009, karl williamson wrote:
>
> > I'm confused about context-dependent case changing for Greek.
>
> Case changing in Greek may indeed depend on context.
> Greek ypsilon/upsilon may get a diaeresis as capital letter:
>
> y --> Y
> 03C5 --> 03A5
>
> áy --> AŸ
> 03AC 03C5 --> 0391 03AB
>
> Does the Unicode standard actually deal with this?
>
>
This archive was generated by hypermail 2.1.5 : Fri Aug 07 2009 - 11:42:28 CDT