Re: Greek Diacritics Again

From: Kenneth Whistler (kenw@sybase.com)
Date: Tue Nov 28 2000 - 15:14:21 EST


Antoine Leca re-inquired:

> Kenneth Whistler wrote:
> >
> > There is a note "Tonos" appended to the character name for U+030D in
> > ISO/IEC 10646-1:2000, but *not* in the Unicode Standard.
>
> Sorry Ken, but I cannot understand you here.
> As I said, I was referring myself to NamesList-3.0.0.txt
> <URL:http://www.unicode.org/Public/3.0-Update/NamesList-3.0.0.txt>,
> because Dr. Freytag explained us a while ago that this is better source
> about the meaning of a character than the database is.
> As far as I know, this file has not been updated since Aug 25, 1999.
> This file says:
> 030D COMBINING VERTICAL LINE ABOVE (Tonos)
> * Marshallese
> x (modifier letter vertical line - 02C8)

This is part of the danger of relying on online versions and not
referring to the printed version of the standards. ;-) You need to
understand the function of the online namelist -- it is used to drive
a formatting program that produces *both* the Unicode Standard's name
list and the ISO/IEC 10646-1:2000 name list. For generating the
Unicode Standard *printed* version of the name list, the parenthetical
annotations after the names are omitted. For generating the 10646
name list, all lines except the code/name/annotation line are omitted.

>
> I also checked the database. The most recent version I can see is 3.0.1
> <URL:http://www.unicode.org/Public/3.0-Update1/UnicodeData-3.0.1.txt>.
> Which says:
> 030D;COMBINING VERTICAL LINE ABOVE;Mn;230;NSM;;;;;N;NON-SPACING VERTICAL
> LINE ABOVE;Tonos;;;
>
> I do not know the exact technical term that is to be used in this context,
> but the /mention/ "Tonos" added after the English name of the character
> definitively looks like a /note/ to me.

It *is* a note, but it occurs in the ISO Comment field in the database.
That field is there precisely to track these notes for 10646, which
are required in order to print 10646 accurately. They are not for
any other purpose, and certainly not for the general annotations that
the Unicode editors add to the Unicode Standard's name list.

> I know Unicode and ISO/IEC 10646 have to be aligned, so I understand
> perfectly this can be the very reason why this /mention/ is ever present.
> I also understand such a note could not be removed, how irrelevant it is,
> because of the backward compatibility problems.

Actually, it could easily be removed, but only with the concurrence
of WG2 and the ISO/IEC 10646 editor.

> My only point is that, when someone seeks after "tonos" in NamesList,
> there are two good things that are present, and another which is not:
> - good thing, as I said, under U+0301, it is said this is the Greek tonos;
> - good thing, under U+0344, it is said this character is deprecated, and
> it is canonically decomposed as U+0308 U+0301;
> - another missing point is that, under U+030D, no mention exists to explain
> that despite the name and /mention/, this is *not* the Greek tonos

This clarification can easily be added to the Unicode name list.
Thank you for the suggestion.

--Ken



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:15 EDT