From: Michael Everson (everson@evertype.com)
Date: Mon Nov 08 2004 - 06:47:20 CST
At 12:04 +0000 2004-11-08, Peter Kirk wrote:
>No, my desire is that informative, explanatory text should not give
>misinformation and obfuscation. Note that I was not actually arguing
>for character encodings to follow this informative text, but for the
>text to be changed to match the reality of character encodings and
>the apparent decision of the UTC
No different decision has been made by the UTC or WG2. I have tried
to put into words
>to encode scripts on the basis of significant nodes rather than
>semantic distinctions.
SCRIPTS ARE NOT DISTINGUISHED ONE FROM ANOTHER ON BASIC OF "SEMANTICS".
This red herring which you have said over and over again is really
irritating. Cyrillic is not different from Greek or Latin because of
"semantic distinctions". There are no semantic distinctions which
separate ANY script from ANY OTHER. Every Brahmic KA, KHA, or GHA is
semantically identical to every other Brahmic KA, KHA, or GHA.
>And I was pointing out that many people, including an important
>group of Semitists, who (at least from your perspective) have
>misunderstood the situation are only following what they have read
>in informative, explanatory text in the Standard.
Peter, we have tried to explain this to your particular group of
Semiticists, important or not, and you none of you have actually
heard what we have said, because you don't WANT to, I suspect.
Indeed, you have harped on this "semantic distinction" point over and
over, as though you could force us not to encode Phoenician because
of this legalistic little point.
Other Semiticists have understood, and approved what we are doing. At
this point, I can't say that I care a fig any more whether you or Ms
Keown or Mr Snyder are happy with Phoenician. It is right to encode
it. It is wrong to consider it a font variant of Hebrew.
>>ARABIC LETTER SHEEN is a different letter, and a different
>>character from SYRIAC LETTER SHIN. DEVANAGARI LETTER KA is a
>>different letter, and a different character, from ORIYA LETTER KA.
>>PHOENICIAN LETTER NUN is a different letter, and a different
>>character, from HEBREW LETTER NUN.
>
>The first two, yes by definition, because they are in the Standard.
>The last one, only provisionally because it is subject to an ISO
>ballot.
No, in the real world, these things are different. We encode them
because they ARE different. They are not different because we encoded
them.
>Is this the position of the UTC? Or does the UTC hold that your
>"significant node" scripts are semantically distinct, although you
>disagree? Or does the UTC not in fact accept your principle that
>"significant node" scripts should be encoded, despite their decision
>on Phoenician? Perhaps this should be clarified first.
I, an expert in the world's writing systems, who has worked for a
decade encoding scripts and managing the roadmap and all, have tried
to explain that we have always been encoding according to the science
of the study of the world's writing systems. Whether the text of the
standard supplies enough text to make you comfortable or not is not a
priority for me. Perhaps Ken Whistler and I, in our abundant spare
time, might try to wordsmith the standard with regard to this issue.
But your insistence that some legalistic interpretation of that text
will determine what is and what is not a script is tiresome.
>Well, I thought we were agreed on at least one thing, that the
>distinction between Phoenician and Hebrew should not be described as
>"semantic distinctiveness". And since, according to an informative
>part of the Standard, p.15, "semantic value" is the only criterion
>for a distinct character, it is hardly surprising that people are
>confused.
A distinct character is not the same thing as a distinct script, and
the Standard is not the Law with all the world's answers in it.
-- Michael Everson * * Everson Typography * * http://www.evertype.com
This archive was generated by hypermail 2.1.5 : Mon Nov 08 2004 - 06:55:09 CST