From: Jim Allan (jallan@smrtytrek.com)
Date: Mon Dec 29 2003 - 12:32:20 EST
D. Starner wrote:
> Intra-script, a difference in appearance has call for seperate codings.
> Inter-script, if the appearance is dissimilar enough to be a bar to
> reading, and there's a disjoint population of users (so that one is
> not a handwriting or cipher variant of another), there is reason to
> encode a seperate script.
I agree that dissimilar appearance enough to be a bar to reading is one 
reason to encode a script or set of scripts separately.
But is that *alone* a reason to encode separately if there no other 
reasons to encode separately and there are countervailing reasons to unify?
The Unicode encoding of Runic unifies separate scripts for separate 
languages using separate alphabets. The difference between staveless 
runes and the early Germanic futhark is as great or greater than any 
difference between Aramaic scripts and Phoenician.
All the ancient scripts are mostly handwriting variants of one another, 
if you also consider scratching into potsherds as a form of handwriting. 
There are also variants developed for carving into stone but of course 
carving in stone was not the norm.
The northwest Semitic abjad is *normally* used to cover the same small 
groups of languages which are closely related and share many words, 
especially as spelled without vowel pointing.
The argument here should be, I think, whether in this case the unity of 
the northwest Semitic abjad/alphabet and the common origin of the 
characters and the usefulness of searching different texts overrides the 
dissimilarity of appearance that developed.
> A Word document doesn't embed fonts (usually?), ...
But when used in publishing on the web usually the fonts are embedded if 
font style is important.  Similarly if e-mailing or other transferring 
to another party.
> As for Phoenician, perhaps a scholar may be happy with it as a font 
> variant
> of Hebrew, but I don't see why it's not equally a font variant of Greek.
A reasonable question.
Greek is a variant of Phoenician in origin.
Why would it not be useful to code it and Phoenician the same?
First there is the matter of directionality for all but the earliest 
inscriptions, left-to-right instead or right-to-left.  Then there is the 
addition of further characters in Greek and the change of sound value 
for many of the characters. The alphabet is no longer the same in its 
size or in its values. The phonetic structure of Greek was very 
different from the phonetic structure of the northwestern Semitic languages.
A Phoenician name will be spelled very differently by a Greek. Searches 
won't find it. The same letters probably won't be used. Similarly a 
Greek name would not be written the same by a Phoenician.
Difference of language means there isn't much use in doing 
cross-searches between material written in Phoenician and material 
written in Greek. The same is not true about cross-searching material 
written in any northwest Semitic language. The languages are very close. 
Names will usually appear identically. It is impossible to say, for 
example, whether the Gezer calendar is written in Phoenician language or 
the Hebrew language or some other closely related language.
The Hebrew Torah exists in slightly variant versions, but all use this 
same 22-letter northwest Semitic abjad. Other than some scribal and 
editorial variations and a few spelling variations that have nothing to 
do with the script used, the texts are as identical as most texts that 
one finds in disparate copies in the ancient world or medieval world.
Does it make sense that Unicode introduce *four* separate encodings to 
represent variants of essentially the same text written with the same 
characters just because the glyphs differ in appearance.
Unicode encodes characters, not glyphs, and here the characters are the 
same except in their glyphic appearance.
> No
> non-scholarly user (and Phoenician may well have a few) will 
> understand why
> Phoenician is considered Hebrew, because they don't look alike.
Phoenician should *not* be considered Hebrew. They are different 
languages, though very closely related.
Phoenician scripts (and there are more than one) should *not* be 
considered the same as the modern square Hebrew script or modern cursive 
Hebrew script or the Rashi Hebrew script or the modern Samaritan or 
medieval Samaritan or papyrus Aramaic or Palmyrene Aramaic any more than 
Blackletter Latin script is the same as Uncial Latin script or Italic 
Latin script or any more than Chinese characters are considered the same 
as Japanese characters or Korean characters.
Because one often wishes, rightly, to distinguish such differences, does 
not mean encoding those differences within Unicode is the most useful 
course to take.
If a non-scholarly user wants to display something in a particular 
Phoenician style (and there is more than one of these) then that user 
must select the Phoenician font the user wishes (if the user has more 
than one). That is true whether Phoenician scripts and Hebrew scripts 
are unified in Unicode or not unified in Unicode.
A non-scholarly user of runes may also not understand why Swedish 
staveless runes are considered the same as the Germanic futhark runes 
when they don't look alike. Some modern Japanese still don't understand 
how Japanese kanzi can be unified with Chinese characters when they 
don't look alike.
But if northwestern Semitic unification is wholly or partly accepted by 
the Unicode consortium, a simple chart showing variations in form would 
make it clear to a non-scholarly user how the Phoenician script did 
evolve though intermediate steps to the modern square Aramaic letters 
that most consider standard modern Hebrew letters.
Jim Allan
This archive was generated by hypermail 2.1.5 : Tue Dec 30 2003 - 14:00:20 EST