Re: (SC2WG2.609) New contribution N2705

From: Peter Kirk (peterkirk@qaya.org)
Date: Tue Feb 17 2004 - 18:08:52 EST

  • Next message: Peter Kirk: "Re: (SC2WG2.609) New contribution N2705"

    On 17/02/2004 13:32, Peter Constable wrote:

    >>From: unicode-bounce@unicode.org [mailto:unicode-bounce@unicode.org]
    >>
    >>
    >On Behalf
    >
    >
    >>Of Michael Everson
    >>
    >>
    >
    >
    >
    >
    >>>*hxC(V)- ~ *shxC(V)- [the x's to be subscripted]
    >>>
    >>>are more like mathematical formulae than text.
    >>>
    >>>
    >>They are not mathematical formulae. It is a kind of linguistic
    >>(though not phonetic) notation.
    >>
    >>
    >
    >Yes. In case anyone isn't sure what the notation means, I believe it is
    >as follows (assuming these works are typical of works in historical
    >linguistics):
    >
    >* precedes a transcription to indicate it is a historical reconstruction
    >posited by inference from data obtained from later periods in time
    >
    >~ is used to indicate an alternation; thus, *hxC(V) is in alternation
    >with *shxC(V)
    >
    >s and h are symbols for particular phonemes; as indicated in the doc, hx
    >is being used to represent the laryngeal with uncertain vowel coloring
    >
    >C and V, of course, represent an arbitrary consonant and vowel
    >
    >( ) denotes optionality; thus, the above notation is short hand for *hxC
    >~ *shxC and *hxCV ~ *shxCV
    >
    >
    >So, the expression *hxC(V)- ~ *shxC(V) is saying, in relation to certain
    >phoneme sequences known to exist in later varieties, that an earlier
    >precedessor to the language(s) in question is believed to have had hC or
    >shC, and hCV or shCV (with the vowel colouring on the h unknown or left
    >unspecified).
    >
    >
    >
    Thank you, Peter. I learned about such things during my training with SIL.

    My point is that this is a linguistic notation for a kind of formula,
    where C replaces any actual consonant, ~ acts like a mathematical
    equivalence operator, and x is a subscript identifying a particular
    variant of h. This is not some kind of unusual orthography but a
    specialist scientific notation. It is the same notation as h1, h2, h3 or
    ha, hb, hc etc (the second character subscripted in each case) used in
    all kinds of notational conventions but primarily mathematical and
    scientific ones. Some lingustics textbooks are full of this kind of
    notation. For an example chosen almost at random, I found the following
    in an old paper by Kenneth Pike (in Ruth M. Brend ed. "Advances in
    Tagmemics", North-Holland 1974, p.238):

    (2) eMk = eaTCaf, eaTCgf, egTCaf, egTCgf

    where all the lower case letters are subscripted, and examples of this
    in which the word "catch" is followed by subscript af or gf.

    My point here is that if we once start on encoding subscript letters
    used in specialist scientific notation, there is no easy place to stop.
    Either we need to accept the principle that subscripts are encodable and
    set aside space for a whole alphabet of them (and an upper case alphabet
    and a Greek alphabet as well, plus punctuation); or else we need to say
    from the start that these things are not plain text and should not be
    encoded in Unicode.

    -- 
    Peter Kirk
    peter@qaya.org (personal)
    peterkirk@qaya.org (work)
    http://www.qaya.org/
    


    This archive was generated by hypermail 2.1.5 : Tue Feb 17 2004 - 18:43:42 EST