Re: Writing a proposal for an unusual script: SignWriting

From: Stephen Slevinski (slevinski@gmail.com)
Date: Mon Jun 14 2010 - 11:21:38 CDT

Next message: Leonardo Boiko: "Re: Most commonly used characters not in BMP"

Previous message: John H. Jenkins: "Re: Most commonly used characters not in BMP"
In reply to: Asmus Freytag: "Re: Writing a proposal for an unusual script: SignWriting"
Next in thread: Asmus Freytag: "Re: Writing a proposal for an unusual script: SignWriting"
Reply: Asmus Freytag: "Re: Writing a proposal for an unusual script: SignWriting"
Reply: Andr� Szabolcs Szelp: "Re: Writing a proposal for an unusual script: SignWriting"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

Greetings Asmus Freytag,

Plain text SignWriting should be able to write actual sign language,
such as "hello world."

This is a combination of 2 signs and 1 punctuation. The first sign has
2 symbols. The second sign has 4 symbols. Punctuation is always used
by itself.

You can see this online being processed in JavaScript:
http://signbank.org/swis/bswref.html?bsw=0fb14c38e3924a54bd27138c3984bb4cf0fb18738c39c4c04c618738c3934b94b020538c3924d14b32ef38c3924c949c38838c392#v

I dislike the idea of requiring a higher level protocol in order to
encode plain text SignWriting. I have used CSS to change the color and
size of SignWriting. I chose not to include color or size in the plain
text representation of SignWriting because color and size do belong in
the higher level protocol.

> From the way you describe the requirements (faithfully representing
> the minutest details of the authors choice of placements, etc.) and
> your claim that the plain text level should not / does not encode
> semantic contents, I get the impression that you have not fully
> thought through what information should be represented at what level
> of the text architecture.
I prefer the term phonemic rather than semantic. Most symbols are
phonemic. A minute few of the symbols are featural. It requires
several featural symbols to build a phonemic representation, such as a
new handshapes that a particular author feels is absolutely needed.

While not a linguist, I would say that the semantic meaning of
SignWriting is contained in the spatial layout of the symbols. We do no
encode the semantic meaning directly, but the semantics can be perceived
when a sign is considered as a whole. Some of the semantics are not
even included in the writing, but left for the reader to infer. An
example would be a starting handshape that is not written but can be
assumed.

>
> Concretely: do you see the need for, existence of a SignWritingML?
I see no advantage to requiring XML for plain text. Years ago, I was
using a comma delimited format for the data. I took the advice of
others and moved to an XML format: specifically SWML-S. It was a
misstep. Same data, but more complex processing. Now, with the
character encoding, I can represent plain text SignWriting as character
data. Easy to parse, search, sort, and display. There is no advantage
to using XML; however, I have created an equivalent XML (called BSWML)
with roundtrip mapping between character data and XML. I personally
will not be using XML for plain text, but I thought others might
appreciate the possibility.

> Do you think, existing HTML could correctly render SignWriting if that
> was presented as part of the plain text data (under your proposal)?
I have both client side and server side processing of SignWriting. The
server side uses PHP and passes completed sign or column images to the
client. The client side processing uses JavaScript, CSS, and HTML to
correctly represent the SignWriting. The HTML specifically uses DIV
tags with relative positioning. Each symbol is positioned with it's own
DIV.

For the future, I am considering a browser plugin that will detect and
render SignWriting character data. A regular expression could scrape
the appropriate PUA characters. Another regular expression could
validate that the characters represent valid structures. Then the
SignWriting display could be built using individual symbols, completed
signs, or entire columns.

> What happens when a user agent selects a different font, because the
> one the author used is unavailable on the system used by the reader?
For SignWriting the font designer has strict rules of size and shape.
The font designer can modify the symbol glyphs only under these
restrictions. With the number of symbols involved, no font designer
would break these rules and waste their time designing a font that would
not work with the writing system.

The major choice will be between a raster font and an SVG font. The
raster font is completed. The SVG font is a work in progress.
SignWriting has the unusual requirement of a 2 color font. One font
color for the line of the symbols and another for the fill. The fill is
needed when symbols overlap.

Here's a few simplified examples.

First the sign for dinner in American Sign Language.
http://signbank.org/swis/glyphogram.php?bsw=0fb2ff38c3924f54f510138c392500500

Next is the sign for German in German Sign Langauge:
http://signbank.org/swis/glyphogram.php?bsw=0fb2ff38c3924f550910038d3925014f5

Each sign has two symbols: one hand and one head. The fill color of
each hand symbol covers part of the head line. The symbols represent
the phonemic information, while the semantics are perceived by the
symbol position.

In SignWriting we write the symbols in space, so all we encode is the
positions of the symbols. In HamNoSys, you would encode dinner as round
index hand on chin, and German as closed index hand on forehead.

The difference effects how the writers think while they write. For sign
language, everything is visual. The language center of their brain is
wired to their eyes. They look at the symbols and see phonemic
information. They construct a sign by placing symbols on a canvas.
They perceive the semantic information not only between any two symbols,
but the semantic information contained by all of the symbols taken
together. They look for shape and pattern. This greatly affects how
someone reads.

We've had large discussions about spelling on the SignWriting list.
Someone will write a sign and others will comment. People will try to
rewrite the sign to be easier to read. It's amazing what a difference a
small adjustment in symbol placement can make. A sign that was
difficult to read automatically becomes easy and clear. The improvement
in the sign was only possible because of an excellent writer who
understood the sign as a whole and was able to build a cohesive
representation. SignWriting is part artistry. Is simple to start, but
the best writers have an eye for symbol choice and symbol position.

Maybe that's why I'm such a proponent for exact symbol placement: it's
the only way to achieve the best writing. I'm sure a system could be
devised that produced average or adequate writing, but it will never be
able to produce the quality of writing that an experienced writer can
produce.

My focus is to make the best encoding for the reader and the writer.
Let the designers and programmers handle the extra complexity.

> In some of your answers you've given a few hints, but for someone like
> me who has no firsthand experience of signing and difficulties
> visualizing sign writing, you probable will want to be way more
> explicit and concrete in your description and examples, so that it
> becomes possible to evaluate whether your choices in the encoding
> model are the correct ones, or possibly the only ones, or whether, on
> the contrary, the represent an unnecessary departure from the way
> Unicode deals with non-linear notations.
>
I appreciate your consideration and thought. I will put more detail in
the official proposal later this year. The current documents are to
define the open standards for data sharing between projects.

Perhaps you'd like to read a short piece by Adam Frost titled "Why
SignWriting?" in English and ASL.
http://www.signwriting.org/archive/docs6/sw0526-US-Why-SignWriting-Frost.pdf

Regards,
-Steve

Next message: Leonardo Boiko: "Re: Most commonly used characters not in BMP"
Previous message: John H. Jenkins: "Re: Most commonly used characters not in BMP"
In reply to: Asmus Freytag: "Re: Writing a proposal for an unusual script: SignWriting"
Next in thread: Asmus Freytag: "Re: Writing a proposal for an unusual script: SignWriting"
Reply: Asmus Freytag: "Re: Writing a proposal for an unusual script: SignWriting"
Reply: Andr� Szabolcs Szelp: "Re: Writing a proposal for an unusual script: SignWriting"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Mon Jun 14 2010 - 11:23:14 CDT