L2/00-412
DATE: 2000-11-08
DOC TYPE: |
Expert contribution |
TITLE: |
Special mathematical operators used in plain-text mathematical expressions |
SOURCE: |
Murray Sargent III |
PROJECT: |
|
STATUS: |
Proposal |
ACTION ID: |
FYI |
DUE DATE: |
-- |
DISTRIBUTION: |
Worldwide |
MEDIUM: |
Paper and html |
NO. OF PAGES: |
4 |
A. Administrative
|
|
1. Title |
Special mathematical operators used in computational algebra |
2. Requester's name |
Murray Sargent III |
3. Requester type |
Expert request. |
4. Submission date |
07-11-2000 |
5. Requester’s reference |
|
6a. Completion |
Complete proposal |
6b. More information to be provided? |
If requested |
B. Technical -- General
|
|
1a. New script? Name? |
No. |
1b. Addition of characters to existing block? Name? |
Miscellaneous technical |
2. Number of characters |
5 |
3. Proposed category |
|
4. Proposed level of implementation and rationale |
Level 1 |
5a. Character names included in proposal? |
Yes. |
5b. Character names in accordance with guidelines? |
Yes. |
5c. Character shapes reviewable? |
|
6a. Who will provide computerized font? |
Microsoft Symbol font |
6b. Font currently available? |
Microsoft Symbol font |
6c. Font format? |
TrueType |
7a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? |
Yes. |
7b. Are published examples (such as samples from newspapers, magazines, or other sources) of use of proposed characters attached? |
No |
8. Does the proposal address other aspects of character data processing? |
No |
C. Technical -- Justification
|
|
1. Contact with the user community? |
Yes. TeX and PS Technical Word Processor |
2. Information on the user community? |
Mathematical software |
3a. The context of use for the proposed characters? |
Used in plain-text encoding of mathematics |
3b. Reference |
See references at end of this proposal |
4a. Proposed characters in current use? |
Yes. |
4b. Where? |
PS Technical Word Processor. |
5a. Characters should be encoded entirely in BMP? |
Yes. |
5b. Rationale |
Small number of characters |
6. Should characters be kept in a continuous range? |
Preferably (easier to parse) |
7a. Can the characters be considered a presentation form of an existing character or character sequence? |
No, although existing characters like ^ and _ are overloaded to have the same semantics |
7b. Where? |
TeX |
7c. Reference |
|
8a. Can any of the characters be considered to be similar (in appearance or function) to an existing character? |
Yes |
8b. Where? |
|
8c. Reference |
|
9a. Combining characters or use of composite sequences included? |
No |
9b. List of composite sequences and their corresponding glyph images provided? |
na |
10. Characters with any special properties such as control function, etc. included? |
No |
D. SC2/WG2 Administrative
To be completed by SC2/WG2 |
|
1. Relevant SC 2/WG 2 document numbers: |
|
2. Status (list of meeting number and corresponding action or disposition) |
|
3. Additional contact to user communities, liaison organizations etc. |
|
4. Assigned category and assigned priority/time frame |
|
Other Comments |
|
Three additional operators allow most mathematical expressions to be encoded in a simple plain-text format. The first two of these characters are similar in spirit to the Unicode FRACTION SLASH (2044), in that they are used to request built-up display. Hence I recommend that they follow 2044.
These characters are:
SUPERSCRIPT OPERATOR
× (up arrow ® 2191)
¯ SUBSCRIPT OPERATOR
× (down arrow ® 2193)
L LITERAL OPERATOR
× (circled Latin capital letter L ® 24C1)
{ MATHEMATICS ON (symbol in dashed box)
} MATHEMATICS OFF (symbol in dashed box)
The simplest example of subscript/superscript operator usage is to replace TEX’s math use of ^ and _, respectively, which currently have to be “escaped” to be displayed. The glyphs are created by a superscripted bold up arrow and a subscripted bold down arrow, respectively. As in TEX, the subscript and superscript operators allow nested subscripts and superscripts. In combination with other Unicode characters, these “operator” characters allow most common mathematical expressions to be represented without overloading the use of display characters like ^. An appropriate rendering algorithm can use them to produce built-up mathematical text.
The literal operator is used to reduce an operator character like the FRACTION SLASH and the subscript/superscript operators to be displayed as an ordinary character, instead of in built-up form.
The math-on/math-off symbols are analogous to the left-to-right U+200E and right-to-left U+200F bidirectional marks.
For further discussion, please see my articles “Unicode Plain-Text Encoding of Mathematics” in the Proceedings of the Unicode Implementers’ Workshop 6, Santa Clara, CA (Sept. 8-9, 1994) and “Unicode, Rich Text, and Mathematics”, in the Proceedings of the Seventh International Unicode Conference, San Jose, CA (Sept. 14-15, 1995)