Concise term for non-ASCII Unicode characters

Philippe Verdy verdy_p at
Mon Sep 21 16:47:50 CDT 2015

You actually don't need any copy to work with it U+0000 to U+007F are
directly bound to US-ASCII. Unicode describe these characters with
character properties (and representative glyphs only for the range
U+0020..U+007E; the "C0" controls, in U+0000 to U+001F and U+007F, have a
pseudo-glyph in charts which may only be usable if you work with them in
"visible controls" mode.)

If you need  ANSI X3.4, it's only about the intended usage of controls, but
only a few are prevalent in plain text: TAB, LF, CR, or CR+LF, and FF (NUL
and DEL are used as fillers depending on environments or may be used as
special escapes or terminators for terminal protocols). Most controls in
US-ASCII have their name and most common functions relarted to
console/keyboard/printers protocols and not intended to be used in text
contents. But there are so many competing protocols that even the ANSI X3.4
descriptions are just informative and deprecated: you'll need to look into
each protocol. Unicode (and MIME in Internet protocols) attempt to create
an equivalence for line termination only (with LF, CR, or CR+LF; Unicode
also added NL for the C1 controls, only for compatibility as well with
EBCDIC data converters).

2015-09-21 22:51 GMT+02:00 Sean Leonard <lists+unicode at>:

> Related question as I am researching this:
> How can I acquire (cheaply or free) the latest and most official copy of
> US-ASCII, namely, the version that Unicode references?
> The Unicode Standard 8.0 refers to the following document:
> ANSI X3.4: American National Standards Institute. Coded character
> set—7-bit American
> national standard code for information interchange. New York: 1986. (ANSI
> X3.4-1986).
> (See page 294.)
> A quick Google search did not yield results. There are public/university
> library hard copies but they are hundreds of miles away from my location.
> Sean
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <>

More information about the Unicode mailing list