# NameAliases-6.1.0.txt
# Date: 2011-08-09, 00:54:00 GMT [KW]
#
# NB: This is a DRAFT of a modified format for NameAliases.txt,
# for consideration by the UTC. It is not an approved data file.
#
# This file is a normative contributory data file in the
# Unicode Character Database.
#
# Copyright (c) 2005-2011 Unicode, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# This file defines the formal name aliases for Unicode characters.
#
# For informative aliases see NamesList.txt
#
# The formal name aliases are divided into four types.
#
# 1. Corrections for serious problems in the character names
# 2. ISO 6429 names for C0 and C1 control functions
# 3. Other commonly occurring names for control codes, format characters,
#    and spaces
# 4. Commonly occurring abbreviations for control codes, format characters,
#    spaces, and variation selectors
#
# The formal name aliases are part of the Unicode character namespace, which
# includes the character names and the names of named character sequences.
# The inclusion of ISO 6429 names and other commonly occurring names and
# abbreviations for control codes and format characters as formal name aliases
# is to help avoid name collisions between Unicode character names and the
# labels which commonly appear in text and/or in implementations such as regex, for
# control codes (which have no Unicode character name) or for format characters.
#
# For documentation, see NamesList.html and http://www.unicode.org/reports/tr44/
#
# FORMAT
#
# Each line has three fields, separated by semicolons, as described here:
#
# First field:  Code point
# Second field: Alias
# Third field:  Type
#
# The Type labels used are: correction, iso6429, control, abbreviation
# Those Type labels can be mapped to other strings for display, if desired,
# e.g. "preferred", "control name in ISO 6429", "other control code name",
# "abbreviated as", etc.
#
# In case multiple aliases are assigned, additional aliases
# are provided on separate lines.
# Parsers of this data file should
# take note that the code points are not in numerical order, and that
# the same code point can (and does) occur more than once.
#
#-----------------------------------------------------------------

# 1. Corrections for serious problems in the Unicode character names

01A2;LATIN CAPITAL LETTER GHA;correction
01A3;LATIN SMALL LETTER GHA;correction
0CDE;KANNADA LETTER LLLA;correction
0E9D;LAO LETTER FO FON;correction
0E9F;LAO LETTER FO FAY;correction
0EA3;LAO LETTER RO;correction
0EA5;LAO LETTER LO;correction
0FD0;TIBETAN MARK BKA- SHOG GI MGO RGYAN;correction
2118;WEIERSTRASS ELLIPTIC FUNCTION;correction
2448;MICR ON US SYMBOL;correction
2449;MICR DASH SYMBOL;correction
A015;YI SYLLABLE ITERATION MARK;correction
FE18;PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRACKET;correction
1D0C5;BYZANTINE MUSICAL SYMBOL FTHORA SKLIRON CHROMA VASIS;correction

#-----------------------------------------------------------------

# 2. Aliases provided for ISO 6429 control function names

0000;NULL;iso6429
0001;START OF HEADING;iso6429
0002;START OF TEXT;iso6429
0003;END OF TEXT;iso6429
0004;END OF TRANSMISSION;iso6429
0005;ENQUIRY;iso6429
0006;ACKNOWLEDGE;iso6429

# Note that no formal name alias for the ISO 6429 "BELL" is
# provided for U+0007, because of the existing name collision
# with U+1F514 BELL.

0008;BACKSPACE;iso6429
0009;CHARACTER TABULATION;iso6429
000A;LINE FEED;iso6429
000B;LINE TABULATION;iso6429
000C;FORM FEED;iso6429
000D;CARRIAGE RETURN;iso6429
000E;SHIFT OUT;iso6429
000F;SHIFT IN;iso6429
0010;DATA LINK ESCAPE;iso6429
0011;DEVICE CONTROL ONE;iso6429
0012;DEVICE CONTROL TWO;iso6429
0013;DEVICE CONTROL THREE;iso6429
0014;DEVICE CONTROL FOUR;iso6429
0015;NEGATIVE ACKNOWLEDGE;iso6429
0016;SYNCHRONOUS IDLE;iso6429
0017;END OF TRANSMISSION BLOCK;iso6429
0018;CANCEL;iso6429
0019;END OF MEDIUM;iso6429
001A;SUBSTITUTE;iso6429
001B;ESCAPE;iso6429
001C;INFORMATION SEPARATOR FOUR;iso6429
001D;INFORMATION SEPARATOR THREE;iso6429
001E;INFORMATION SEPARATOR TWO;iso6429
001F;INFORMATION SEPARATOR ONE;iso6429
007F;DELETE;iso6429
0082;BREAK PERMITTED HERE;iso6429
0083;NO BREAK HERE;iso6429
0085;NEXT LINE;iso6429
0086;START OF SELECTED AREA;iso6429
0087;END OF SELECTED AREA;iso6429
0088;CHARACTER TABULATION SET;iso6429
0089;CHARACTER TABULATION WITH JUSTIFICATION;iso6429
008A;LINE TABULATION SET;iso6429
008B;PARTIAL LINE FORWARD;iso6429
008C;PARTIAL LINE BACKWARD;iso6429
008D;REVERSE LINE FEED;iso6429
008E;SINGLE SHIFT TWO;iso6429
008F;SINGLE SHIFT THREE;iso6429
0090;DEVICE CONTROL STRING;iso6429
0091;PRIVATE USE ONE;iso6429
0092;PRIVATE USE TWO;iso6429
0093;SET TRANSMIT STATE;iso6429
0094;CANCEL CHARACTER;iso6429
0095;MESSAGE WAITING;iso6429
0096;START OF GUARDED AREA;iso6429
0097;END OF GUARDED AREA;iso6429
0098;START OF STRING;iso6429
009A;SINGLE CHARACTER INTRODUCER;iso6429
009B;CONTROL SEQUENCE INTRODUCER;iso6429
009C;STRING TERMINATOR;iso6429
009D;OPERATING SYSTEM COMMAND;iso6429
009E;PRIVACY MESSAGE;iso6429
009F;APPLICATION PROGRAM COMMAND;iso6429

#-----------------------------------------------------------------

# 3. Aliases provided for other de facto control code names and
#    format control names in widespread use

# These include ISO 6429 control function names valid in
# earlier editions of that standard.

0007;ALERT;control
0009;HORIZONTAL TABULATION;control
000A;NEW LINE;control
000A;END OF LINE;control
000B;VERTICAL TABULATION;control
000E;LOCKING-SHIFT ONE;control
000F;LOCKING-SHIFT ZERO;control
001C;FILE SEPARATOR;control
001D;GROUP SEPARATOR;control
001E;RECORD SEPARATOR;control
001F;UNIT SEPARATOR;control
0084;INDEX;control
0088;HORIZONTAL TABULATION SET;control
0089;HORIZONTAL TABULATION WITH JUSTIFICATION;control
008A;VERTICAL TABULATION SET;control
008B;PARTIAL LINE DOWN;control
008C;PARTIAL LINE UP;control
008D;REVERSE INDEX;control
008E;SINGLE-SHIFT 2;control
008F;SINGLE-SHIFT 3;control
0091;PRIVATE USE 1;control
0092;PRIVATE USE 2;control
0096;START OF PROTECTED AREA;control
0097;END OF PROTECTED AREA;control
FEFF;BYTE ORDER MARK;control

#-----------------------------------------------------------------

# 4. Aliases provided for de facto abbreviations of control codes,
#    format controls, spaces, and variation selectors in widespread use

0000;NUL;abbreviation
0005;ENQ;abbreviation
0006;ACK;abbreviation
0007;BEL;abbreviation
0008;BS;abbreviation
0009;HT;abbreviation
0009;TAB;abbreviation
000A;LF;abbreviation
000A;NL;abbreviation
000A;EOL;abbreviation
000B;VT;abbreviation
000C;FF;abbreviation
000D;CR;abbreviation
000E;SO;abbreviation
000F;SI;abbreviation
0015;NAK;abbreviation
001A;SUB;abbreviation
001B;ESC;abbreviation
001C;FS;abbreviation
001D;GS;abbreviation
001E;RS;abbreviation
001F;US;abbreviation
0020;SP;abbreviation
007F;DEL;abbreviation
0085;NEL;abbreviation
00A0;NBSP;abbreviation
00AD;SHY;abbreviation
034F;CGJ;abbreviation
200B;ZWSP;abbreviation
200C;ZWNJ;abbreviation
200D;ZWJ;abbreviation
200E;LRM;abbreviation
200F;RLM;abbreviation
202A;LRE;abbreviation
202B;RLE;abbreviation
202C;PDF;abbreviation
202D;LRO;abbreviation
202E;RLO;abbreviation
202F;NNBSP;abbreviation
2060;WJ;abbreviation
FE00;VS1;abbreviation

# NB: If the additions to this data file are approved,
# the entries to fill out the ranges VS1..VS16 and VS17..VS256
# will be added.
# For now, they are omitted to avoid
# cluttering the file during review.

FE0F;VS16;abbreviation
FEFF;BOM;abbreviation
FEFF;ZWNBSP;abbreviation
E0100;VS17;abbreviation
E01EF;VS256;abbreviation

# Total code points: xxx

# EOF