NamesList.txt as data source
Janusz S. Bień
jsbien at mimuw.edu.pl
Sat Mar 26 04:10:24 CDT 2016
On Thu, Mar 10 2016 at 22:40 CET, kenwhistler at att.net writes:
> The *reason* that NamesList.txt exists at all is to drive the tool, unibook,
> that formats the full Unicode code charts for posting. It is only
> posted in the Unicode Character Database at all as a matter of
> convenience, to give people access to a text only version of the
> names list that appears in the fully formatted pdf versions of the
> code charts that contain all the representative glyphs.
> NamesList.txt should *not* be data mined.
I've just noticed that NamesList.txt is, in a sense, data mined by the
Unicode Consortium itself. I mean the "Unicode Utilities: Character
Properties" page, which, e.g., for LATIN SMALL LETTER P WITH FLOURISH
(http://unicode.org/cldr/utility/character.jsp?a=A753) displays:
subhead: Medievalist addition
Am I right that this information is available only in NamesList.txt?
In my opinion this is important information and should be officially
available for character data mining engines.
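
To make concrete what kind of extraction I have in mind, here is a minimal
sketch in Python. It assumes the usual NamesList.txt layout: a block header
starts with "@@", a subhead line starts with "@" followed by a tab, and a
character entry is a hexadecimal code point, a tab, and the name. The sample
text is written by hand for illustration, not copied from the real file, and
the function name is my own invention.

```python
import re

# Hand-written sample in the style of NamesList.txt (an assumption,
# not an excerpt from the actual file).
SAMPLE = (
    "@@\tA720\tLatin Extended-D\tA7FF\n"
    "@\t\tMedievalist addition\n"
    "A752\tLATIN CAPITAL LETTER P WITH FLOURISH\n"
    "A753\tLATIN SMALL LETTER P WITH FLOURISH\n"
    "\t* illustrative annotation line\n"
)

# A character entry: 4 to 6 uppercase hex digits, a tab, then the name.
CHAR_LINE = re.compile(r"^([0-9A-F]{4,6})\t(.+)$")


def subheads_by_code_point(text):
    """Map each code point to the most recent subhead seen above it."""
    result = {}
    current = None
    for line in text.splitlines():
        if line.startswith("@@"):
            # A new block begins: forget the previous block's subhead.
            current = None
        elif line.startswith("@\t"):
            # Subhead line: drop the leading "@" and surrounding tabs.
            current = line[1:].strip()
        else:
            match = CHAR_LINE.match(line)
            if match:
                result[int(match.group(1), 16)] = current
    return result


subheads = subheads_by_code_point(SAMPLE)
print(subheads[0xA753])
```

Annotation lines begin with a tab, so they are skipped without special
handling. Whether such parsing is advisable is of course exactly the
question raised above.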
Prof. dr hab. Janusz S. Bien - Uniwersytet Warszawski (Katedra Lingwistyki Formalnej)
Prof. Janusz S. Bien - University of Warsaw (Formal Linguistics Department)
jsbien at uw.edu.pl, jsbien at mimuw.edu.pl, http://fleksem.klf.uw.edu.pl/~jsbien/