Corrigendum #9

Richard Wordingham richard.wordingham at
Mon Jun 2 13:05:11 CDT 2014

On Mon, 2 Jun 2014 10:17:04 -0700
Markus Scherer < at> wrote:

> CLDR collation data defines special contraction mappings that start
> with a noncharacter, for

> In CLDR 23 and before (when we were still using XML collation syntax),
> these were raw noncharacters in the .xml files.

> As I said earlier:
> it should be ok to include noncharacters in CLDR data files for
> processing by CLDR implementations, and it should be possible to edit
> and diff and version-control and web-view those files etc.

They come as a nasty shock when someone thinks XML files are marked-up
text files.  I'm still surprised that the published human-readable form
of CLDR files should contain automatically applied non-Unicode copyright


More information about the Unicode mailing list