[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #11072(new data)

Opened 3 months ago

Last modified 5 days ago

supplementalData.xml has a line containing 7000+ characters, and this breaks the parser.

Reported by: adam.farley@… Owned by: anybody
Component: supplemental Data Locale: https://www.unicode.org/repos/cldr/trunk/common/supplemental/supplementalData.xml
Phase: dsub Review:
Weeks: Data Xpath:
Xref:

Description

Line 5204 in the below file is so long that vi on z/OS cannot handle it, nor can the java xml parser parse that file on z/OS.

https://www.unicode.org/repos/cldr/trunk/common/supplemental/supplementalData.xml

Would it be possible to reduce the length of the line to 2000 characters or less, somehow?

Attachments

Change History

comment:1 Changed 3 months ago by srl

I measure the line as 9,505 bytes

<territoryCodes type="ZZ" numeric="999" alpha3="ZZZ" internet="AAA AARP ABARTH ABB ABBOTT…

comment:2 Changed 5 days ago by srl

Can we just get rid of the internet= attribute on this line? What benefit does it give to CLDR?

The list of TLD data is readily available at https://www.icann.org/resources/pages/tlds-2012-02-25-en - I don't think this belongs in CLDR.

the CLDR copy isn't even up to date. I'm OK with keeping tlds-alpha-by-domain.txt in the CLDR tools' data if it gets used for something (exemplar checks or whatever)

Last edited 5 days ago by srl (previous) (diff)
View

Add a comment

Modify Ticket

Action
as new
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.