Unicode Common Locale Data Repository: Bug Tracking

CLDR Ticket #8316 (accepted data)

Opened 2 years ago

Last modified 21 months ago

Add other characters attribute to numberingSystem element

Reported by: mark
Owned by: mark
Component: other
Data Locale:
Phase: rc
Review:
Weeks:
Data Xpath:


Right now, if we want to find the full set of "exemplar" characters, we also have to look at the numbering systems.

That involves collecting the symbols the locale uses for that numbering system, plus the characters that the system itself can produce. For the latter, it is easy with numeric types:

        <numberingSystem id="arabext" type="numeric" digits="۰۱۲۳۴۵۶۷۸۹"/>
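
For a numeric system, reading the digits attribute is all that is needed. A minimal Python sketch, assuming the element is available as a string; the helper name numeric_digit_set is hypothetical and not part of any CLDR tool:

```python
import xml.etree.ElementTree as ET


def numeric_digit_set(element_xml: str) -> set[str]:
    """Return the characters a numeric numberingSystem element can produce.

    Hypothetical helper for illustration only. It handles type="numeric"
    and refuses algorithmic systems, which carry no digits attribute.
    """
    elem = ET.fromstring(element_xml)
    if elem.get("type") != "numeric":
        raise ValueError("only numeric numbering systems list their digits")
    # Each character of the digits attribute is one digit of the system.
    return set(elem.get("digits", ""))


arabext = '<numberingSystem id="arabext" type="numeric" digits="۰۱۲۳۴۵۶۷۸۹"/>'
arabext_digits = numeric_digit_set(arabext)
```
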

However, it is not trivial with algorithmic ones.

        <numberingSystem id="armn" type="algorithmic" rules="armenian-upper"/>

        <numberingSystem id="roman" type="algorithmic" rules="roman-upper"/>

I suggest that we:

  1. Add another attribute that contains all the characters that could come out of RBNF:
            <numberingSystem id="roman" type="algorithmic" rules="roman-upper"
                    charset="[ↁ ↂ ↇ ↈ C D I L-N V X]"/>
  2. Add a test to the RBNF tests (since they already know how to parse the rules) that ensures that the rules contain all and only those characters.
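
The "all and only" check in step 2 could be sketched roughly as below. This is an illustration in Python, not the actual RBNF test code: both function names are hypothetical, the parser handles only the simple subset of set syntax used in the example above (single characters and a-b ranges separated by spaces), and the literal strings from the rules are assumed to have been extracted already by whatever parses the rules.

```python
from collections.abc import Iterable


def parse_simple_charset(pattern: str) -> set[str]:
    """Parse a tiny subset of set syntax such as "[A C-E]".

    Assumption: no escapes, nested sets, or properties; a real test
    would rely on a full UnicodeSet implementation instead.
    """
    body = pattern.strip()
    if not (body.startswith("[") and body.endswith("]")):
        raise ValueError("expected a bracketed set")
    chars: set[str] = set()
    for token in body[1:-1].split():
        if len(token) == 3 and token[1] == "-":
            lo, hi = ord(token[0]), ord(token[2])
            if lo > hi:
                raise ValueError(f"reversed range: {token!r}")
            chars.update(chr(cp) for cp in range(lo, hi + 1))
        elif len(token) == 1:
            chars.add(token)
        else:
            raise ValueError(f"unsupported token: {token!r}")
    return chars


def charset_mismatch(
    declared: set[str], rule_literals: Iterable[str]
) -> tuple[set[str], set[str]]:
    """Return (undeclared, unused) characters.

    undeclared: characters the rules emit that the attribute omits.
    unused: characters the attribute lists that no rule emits.
    Both sets are empty when the attribute matches the rules exactly.
    """
    produced = {ch for text in rule_literals for ch in text}
    return produced - declared, declared - produced


roman_charset = parse_simple_charset("[ↁ ↂ ↇ ↈ C D I L-N V X]")
```

A test along these lines would fail whenever either returned set is non-empty, which covers both halves of "all and only".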


Change History

comment:1 Changed 2 years ago by mark

  • Summary changed from "Add other characters to" to "Add other characters attribute to numberingSystem element"

comment:2 Changed 2 years ago by Nick Patch <patch@…>

I was recently thinking about this issue since I have a use case for it related to localized entity recognition. It would be a welcome addition to the CLDR.

comment:3 Changed 2 years ago by emmons

  • Status changed from new to assigned
  • Component changed from unknown to data-other
  • Priority changed from assess to medium
  • Phase changed from dsub to rc
  • Milestone changed from UNSCH to 28
  • Owner changed from anybody to mark

comment:4 Changed 2 years ago by markus

  • Type set to data

comment:5 Changed 2 years ago by markus

  • Component changed from data-other to other

comment:6 Changed 2 years ago by srl

  • Status changed from assigned to accepted

comment:7 Changed 22 months ago by mark

  • Milestone changed from 28 to 29

comment:8 Changed 21 months ago by emmons

  • Milestone changed from 29 to upcoming
