[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #8316(accepted data)

Opened 2 years ago

Last modified 18 months ago

Add other characters attribute to numberingSystem element

Reported by: mark Owned by: mark
Component: other Data Locale:
Phase: rc Review:
Weeks: Data Xpath:
Xref:

Description

Right now, if we want to find out the full set of "exemplar" characters, we have to look at the numbering systems.

That involves grabbing the symbols used by the locale for that numbering system, but also the characters that could come out of the system. For the latter, it is easy with numeric types:

        <numberingSystem id="arabext" type="numeric" digits="۰۱۲۳۴۵۶۷۸۹"/>

However, it is not trivial with algorithmic ones.

        <numberingSystem id="armn" type="algorithmic" rules="armenian-upper"/>

        <numberingSystem id="roman" type="algorithmic" rules="roman-upper"/>

I suggest that we:

  1. Add another attribute that contains all the characters that could come out of RBNF
            <numberingSystem id="roman" type="algorithmic" rules="roman-upper" 
                    charset="[ↁ ↂ ↇ ↈ C D I L-N V X]"/>
    
  2. Add to the RBNF tests (since they know how to parse the rules), a test that ensures that the rules contain all and only those characters.

Attachments

Change History

comment:1 Changed 2 years ago by mark

  • Summary changed from Add other characters to to Add other characters attribute to numberingSystem element

comment:2 Changed 2 years ago by Nick Patch <patch@…>

I was recently thinking about this issue since I have a use case for it related to localized entity recognition. It would be a welcome addition to the CLDR.

comment:3 Changed 2 years ago by emmons

  • Status changed from new to assigned
  • Component changed from unknown to data-other
  • Priority changed from assess to medium
  • Phase changed from dsub to rc
  • Milestone changed from UNSCH to 28
  • Owner changed from anybody to mark

comment:4 Changed 2 years ago by markus

  • Type set to data

comment:5 Changed 2 years ago by markus

  • Component changed from data-other to other

comment:6 Changed 2 years ago by srl

  • Status changed from assigned to accepted

comment:7 Changed 19 months ago by mark

  • Milestone changed from 28 to 29

comment:8 Changed 18 months ago by emmons

  • Milestone changed from 29 to upcoming
View

Add a comment

Modify Ticket

Action
as accepted
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.