[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #5555(accepted docs)

Opened 4 years ago

Last modified 22 months ago

CLDR root grapheme break should treat IDS sequences as single unit

Reported by: pedberg Owned by: pedberg
Component: segmentation Data Locale:
Phase: final Review:
Weeks: Data Xpath:
Xref:

Description

The CLDR root grapheme break iterator should treat ideographic description character sequences as a single cluster; these are intended to represent a single logical character. This should also be proposed to Unicode for UAX #29, and a separate ICU ticket should be filed as well.

Attachments

Change History

comment:1 Changed 4 years ago by pedberg

From TC discussion 2013-01-09: This is difficult as it may require arbitrary lookahead and lookbehind. Should not be part of default UAX #29 behavior, but could be mentioned in a note about tailorings. Also, this is not possible to express in CLDR rules, though it is ipossible to express in ICU rules.

Actions:

  • CLDR to propose to UTC that a note be added to UAX #29 that one possible grapheme cluster tailoring is to handle IDS sequences as a single unit.
  • File bug for ICU to consider handling this in root grapheme cluster break, or perhaps CJK grapheme break tailoring

comment:2 Changed 4 years ago by pedberg

  • Owner changed from anybody to pedberg
  • Priority changed from assess to medium
  • Status changed from new to assigned
  • Milestone changed from UNSCH to 23

comment:3 Changed 4 years ago by pedberg

  • Type changed from enhancement to task

comment:4 Changed 4 years ago by pedberg

comment:5 Changed 4 years ago by pedberg

If we decide to do this in the ICU root break iterator, we should also add a corresponding comment in the CLDR rules.

comment:6 Changed 4 years ago by pedberg

  • Component changed from data-segmentation to docs

So the net here is that no change is proposed to CLDR data, just to file comments, and this is mostly about making proposals to UTC and ICU.

comment:7 Changed 4 years ago by pedberg

  • Milestone changed from 23 to 24dsub

comment:8 Changed 4 years ago by pedberg

  • Milestone changed from 24dsub to 24dres

comment:9 Changed 4 years ago by pedberg

  • Milestone changed from 24rc to 25rc

comment:10 Changed 3 years ago by pedberg

  • Milestone changed from 25rc to 26dsub

comment:11 Changed 3 years ago by pedberg

  • Milestone changed from 26dsub to 26rc

comment:12 Changed 3 years ago by pedberg

  • Milestone changed from 26rc to 27rc

comment:13 Changed 3 years ago by markus

  • Phase set to rc
  • Milestone changed from 27rc to 27

comment:14 Changed 2 years ago by pedberg

  • Milestone changed from 27 to 28

comment:15 Changed 2 years ago by markus

  • Type changed from task to docs
  • Component changed from docs to unknown

comment:16 Changed 2 years ago by srl

  • Status changed from assigned to accepted

comment:17 Changed 22 months ago by emmons

  • Phase changed from rc to final
  • Component changed from unknown to segmentation

comment:18 Changed 22 months ago by pedberg

  • Milestone changed from 28 to UNSCH
View

Add a comment

Modify Ticket

Action
as accepted
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.