[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #6580(accepted data)

Opened 4 years ago

Last modified 2 years ago

Fix spurious zero-width characters

Reported by: mark Owned by: shervin
Component: main Data Locale:
Phase: dsub Review:
Weeks: Data Xpath:
Xref:

Description

I was looking at the narrow forms of temperature in a spreadsheet, and produced a unique list, getting the following. Note that lines 7-10 appear to be doubled. I investigated, and the problem is that there is a spurious U+200E ( ‎ ) LEFT-TO-RIGHT MARK in them. With a bit of investigation, I found that there are three cases where they occur.

ur.xml {0}‎°C
ur.xml {0}‎°C
ur.xml {0}‎°F

Three is always a bad number: it doesn't include all the Urdu cases (leaves out one of the ‎°F cases). Moreover, you'd think that if it were needed in Urdu, it would be needed in other BIDI languages.

Recommendations

  1. We should have some mechanism to make [:di:] characters visible to translators so that they can make sure that they are used correctly for their language.
  2. For this release, we might want to review a few of the most problematic cases, such as the above.

Background Info

Here are the [:di:] characters in our source (/main)

[\u00AD\u200B-\u200F\u202A\u202C]

Latin 1 Supplement — Latin-1 punctuation and symbols items: 1

U+00AD ( ) SOFT HYPHEN

General Punctuation — Spaces items: 1

U+200B ( ) ZERO WIDTH SPACE

General Punctuation — Format character items: 6

U+200C ( ) ZERO WIDTH NON-JOINER
U+200D ( ) ZERO WIDTH JOINER
U+200E ( ‎ ) LEFT-TO-RIGHT MARK
U+200F ( ‎‏‎ ) RIGHT-TO-LEFT MARK
U+202A ( ) LEFT-TO-RIGHT EMBEDDING
U+202C ( ) POP DIRECTIONAL FORMATTING


The original suspicious list:

1 {0} °C
2 {0} °F
3 {0} د ف
4 {0} ອົງສາ ຊີ.
5 {0} ອົງສາ ຟ.
6 {0}°
7 {0}°C
8 {0}‎°C
9 {0}°F
10 {0}‎°F
11 {0}°ሴ
12 {0}°ፋ
13 {0}°फ
14 {0}°फॅ
15 {0}°से
16 {0}°ஃபா.
17 {0}°செ.
18 {0}°ಫ್ಯಾ
19 {0}°ಸೆ
20 {0}°ഫാ
21 {0}°സെ
22 {0}د م
23 සෙල්. {0}°
24 ෆැර. {0}°

Attachments

Change History

comment:1 Changed 4 years ago by emmons

  • Status changed from new to assigned
  • Component changed from unknown to data
  • Priority changed from assess to major
  • Milestone changed from UNSCH to 25dsub
  • Owner changed from anybody to roozbeh
  • Type changed from unknown to task

comment:2 Changed 4 years ago by emmons

  • Milestone changed from 25dsub to 25M1

Moving all 25dsub to 25M1. Please adjust the milestone if you are not planning to complete the item in the 25M1 time frame.

comment:3 Changed 4 years ago by emmons

  • Milestone changed from 25M1 to 25rc

Moving Roozbeh's 25M1 to 25rc

comment:4 Changed 4 years ago by emmons

  • Milestone changed from 25rc to 26rc

comment:5 Changed 3 years ago by mark

  • Milestone changed from 26rc to 27dsub

comment:6 Changed 3 years ago by markus

  • Phase set to dsub
  • Milestone changed from 27dsub to 27

comment:7 Changed 3 years ago by roozbeh

  • Milestone changed from 27 to 28

comment:8 Changed 2 years ago by roozbeh

  • Owner changed from roozbeh to shervin

comment:9 Changed 2 years ago by markus

  • Type changed from task to data

comment:10 Changed 2 years ago by srl

  • Status changed from assigned to accepted

comment:11 Changed 2 years ago by shervin

  • Milestone changed from 28 to 29

comment:12 Changed 2 years ago by emmons

  • Milestone changed from 29 to upcoming

Automatic move of all 29 -> upcoming

View

Add a comment

Modify Ticket

Action
as accepted
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.