[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search
 
Modify

CLDR Ticket #11065(accepted)

Opened 8 months ago

Last modified 3 days ago

Translation General guide for using [ ]

Reported by: kristi Owned by: fredrik
Component: docs-site Data Locale:
Phase: dsub Review:
Weeks: Data Xpath:
Xref:

Description (last modified by kristi) (diff)

From the translation guide:
"Don't use the characters "(" and ")", since they will be confusing in complex language names. If you have to use brackets, use square ones: [ and ]."

This is in two places in the translation guide:

  1. #5 in http://cldr.unicode.org/translation/country-names#TOC-General-Guidelines
  2. 3rd bullet in http://cldr.unicode.org/translation/language-names

Spot checking this across languages it does not look like this is followed at all.
We found use of ( ), but not [ ]

We should revisit the reason to reason for this guide.

Attachments

Change History

comment:1 Changed 7 months ago by pedberg

  • Cc kristi added
  • type changed from unknown to site

The reason is that parentheses are often used when combining the name of a language with the name of a script and/or language, for example:

  • Uzbek (Latin), Uzbek (Arabic)
  • Uzbek (Afghanizstan)
  • Uzbek (Arabic, Afghanistan)

So having parentheses in the name of an individual component like language or region would be confusing when combining them. However, the actual pattern that is used to do the combining is something that is also part of the locale data, see for example http://st.unicode.org/cldr-apps/v#/en/Locale_Name_Patterns/4c3249dbb329101c

So maybe the real guidance should be that in names for language, regions, or scripts, don't use bracketing characters (parentheses or other) that are the same as what are specified in the localePattern. We could add a test for that if we do not already have one.

Last edited 7 months ago by pedberg (previous) (diff)

comment:2 Changed 7 weeks ago by mark

  • Owner changed from anybody to fredrik
  • Component changed from unknown to docs-site

comment:3 Changed 3 weeks ago by kristi

  • Description modified (diff)

On http://cldr.unicode.org/translation/country-names#TOC-Geopolitically-Sensitive-Names

The instruction is "Some of these require brackets, which should either be the ASCII ones (…), or the full-width ones (CJK)."
Thus, this instruction includes use of:
Ascii ( )U+0028 and U+0029
CJK ( ) U+FF08 and U+FF09

And they are not [ ].

comment:4 Changed 3 days ago by mark

  • Status changed from new to accepted

Setting non-new items to be status=accepted, as per policy discussion

View

Add a comment

Modify Ticket

Action
as accepted
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.