[Unicode]   Unicode Localization Interoperability Technical Committee : Bug Tracking Home | Site Map | Search
 
Modify

ULI Ticket #18348(new defect)

Opened 4 weeks ago

Last modified 4 weeks ago

Check `fa` abbreviations against exemplar

Reported by: shervinafshar@… Owned by: somebody
Component: data Version: Current

Description

Background: Before a Persian standard keyboard became widespread and widely available on all platforms, Iranian users were using Arabic keyboards shipped with software products. Two issues encountered using these keyboards was that they mislead the users to use ي (U+064A ARABIC LETTER YEH) in place of ی (U+06CC ARABIC LETTER FARSI YEH) and ك (U+0643 ARABIC LETTER KAF) in place of ک (U+06A9 ARABIC LETTER KEHEH) specifically as the initial and medial forms are indistinguishable.

When Persian Wikipedia community encountered this issue, they decided to keep the article titles in Persian letters, but keep some of the redirects which were mistakenly using Arabic letters.

These should be excluded from the data collected for Persian through aggregating Wikipedia articles.

Attachments

Change History

comment:1 Changed 4 weeks ago by shervinafshar@…

It seems like a good idea to check the data for any locales against the exemplar to prevent issues like this:

http://unicode.org/uli/trac/browser/trunk/abbrs/xls_dbpedia/nl.tsv?rev=57#L295

View

Add a comment

Modify Ticket

Action
as new
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.