[Unicode]   Common Locale Data Repository : Bug Tracking Home | Site Map | Search

CLDR Ticket #8368(accepted data)

Opened 3 years ago

Last modified 3 years ago

Hyphen at word boundary should be part of the word in Finnish

Reported by: hatapitk@… Owned by: andy
Component: other Data Locale: FI
Phase: rc Review:
Weeks: Data Xpath:


In Finnish HYPHEN-MINUS may be used at the beginning or end of the word to represent a part of the word that has been omitted. For example

"sosiaali- ja terveysministeriö" (ministry of social affairs and health)

According to current CLDR rules there is a word boundary between "sosiaali" and "-". This causes problems with spell checking since "sosiaali" is not a valid word, it is a prefix. It would be better to consider the hyphen as a part of the word.

Another example is

"syntymäaika ja -paikka" (date and place of birth)

Here it would also be correct to consider the hyphen as a part of the word but current behavior does not cause problems with spell checking.

At least Microsoft Office does consider hyphen to be part of the word similarly as I have suggested here. OpenOffice.org and LibreOffice had similar customization for CLDR rules for many years until recently the customization was removed to fix other issues (see https://bugs.documentfoundation.org/show_bug.cgi?id=55707). Rather than re-applying the customization in LibreOffice I would appreciate if you could consider applying this tweak directly to Finnish CLDR rules.


Change History

comment:1 Changed 3 years ago by kent.karlsson14@…

Not sure why this is requested as something special for Finnish. Ok, maybe the spell check problem mentioned is peculiar to Finnish, but other languages use hyphen in a similar way as examplified. So I think it should be a general rule, nothing special for Finnish.

In addition, this applies not only for HYPHEN-MINUS but also HYPHEN and NOBREAK HYPHEN.

comment:2 Changed 3 years ago by emmons

  • Status changed from new to accepted
  • Cc pedberg added
  • Component changed from unknown to other
  • Priority changed from assess to medium
  • Phase changed from dsub to rc
  • Milestone changed from UNSCH to 28
  • Owner changed from anybody to andy
  • Type set to data

comment:3 Changed 3 years ago by emmons

  • Milestone changed from 28 to 28roll

Moving all outstanding 28 tickets to 28roll. We will discuss disposition of these at the next CLDR TC.


Add a comment

Modify Ticket

as accepted

E-mail address and user name can be saved in the Preferences.

Note: See TracTickets for help on using tickets.