|
Forum rules
Use this forum for technical discussion of UAXes 11, 14, 15, 24, 29, 31, 34, 42, and 44. Technical discussion of UTSes 6, 10, 18, 22, 39, and 46. Technical discussion of UTRs 16, 17, 20, 23, 25, 26, 33, and 36, as well as the related properties and files in the Unicode Character Database.
|
Page 1 of 1
|
[ 2 posts ] |
|
| Author |
Message |
|
guest
|
Post subject: When are graphme clusters "meaningless" Posted: Thu Dec 23, 2010 7:59 pm |
|
 |
| Guest |
Joined: Tue Dec 21, 2010 5:26 pm Posts: 3
|
Another issue raised on the list: Quote: The "grapheme cluster boundary" algorithm sems to quietly allows building meaningless "graphemes" such as base-less (sequences of) combining codes. What are we expected to do with them?
|
|
| Top |
|
 |
|
RichardWordingham
|
Post subject: Re: When are graphme clusters "meaningless" Posted: Sun Feb 17, 2013 10:05 am |
|
Joined: Sat Feb 13, 2010 4:46 pm Posts: 7
|
|
When it comes to displaying them, there are two main options if they consist entirely of non-spacing marks. The first is to display them on <U+00A0 NO-BREAK SPACE>. The second is to give an error indication, e.g. by displaying them on <U+25CC DOTTED CIRCLE>, possibly breaking up the sequence.
There are many options for rendering a spacing mark plus non-spacing marks. There are occasions when non-spacing marks are intended to be treated as though letters.
If a search string starts with a baseless cluster, I would say that was a very good argument for ignoring any 'complete graphemes only' setting when looking for the starting boundary of the matching string.
|
|
| Top |
|
 |
|
Page 1 of 1
|
[ 2 posts ] |
|
Who is online |
Users browsing this forum: No registered users and 1 guest |
|
You cannot post new topics in this forum You cannot reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot post attachments in this forum
|
|
|