Do 'Grapheme_Extend' characters only apply to 'Grapheme_Base'?

From: Doug Ewell <doug_at_ewellic.org>
Date: Thu, 24 Apr 2014 12:25:42 -0700

Mathias Bynens <mathias at qiwi dot be> wrote:

> Let's say I'm writing a program that strips combining characters and
> grapheme extenders from an input string.
>
> For combining marks, I'm looking for any non-combining marks (e.g.
> 'a') followed by one or more combining marks (e.g. ' ̃'), and then I
> remove everything but the non-combining mark (e.g. leaving only 'a').
> Is this a correct approach?

It's entirely up to you. This is a rather unusual thing to want to do
with text. Fr mn lnggs, t wld b qvlnt t strppng ll vwls t f th txt.

--
Doug Ewell | Thornton, CO, USA
http://ewellic.org | @DougEwell
_______________________________________________
Unicode mailing list
Unicode_at_unicode.org
http://unicode.org/mailman/listinfo/unicode

Received on Thu Apr 24 2014 - 14:26:58 CDT

This message: [ Message body ]
Next message: Whistler, Ken: "RE: Do `Grapheme_Extend` characters only apply to `Grapheme_Base`?"
Previous message: Philippe Verdy: "Re: Unclear text in the UBA (UAX#9) of Unicode 6.3"

Mail actions: [ respond to this message ] [ mail a new topic ]
Contemporary messages sorted: [ by date ] [ by thread ] [ by subject ] [ by author ] [ by messages with attachments ]

This archive was generated by hypermail 2.2.0 : Thu Apr 24 2014 - 14:26:58 CDT