Another take on the English apostrophe in Unicode

Leo Broukhis leob at
Thu Jun 4 21:01:56 CDT 2015

Along the same lines, we might need a MODIFIER LETTER HYPHEN, because, for
example, the work ack-ack isn't decomposable into words, or even morphemes,
"ack" and "ack".


On Thu, Jun 4, 2015 at 6:31 PM, David Starner <prosfilaes at> wrote:

> On Thu, Jun 4, 2015 at 2:38 PM Markus Scherer < at>
> wrote:
>> "don’t" is a contraction of two words, it is not one word.
> But as he points out, it's not a contraction of don and t; it is, at best,
> a contraction of do and n't. It's eliding, not punctuating. In the
> comments, he also brings up the examples of "Don’t you mind?" being okay
> but not *"Do not you mind?", and "fo’c’sle".
> > You can't use simple regular expressions to find word boundaries.
> Who uses _simple_ regular expressions? You can't use any code to reliably
> find word boundaries in English, and that's a problem.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <>

More information about the Unicode mailing list