Re: Unicode education in Schools

From: Richard Wordingham via Unicode <unicode_at_unicode.org>
Date: Sun, 27 Aug 2017 05:26:45 +0100

On Fri, 25 Aug 2017 09:36:44 -0400
John W Kennedy <john.w.kennedy_at_gmail.com> wrote:

> Just a reminder that in Apple’s Swift a “Character” is anything that
> looks like a character, including a letter with any theoretically
> unlimited stack of diacritics, a flag, or a skin-toned emoji, and all
> Swift functions working with characters, strings, and substrings
> count characters in this way. There is an underlying store that is,
> for historic reasons, UTF-16, and that can be accessed, but so can
> UTF-8 and UTF-32.

Can the individual Unicode characters be accessed one by one, e.g. for
searching for vowels or other such 'diacritics'? Or would one only
have access to the code units?

Could one easily search for a subjoined consonant, e.g. COENG RO
<U+17D2 KHMER SIGN COENG, U+179A KHMER LETTER RO> in Khmer, where the
two constituent characters would be in adjacent extended grapheme
clusters?

Richard.
Received on Sat Aug 26 2017 - 23:27:03 CDT

This archive was generated by hypermail 2.2.0 : Sat Aug 26 2017 - 23:27:04 CDT