Mark
(https://twitter.com/mark_e_davis)
On Thu, Aug 24, 2017 at 11:01 PM, Asmus Freytag via Unicode <
unicode_at_unicode.org> wrote:
> On 8/24/2017 10:17 AM, Andre Schappo via Unicode wrote:
>
>> Because there are many systems that can now handle BMP characters but not
>> cannot handle SMP characters.
>>
>> One example being systems that use mysql utf8 (3 byte encoding) and have
>> not yet updated to utf8mb4 (4 byte encoding)
>>
>> So, I consider it important to familiarise students with SMP characters
>> as well as BMP characters. Then when they develop software they will, at
>> the start, be thinking beyond ASCII and Unicode BMP characters.
>>
>
> The thinking "beyond BMP" part only comes in when you work in encoding
> forms where the BMP uses a different number of code units than the SMP (or
> any other non-BMP "page"). This is true for both utf8 and utf16 but not if
> you work in utf32 or in scalar values (as in the posted exercise).
>
>
> The trick with using emoji in this lesson is that the descriptions and
> images are meaningful to any English speaker, so it gets the student to
> learn about character names.
>
> The same exercise would be more of a challenge for students whose native
> tongue is not English.
> The trick with using emoji...
True. For emoji names it would be better to use the CLDR names with
non-anglophone audiences, since those names are available in a number of
languages.
eg http://www.unicode.org/cldr/charts/31/annotations/romance.html#%f0%9f%98%95 (that
was last release's version; next release will have improvements...)
>
>
> A./
>
>
>> André Schappo
>>
>> On 24 Aug 2017, at 17:45, Shriramana Sharma <samjnaa_at_gmail.com> wrote:
>>>
>>> So how do you think it matters if the characters are in the BMP or SMP?
>>>
>>
>>
>>
>
Received on Fri Aug 25 2017 - 03:29:01 CDT
This archive was generated by hypermail 2.2.0 : Fri Aug 25 2017 - 03:29:01 CDT