Accumulated Feedback on PRI #334

This page is a compilation of formal public feedback received so far. See Feedback for further information on this issue, how to discuss it, and how to provide feedback.

Date/Time: Tue Sep 27 09:37:17 CDT 2016
Name: Gervase Markham
Report Type: Error Report (UTS #39)
Opt Subject: Unicode: Κʻ / ĸ (U+0138, *Kra*) should be classed as "historic" in xidmodifications.txt

See https://bugzilla.mozilla.org/show_bug.cgi?id=1256009 . We think that
Unicode: Κʻ / ĸ (U+0138, *Kra*) should be added to the "historic" characters
defined in http://unicode.org/Public/security/latest/xidmodifications.txt.

Our research suggests this is justified:
https://en.wikipedia.org/wiki/Kra_(letter) says "In 1973, a spelling reform
replaced the use of kra in Greenlandic with Latin small letter q (and the
associated Latin capital letter with Q)".

This is an issue because it's a homograph for k, but is considered a Latin
letter, and so script mixing algorithms don't disallow domain names such as
http://vĸ.com.

Thanks,

Gerv
(Mozilla)
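
The script-mixing point in the report above can be illustrated with a minimal sketch. It assumes a deliberately crude script test: the first word of a character's Unicode name stands in for the real Script property, which Python's standard library does not expose. The helper names `script_guess` and `is_single_script` are hypothetical and are not taken from any browser implementation.

```python
import unicodedata

def script_guess(ch: str) -> str:
    """Approximate a character's script by the first word of its Unicode name.

    Assumption: this is only a stand-in for the Script property.
    """
    return unicodedata.name(ch).split()[0]

def is_single_script(label: str) -> bool:
    """Return True when every character in the label shares one guessed script."""
    return len({script_guess(ch) for ch in label}) == 1

# U+0138 (kra) is named as a Latin letter, so a label mixing it with "v"
# is not flagged by a check of this kind.
print(script_guess("\u0138"))
print(is_single_script("v\u0138"))
# U+043A (Cyrillic ka) next to "v" is a genuine script mix.
print(is_single_script("v\u043a"))
```

Under this approximation the kra label passes while the Cyrillic one is rejected, which is why the report asks for a separate "historic" classification rather than relying on script mixing alone.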

Feedback above this line was reviewed prior to UTC #150.

Date/Time: Tue Nov 22 13:11:37 CST 2016
Name: David Corbett
Report Type: Public Review Issue
Opt Subject: PRI #334: U+025F is not obsolete

U+025F LATIN SMALL LETTER DOTLESS J WITH STROKE is technical but not obsolete.

Feedback above this line was reviewed at UTC #150.

Date/Time: Sun Apr 2 09:54:21 CDT 2017
Name: S. Gilles
Report Type: Other Question, Problem, or Feedback
Opt Subject: Suggestions for confusables

# I have some things that may be confusables. I only care about
# ASCII-level confusion, so I ignore diacritics, etc.

01AB ;  0074 ;  MA      # ( ƫ → t ) LATIN SMALL LETTER T WITH PALATAL HOOK → LATIN SMALL LETTER T
0272 ;  006E ;  MA      # ( ɲ → n ) LATIN SMALL LETTER N WITH LEFT HOOK → LATIN SMALL LETTER N
0291 ;  007A ;  MA      # ( ʑ → z ) LATIN SMALL LETTER Z WITH CURL → LATIN SMALL LETTER Z
0298 ;  004F ;  MA      # ( ʘ → O ) LATIN LETTER BILABIAL CLICK → LATIN CAPITAL LETTER O
0629 ;  006F ;  MA      # ( ة → o ) ARABIC LETTER TEH MARBUTA → LATIN SMALL LETTER O
1D07 ;  0045 ;  MA      # ( ᴇ → E ) LATIN LETTER SMALL CAPITAL E → LATIN CAPITAL LETTER E
2365 ;  004F ;  MA      # ( ⍥ → O ) APL FUNCTIONAL SYMBOL CIRCLE DIAERESIS → LATIN CAPITAL LETTER O
0644 ;  004A ;  MA      # ( ل → J ) ARABIC LETTER LAM → LATIN CAPITAL LETTER J
1472 ;  0062 ;  MA      # ( ᑲ → b ) CANADIAN SYLLABICS KA → LATIN SMALL LETTER B
1473 ;  0062 ;  MA      # ( ᑳ → b ) CANADIAN SYLLABICS KAA → LATIN SMALL LETTER B
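
# The lines in this submission follow a "source ; target ; type # comment"
# layout. A minimal sketch of reading them is shown here; the function name
# parse_mapping is hypothetical, and the sketch assumes only the layout
# visible in this submission, not the full confusables.txt syntax.

```python
def parse_mapping(line: str):
    """Parse one 'source ; target ; type # comment' line.

    Returns (source_char, target_string, type_or_None), or None for
    blank and comment-only lines.
    """
    data = line.split("#", 1)[0]                  # drop the trailing comment
    fields = [f.strip() for f in data.split(";")]
    if len(fields) < 2 or not fields[0]:
        return None
    source = chr(int(fields[0], 16))
    # The target may be a sequence of space-separated code points.
    target = "".join(chr(int(cp, 16)) for cp in fields[1].split())
    kind = fields[2] if len(fields) > 2 and fields[2] else None
    return source, target, kind

print(parse_mapping("01AB ;  0074 ;  MA      # LATIN SMALL LETTER T WITH PALATAL HOOK"))
print(parse_mapping("2776 ;  0028 0031 0029"))
```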

# I'm not sure if these should be considered confusables or whether
# these should be submitted for changes to Decomposition_Mapping.
# If the latter: well, here they are. They should probably have
# something, even if not the replacement I suggest.

# DINGBAT NEGATIVE CIRCLED DIGIT N
2776 ;  0028 0031 0029
2777 ;  0028 0032 0029
2778 ;  0028 0033 0029
2779 ;  0028 0034 0029
277A ;  0028 0035 0029
277B ;  0028 0036 0029
277C ;  0028 0037 0029
277D ;  0028 0038 0029
277E ;  0028 0039 0029
277F ;  0028 0031 0030 0029

# DINGBAT CIRCLED SANS-SERIF DIGIT N
2780 ;  0028 0031 0029
2781 ;  0028 0032 0029
2782 ;  0028 0033 0029
2783 ;  0028 0034 0029
2784 ;  0028 0035 0029
2785 ;  0028 0036 0029
2786 ;  0028 0037 0029
2787 ;  0028 0038 0029
2788 ;  0028 0039 0029
2789 ;  0028 0031 0030 0029

# DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT N
278A ;  0028 0031 0029
278B ;  0028 0032 0029
278C ;  0028 0033 0029
278D ;  0028 0034 0029
278E ;  0028 0035 0029
278F ;  0028 0036 0029
2790 ;  0028 0037 0029
2791 ;  0028 0038 0029
2792 ;  0028 0039 0029
2793 ;  0028 0031 0030 0029
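
# The three lists above follow one pattern: ten consecutive code points,
# each mapped to the number 1 through 10 in ASCII parentheses. A short
# sketch that regenerates them, assuming only that pattern (the name
# proposed_lines is hypothetical):

```python
# First code point of each run of ten, labelled with the headings used above.
BLOCK_STARTS = {
    0x2776: "DINGBAT NEGATIVE CIRCLED DIGIT N",
    0x2780: "DINGBAT CIRCLED SANS-SERIF DIGIT N",
    0x278A: "DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT N",
}

def proposed_lines(start: int) -> list:
    """Build the ten 'source ;  target' lines for one run starting at `start`."""
    lines = []
    for n in range(1, 11):
        source = start + n - 1
        target = f"({n})"                          # e.g. "(1)" or "(10)"
        target_hex = " ".join(f"{ord(c):04X}" for c in target)
        lines.append(f"{source:04X} ;  {target_hex}")
    return lines

for start, heading in BLOCK_STARTS.items():
    print(f"# {heading}")
    print("\n".join(proposed_lines(start)))
```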

# And now, the ones there are no chance of taking, but I'll mention
# them because I use them and find them confusing at low resolutions.

00DF ;  0042 ;  MA      # ( ß → B ) LATIN SMALL LETTER SHARP S → LATIN CAPITAL LETTER B
# This would affect A7B5, 03B2, 03D0, 1D6C3, 1D6FD, 1D737, 1D771, 1D7AB, 13F0

0460 ;  0077 ;  MA      # ( Ѡ → W ) CYRILLIC CAPITAL LETTER OMEGA → LATIN CAPITAL LETTER W
# I guess this would be vv instead.

A793 ;  0065 ;  MA      # ( ꞓ → e ) LATIN SMALL LETTER C WITH BAR → LATIN SMALL LETTER E
# I find 03F5 confusable with 0065, however, and to suggest that
# change, A793 must be made confusable.

Date/Time: Fri May 5 12:16:22 CDT 2017
Name: John Colosi
Report Type: Error Report
Opt Subject: Confusables: 043C <-> 004D

I am referring to the confusables.txt located here:
ftp://ftp.unicode.org/Public/security/latest/confusables.txt

It looks like the Cyrillic Small Letter EM (U+043C) is *not* considered to be
confusable with the Latin Capital Letter M (U+004D).

There are other characters listed as confusable with Latin Capital Letter M
(U+004D), including other Cyrillic code points such as Cyrillic Capital Letter
EM (U+041C).

I realize that this may be a case-folding design decision. But for
implementers of this confusability mapping, I'm wondering what mechanism would
let them detect that Cyrillic Small Letter EM (U+043C) is confusable with the
Latin Capital Letter M (U+004D).
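
The gap described above can be sketched with a skeleton-style comparison. This is a minimal illustration, assuming the usual shape of that comparison (normalize to NFD, replace each character by its mapped prototype, normalize again). The table `TOY_CONFUSABLES` is hypothetical: it holds only the single U+041C mapping mentioned in this report and is not the content of confusables.txt.

```python
import unicodedata

# Hypothetical one-entry table: Cyrillic capital EM maps to Latin capital M,
# as the report says the data file already does. Nothing else is assumed.
TOY_CONFUSABLES = {"\u041c": "M"}

def skeleton(text: str, table=TOY_CONFUSABLES) -> str:
    """Normalize, substitute mapped prototypes, and normalize again."""
    decomposed = unicodedata.normalize("NFD", text)
    mapped = "".join(table.get(ch, ch) for ch in decomposed)
    return unicodedata.normalize("NFD", mapped)

def confusable(a: str, b: str, table=TOY_CONFUSABLES) -> bool:
    """Two strings are treated as confusable when their skeletons match."""
    return skeleton(a, table) == skeleton(b, table)

print(confusable("\u041c", "M"))   # capital EM against M
print(confusable("\u043c", "M"))   # small em against M
```

With this toy table the capital letter matches and the small letter does not, because no case folding is applied and no entry exists for U+043C. An implementer who wants the second comparison to succeed would have to add a mapping or a folding step of their own; whether that is appropriate is the question the report raises.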