Re: Questionable lines on LineBreakTest.txt

From: Masaaki Shibata (shibatamasaaki@gmail.com)
Date: Tue Jun 08 2010 - 02:55:58 CDT

  • Next message: Luke-Jr: "Re: Hexadecimal digits"

    Asmus, Mark, thank you for replying.

    I'm very surprised. These document and test file must have been public
    for years and I couldn't find any cautions or notations about that on
    their site. This is very misleading. Most developers will reasonably
    expect this text file will be useful.

    I agree with Mark. I hope some UTC people will notice our argument.

    Ref. I've got 17 cases of the same kind of contradiction on
    LineBreakTest.txt. They are all seemed to be against LB25:

    l.1137: ÷ [0.2] RIGHT PARENTHESIS (CP) ÷ [999.0] PERCENT SIGN (PO) ÷ [0.3]
    l.1139: ÷ [0.2] RIGHT PARENTHESIS (CP) × [9.0] COMBINING DIAERESIS
    (CM) ÷ [999.0] PERCENT SIGN (PO) ÷ [0.3]
    l.1141: ÷ [0.2] RIGHT PARENTHESIS (CP) ÷ [999.0] DOLLAR SIGN (PR) ÷ [0.3]
    l.1143: ÷ [0.2] RIGHT PARENTHESIS (CP) × [9.0] COMBINING DIAERESIS
    (CM) ÷ [999.0] DOLLAR SIGN (PR) ÷ [0.3]
    l.2569: ÷ [0.2] COMMA (IS) ÷ [999.0] DIGIT ZERO (NU) ÷ [0.3]
    l.2571: ÷ [0.2] COMMA (IS) × [9.0] COMBINING DIAERESIS (CM) ÷ [999.0]
    DIGIT ZERO (NU) ÷ [0.3]
    l.3869: ÷ [0.2] PERCENT SIGN (PO) ÷ [999.0] LEFT PARENTHESIS (OP) ÷ [0.3]
    l.3871: ÷ [0.2] PERCENT SIGN (PO) × [9.0] COMBINING DIAERESIS (CM) ÷
    [999.0] LEFT PARENTHESIS (OP) ÷ [0.3]
    l.4013: ÷ [0.2] DOLLAR SIGN (PR) ÷ [999.0] LEFT PARENTHESIS (OP) ÷ [0.3]
    l.4015: ÷ [0.2] DOLLAR SIGN (PR) × [9.0] COMBINING DIAERESIS (CM) ÷
    [999.0] LEFT PARENTHESIS (OP) ÷ [0.3]
    l.4441: ÷ [0.2] SOLIDUS (SY) ÷ [999.0] DIGIT ZERO (NU) ÷ [0.3]
    l.4443: ÷ [0.2] SOLIDUS (SY) × [9.0] COMBINING DIAERESIS (CM) ÷
    [999.0] DIGIT ZERO (NU) ÷ [0.3]
    l.5226: ÷ [0.2] LATIN SMALL LETTER E (AL) × [28.0] LATIN SMALL LETTER
    Q (AL) × [28.0] LATIN SMALL LETTER U (AL) × [28.0] LATIN SMALL LETTER
    A (AL) × [28.0] LATIN SMALL LETTER L (AL) × [28.0] LATIN SMALL LETTER
    S (AL) × [7.01] SPACE (SP) × [13.02] FULL STOP (IS) ÷ [999.0] DIGIT
    THREE (NU) × [25.03] DIGIT FIVE (NU) × [7.01] SPACE (SP) ÷ [18.0]
    LATIN SMALL LETTER C (AL) × [28.0] LATIN SMALL LETTER E (AL) × [28.0]
    LATIN SMALL LETTER N (AL) × [28.0] LATIN SMALL LETTER T (AL) × [28.0]
    LATIN SMALL LETTER S (AL) ÷ [0.3]

    Notice that they are the only cases i've found. There may be more.

    I also took a glance at LineBreakTest-6_0_0d4.txt and found same
    contradictions there too.

    Thanks.



    This archive was generated by hypermail 2.1.5 : Tue Jun 08 2010 - 11:00:42 CDT