Question about the Sentence_Break property
public at khwilliamson.com
Thu Feb 19 20:55:20 CST 2015
UAX 29 says this:
Break after paragraph separators.
SB4. Sep | CR | LF
Why are CR and LF considered to be paragraph separators? NEL and Line
Break are as well.
My mental model of plain text has it containing embedded characters,
which I'll call \n, to allow it to be displayed in a terminal window of
a given width. Not all text is like that, of course, but there is an
awful lot that is. This rule makes no sense to me.
More information about the Unicode