Re: Why do binary files contain text but text files don't contain binary?

From: Richard Wordingham via Unicode <unicode_at_unicode.org>
Date: Fri, 21 Feb 2020 16:17:09 +0000

On Fri, 21 Feb 2020 15:53:52 +0000
"Costello, Roger L. via Unicode" <unicode_at_unicode.org> wrote:

> Based on a private correspondence, I now realize that this statement:
>
>
>
> > Text files do not contain binary
>
>
>
> is not correct.
>
>
>
> Text files may indeed contain binary (i.e., bytes that are not
> interpretable as characters). Namely, text files may contain
> newlines, tabs, and some other invisible things.
>
>
>
> Question: "characters" are defined as only the visible things, right?

No, white space (e.g. spaces, tabs and newlines) is normally considered
to be composed of characters. And then there are much harder to discern
things, such as zero-width spaces, line-break suppressors such as
U+2060 WORD JOINER, and soft hyphens (interpreted as line-break
opportunities).

Richard.
Received on Fri Feb 21 2020 - 10:17:31 CST

This archive was generated by hypermail 2.2.0 : Fri Feb 21 2020 - 10:17:31 CST