[PATCH] Fix potentially broken white space truncation on certain Windows versions

Linus Torvalds torvalds at linux-foundation.org
Thu Mar 7 11:46:31 PST 2013


On Thu, Mar 7, 2013 at 11:22 AM, Dirk Hohndel <dirk at hohndel.org> wrote:
> BZZZZT. Wrong. Dang
>
> U+00A0 (UTF-8 bytes C2 A0) is a non-breaking space and U+FEFF is a zero
> width no-break space.
>
> So at least in parse-xml.c/utf8_string() we need to handle this
> differently...

No no no.

They aren't space. They are just odd utf8 characters that have a
*glyph* that is space.

Don't treat them specially. We only want to remove true whitespace
(tab, true space, newline).

              Linus
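
A minimal sketch of the rule above, for illustration only. It is not the
actual utf8_string() from parse-xml.c; the helper names
is_true_whitespace() and trim_true_whitespace() are hypothetical. It
assumes a NUL-terminated UTF-8 buffer and compares bytes directly rather
than calling isspace(), whose answer for bytes such as 0xA0 may depend on
the platform and locale.

```c
#include <string.h>

/* Hypothetical helper: only tab, plain space and newline count. */
static int is_true_whitespace(unsigned char c)
{
	return c == ' ' || c == '\t' || c == '\n';
}

/*
 * Hypothetical helper: trim leading and trailing true whitespace.
 * Modifies buf in place and returns a pointer to the first kept byte.
 * Multi-byte UTF-8 sequences (e.g. C2 A0 for U+00A0) are left alone,
 * because none of their bytes match the three ASCII values above.
 */
static char *trim_true_whitespace(char *buf)
{
	char *end;

	while (is_true_whitespace((unsigned char)*buf))
		buf++;

	end = buf + strlen(buf);
	while (end > buf && is_true_whitespace((unsigned char)end[-1]))
		end--;
	*end = '\0';

	return buf;
}
```

With this, a buffer holding " \tdive\n" comes back as "dive", while a
leading or trailing U+00A0 stays in the string untouched.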

