Re: REC-xml-19980210: whitespace

David Brownell (db@Eng.Sun.COM)
Thu, 15 Oct 1998 09:21:00 -0700


Richard Tobin wrote:
>
> > > "a single #x20
> > > is appended for a "#xD#xA" sequence that is part of an external
> > > parsed entity or the literal entity value of an internal parsed
> > > entity"
>
> > This seems like a buglet to me.
>
> Is not the document itself an external parsed entity? It's an entity,
> it's parsed, and it's not internal...

Moreover, 2.11 (End-of-Line Handling) says, albeit oddly, that
CRLF gets normalized to LF everywhere, and that LF would then get
normalized to a single space inside an attribute (or Public
Identifier) value.

There's an inconsistency somewhere with respect to CRLF handling,
but the "always normalize CRLF to LF" approach makes things
generally consistent. One issue is that 2.11 identifies this
approach only in a parenthetical (non-normative?) comment, while
the normative text there is more restrictive.
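
To make the two readings concrete, here is a rough sketch in Python.
It is only an illustration, not how any particular parser does it.
The function names are made up, and it ignores character and entity
references, which attribute-value normalization also has to handle.
It compares normalizing line ends first against applying the
per-character whitespace rule directly to the raw text.

```python
import re


def normalize_line_ends(text: str) -> str:
    """Turn each CRLF pair and each lone CR into a single LF."""
    return re.sub(r"\r\n?", "\n", text)


def normalize_attr_whitespace(value: str) -> str:
    """Replace each literal whitespace character (#x20, #xD, #xA, #x9)
    with one #x20. Hypothetical helper: no reference handling."""
    return re.sub(r"[ \r\n\t]", " ", value)


raw = "a\r\nb"

# Line ends normalized first: CRLF -> LF -> one space.
two_step = normalize_attr_whitespace(normalize_line_ends(raw))

# Per-character rule applied to the raw text: CR and LF each
# become a space, so the CRLF pair turns into two spaces.
direct = normalize_attr_whitespace(raw)

print(repr(two_step))
print(repr(direct))
```

With line ends normalized up front, the "single #x20 for #xD#xA"
special case falls out on its own; without that step, the special
case is what keeps a CRLF from turning into two spaces.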

- Dave