We may be driving towards fundamentally different things.
I am not sure if there is a significant overlap.
I see XObject as being founded in the DOM for navigation,
with a set of bindings which transforms a document into a
composition of components which themselves are an active
aspect of the application:
document + bindings + classes = application-specific agent
This approach precludes many things. It is not generic to
XML processing.
Bill