I've tried to read http://www.w3.org/TR/xml-infoset/ and the wikipedia entry. But frankly I'm still not sure what the difference is. The quote : <blockquote> An XML document has an information set if it is well-formed and satisfies the namespace constraints. There is no requirement for an XML document to be valid in order to have an information set. </blockquote> From the wikipedia entry seems to not make sense. How can a non valid document have any semantics, and thus how can it be an 'information' set? What is this 'infoset' that <blockquote> well-formed and satisfies the namespace constrained </blockquote> XML has? And in what way it is useful in itself. In other words why is it, semantically speaking, necessary to define the XML infoset? Is there any information that cannot be represented in XML? If so I can see the limiting set of the XML Infoset, but if not surely the XML Infoset is as meaningless as term 'information'? Thank you for the interesting answers: I still cannot grasp why the Xml infoset has any purpose as opposed to the term infoset. But you guys have given me the direct answer to the question.

XML is not text. XML "is" the XML infoset. This may then be serialized into text in an XML document, but it is the XML infoset that is the reality. The infoset may exist in memory as a DOM tree, for instance. It exists in memory as the implementation of an abstract object model. What if I serialized it as UTF-8 and then as UTF-16. Chances are the results would be two different sets of bits, but same infoset. Consider also that with text it makes sense to do things like string concatenation. You don't want to concatenate a "<" into the middle of an XML element. You have to encode it first. Why would you have to do this if it were just text? If you used the DOM, for instance, you'd just say element.InnerText = "<"; When serialized, the "<" would be encoded into "&lt;". Yet it's the same infoset.

What is an XML infoset and in what ways is it different to an XML document?

Tags:

xml

xml-validation

well-formed

infoset

I've tried to read http://www.w3.org/TR/xml-infoset/ and the wikipedia entry. But frankly I'm still not sure what the difference is.

The quote :

An XML document has an information set if it is well-formed and satisfies the namespace constraints. There is no requirement for an XML document to be valid in order to have an information set.

From the wikipedia entry seems to not make sense. How can a non valid document have any semantics, and thus how can it be an 'information' set?

What is this 'infoset' that

well-formed and satisfies the namespace constrained

XML has? And in what way it is useful in itself. In other words why is it, semantically speaking, necessary to define the XML infoset? Is there any information that cannot be represented in XML? If so I can see the limiting set of the XML Infoset, but if not surely the XML Infoset is as meaningless as term 'information'?

Thank you for the interesting answers: I still cannot grasp why the Xml infoset has any purpose as opposed to the term infoset. But you guys have given me the direct answer to the question.

650

asked May 08 '09 10:05

Preet Sangha

1 Answers

XML is not text. XML "is" the XML infoset. This may then be serialized into text in an XML document, but it is the XML infoset that is the reality.

The infoset may exist in memory as a DOM tree, for instance. It exists in memory as the implementation of an abstract object model.

What if I serialized it as UTF-8 and then as UTF-16. Chances are the results would be two different sets of bits, but same infoset.

Consider also that with text it makes sense to do things like string concatenation. You don't want to concatenate a "<" into the middle of an XML element. You have to encode it first. Why would you have to do this if it were just text? If you used the DOM, for instance, you'd just say element.InnerText = "<"; When serialized, the "<" would be encoded into "<". Yet it's the same infoset.

182

answered Sep 20 '22 16:09

John Saunders

Related questions
                            
                                How do I remove redundant namespace in nested query when using FOR XML PATH
                            
                                Where to add a version to an XSD schema?
                            
                                How to select distinct values from XML document using XPATH?
                            
                                how do you send a SOAP request?
                            
                                How to add header and footer for every pages in xsl-fo to generate pdf
                            
                                Is use="optional" in xsd redundant?
                            
                                CardView has lost margin when inflating
                            
                                Android status bar scrolling up with coordinator layout, leaving status icons overlapping toolbar title
                            
                                EmojiTextView renders Emoji semi-transparent
                            
                                Concatenate XML without type casting to string
                            
                                How to update the variable value in xslt?
                            
                                Scala XML: brace escapes in attributes
                            
                                Are JSON and XML Comparable? [closed]
                            
                                How to test if an attribute exists in some XML
                            
                                How to use hibernate.properties file instead of hibernate.cfg.xml
                            
                                How to make a button that shows the backspace (⌫) character on Android?
                            
                                Is there a JavaScript API for XML binding - analog to JAXB for Java?
                            
                                Parsing simple XML with Nokogiri
                            
                                'xsi' is an undeclared prefix using XmlDocument
                            
                                How to define multiple names for XmlElement field?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With