XML Information Set
This article may be too technical for most readers to understand. Please help improve it to make it understandable to non-experts, without removing the technical details. (February 2015) (Learn how and when to remove this template message)
XML Information Set (XML Infoset) is a W3C specification describing an abstract data model of an XML document in terms of a set of information items. The definitions in the XML Information Set specification are meant to be used in other specifications that need to refer to the information in a well-formed XML document.
An information set can contain up to eleven different types of information items:
- The Document Information Item (always present)
- Element Information Items
- Attribute Information Items
- Processing Instruction Information Items
- Unexpanded Entity Reference Information Items
- Character Information Items
- Comment Information Items
- The Document Type Declaration Information Item
- Unparsed Entity Information Items
- Notation Information Items
- Namespace Information Items
XML was initially developed without a formal definition of its infoset. This was only formalised by later work beginning in 1999, first published as a separate W3C Working Draft at the end of December that year. Infoset recommendation Second Edition was adopted on 4 February, 2004. If a 2.0 version of the XML standard is ever published, it is likely that this would absorb the Infoset recommendation as an integral part of that standard.
Infoset augmentation or infoset modification refers to the process of modifying the infoset during schema validation, for example by adding default attributes. The augmented infoset is called the post-schema-validation infoset, or PSVI. 
Infoset augmentation is somewhat controversial, with claims that it is a violation of modularity and tends to cause interoperability problems, since applications get different information depending on whether or not validation has been performed. 
- W3C XML Infoset
- "XML Information Set" (Working Draft ed.). W3C. 20 December 1999.
- "XML Information Set" (Second ed.). W3C. 4 February 2004.
- XML Schema 1.1 Part 1: Structures
- RELAX NG and W3C XML Schema Archived September 27, 2007, at the Wayback Machine., James Clark, 4 Jun 2002
- "Extensible Markup Language (XML)". W3C. Retrieved 9 October 2014.
- XmlCsvReader Implementation
- Apache CXF JSON Support
- "XML Information set recommendation (Second Edition)". W3C. 4 February 2004.
|This computing article is a stub. You can help Wikipedia by expanding it.|