Previous Topic Next topic Print topic


XML document coded character sets

The PLISAX built-in subroutines only work with XML documents in WIDECHAR encoded using Unicode UTF-16, or in CHARACTER encoded using one certain supported single-byte character sets. The parser uses up to three sources of information about the encoding of your XML document, and signals an exception XML event if it discovers any conflicts between these sources:

If the XML document begins with an XML declaration that includes an encoding declaration specifying one of the supported code pages listed below, the parser honors the encoding declaration if it does not conflict with either the basic document encoding or the encoding information from the PLISAX built-in. If the XML document does not have an XML declaration at all, or if the XML declaration omits the encoding declaration, the parser uses the encoding information from the PLISAX built-in to process the document, as long as it does not conflict with the basic document encoding.

Previous Topic Next topic Print topic