19. Ferramentas de Processamento de Markup Estruturado¶
O Python suporta uma variedade de módulos para trabalhar com vários formatos de marcação de dados estruturados. Isso inclui módulos para trabalhar com o Standard Generalized Markup Language (SGML) e o Hypertext Markup Language (HTML) e várias interfaces para trabalhar com o XML (Extensible Markup Language).
It is important to note that modules in the xml
package require that
there be at least one SAX-compliant XML parser available. Starting with Python
2.3, the Expat parser is included with Python, so the xml.parsers.expat
module will always be available. You may still want to be aware of the PyXML
add-on package; that package provides an
extended set of XML libraries for Python.
The documentation for the xml.dom
and xml.sax
packages are the
definition of the Python bindings for the DOM and SAX interfaces.
- 19.1.
HTMLParser
— Simple HTML and XHTML parser - 19.2.
sgmllib
— Simple SGML parser - 19.3.
htmllib
— A parser for HTML documents - 19.4.
htmlentitydefs
— Definitions of HTML general entities - 19.5. XML Processing Modules
- 19.6. XML vulnerabilities
- 19.7. API XML ElementTree
- 19.8.
xml.dom
— The Document Object Model API- 19.8.1. Conteúdo do Módulo
- 19.8.2. Objects in the DOM
- 19.8.2.1. DOMImplementation Objects
- 19.8.2.2. Objetos Node
- 19.8.2.3. NodeList Objects
- 19.8.2.4. DocumentType Objects
- 19.8.2.5. Document Objects
- 19.8.2.6. Element Objects
- 19.8.2.7. Attr Objects
- 19.8.2.8. NamedNodeMap Objects
- 19.8.2.9. Comment Objects
- 19.8.2.10. Text and CDATASection Objects
- 19.8.2.11. ProcessingInstruction Objects
- 19.8.2.12. Exceções
- 19.8.3. Conformance
- 19.9.
xml.dom.minidom
— Minimal DOM implementation - 19.10.
xml.dom.pulldom
— Support for building partial DOM trees - 19.11.
xml.sax
— Support for SAX2 parsers - 19.12.
xml.sax.handler
— Base classes for SAX handlers - 19.13.
xml.sax.saxutils
— SAX Utilities - 19.14.
xml.sax.xmlreader
— Interface for XML parsers - 19.15.
xml.parsers.expat
— Fast XML parsing using Expat