Structured Markup Processing Tools
Python supports a variety of modules for working with structured data markup formats. These include modules for the Standard Generalized Markup Language (SGML) and the Hypertext Markup Language (HTML), as well as several interfaces for the Extensible Markup Language (XML).
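As a rough illustration of the two families of modules listed below, the following sketch parses a small HTML fragment with `html.parser` and a small XML document with `xml.etree.ElementTree`. The `LinkCollector` class, the sample markup, and the `items` mapping are hypothetical names chosen for this example only; they are not part of the standard library.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class LinkCollector(HTMLParser):
    """Hypothetical helper: records the href of every <a> start tag."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag.
        if tag == "a":
            for name, value in attrs:
                if name == "href":
                    self.links.append(value)


# HTML: event-driven parsing, tolerant of imperfect markup.
collector = LinkCollector()
collector.feed('<p>See <a href="https://example.org/">this page</a>.</p>')
collector.close()

# XML: build an in-memory element tree and walk it.
root = ET.fromstring(
    "<catalog><item id='1'>tea</item><item id='2'>coffee</item></catalog>"
)
items = {item.get("id"): item.text for item in root.iter("item")}

print(collector.links)
print(items)
```

The HTML parser reports tags through callbacks as it reads the input, while ElementTree returns a tree that can be queried after parsing; which style fits better depends on the size and shape of the data.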
html: HyperText Markup Language support
html.parser: Simple HTML and XHTML parser
html.entities: Definitions of HTML general entities
- XML Processing Modules
xml.etree.ElementTree: The ElementTree XML API
xml.dom: The Document Object Model API
xml.dom.minidom: Minimal DOM implementation
xml.dom.pulldom: Support for building partial DOM trees
xml.sax: Support for SAX2 parsers
xml.sax.handler: Base classes for SAX handlers
ContentHandler
DTDHandler
EntityResolver
ErrorHandler
LexicalHandler
feature_namespaces
feature_namespace_prefixes
feature_string_interning
feature_validation
feature_external_ges
feature_external_pes
all_features
property_lexical_handler
property_declaration_handler
property_dom_node
property_xml_string
all_properties
- ContentHandler Objects
- DTDHandler Objects
- EntityResolver Objects
- ErrorHandler Objects
- LexicalHandler Objects
xml.sax.saxutils: SAX Utilities
xml.sax.xmlreader: Interface for XML parsers
xml.parsers.expat: Fast XML parsing using Expat
ExpatError
error
XMLParserType
ErrorString()
ParserCreate()
- XMLParser Objects
Parse()
ParseFile()
SetBase()
GetBase()
GetInputContext()
ExternalEntityParserCreate()
SetParamEntityParsing()
UseForeignDTD()
SetReparseDeferralEnabled()
GetReparseDeferralEnabled()
buffer_size
buffer_text
buffer_used
ordered_attributes
specified_attributes
ErrorByteIndex
ErrorCode
ErrorColumnNumber
ErrorLineNumber
CurrentByteIndex
CurrentColumnNumber
CurrentLineNumber
XmlDeclHandler()
StartDoctypeDeclHandler()
EndDoctypeDeclHandler()
ElementDeclHandler()
AttlistDeclHandler()
StartElementHandler()
EndElementHandler()
ProcessingInstructionHandler()
CharacterDataHandler()
UnparsedEntityDeclHandler()
EntityDeclHandler()
NotationDeclHandler()
StartNamespaceDeclHandler()
EndNamespaceDeclHandler()
CommentHandler()
StartCdataSectionHandler()
EndCdataSectionHandler()
DefaultHandler()
DefaultHandlerExpand()
NotStandaloneHandler()
ExternalEntityRefHandler()
- ExpatError Exceptions
- Example
- Content Model Descriptions
- Expat error constants
codes
messages
XML_ERROR_ASYNC_ENTITY
XML_ERROR_ATTRIBUTE_EXTERNAL_ENTITY_REF
XML_ERROR_BAD_CHAR_REF
XML_ERROR_BINARY_ENTITY_REF
XML_ERROR_DUPLICATE_ATTRIBUTE
XML_ERROR_INCORRECT_ENCODING
XML_ERROR_INVALID_TOKEN
XML_ERROR_JUNK_AFTER_DOC_ELEMENT
XML_ERROR_MISPLACED_XML_PI
XML_ERROR_NO_ELEMENTS
XML_ERROR_NO_MEMORY
XML_ERROR_PARAM_ENTITY_REF
XML_ERROR_PARTIAL_CHAR
XML_ERROR_RECURSIVE_ENTITY_REF
XML_ERROR_SYNTAX
XML_ERROR_TAG_MISMATCH
XML_ERROR_UNCLOSED_TOKEN
XML_ERROR_UNDEFINED_ENTITY
XML_ERROR_UNKNOWN_ENCODING
XML_ERROR_UNCLOSED_CDATA_SECTION
XML_ERROR_EXTERNAL_ENTITY_HANDLING
XML_ERROR_NOT_STANDALONE
XML_ERROR_UNEXPECTED_STATE
XML_ERROR_ENTITY_DECLARED_IN_PE
XML_ERROR_FEATURE_REQUIRES_XML_DTD
XML_ERROR_CANT_CHANGE_FEATURE_ONCE_PARSING
XML_ERROR_UNBOUND_PREFIX
XML_ERROR_UNDECLARING_PREFIX
XML_ERROR_INCOMPLETE_PE
XML_ERROR_XML_DECL
XML_ERROR_TEXT_DECL
XML_ERROR_PUBLICID
XML_ERROR_SUSPENDED
XML_ERROR_NOT_SUSPENDED
XML_ERROR_ABORTED
XML_ERROR_FINISHED
XML_ERROR_SUSPEND_PE
XML_ERROR_RESERVED_PREFIX_XML
XML_ERROR_RESERVED_PREFIX_XMLNS
XML_ERROR_RESERVED_NAMESPACE_URI
XML_ERROR_INVALID_ARGUMENT
XML_ERROR_NO_BUFFER
XML_ERROR_AMPLIFICATION_LIMIT_BREACH
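
Several of the `xml.parsers.expat` names listed above (`ParserCreate()`, the handler attributes, `ExpatError`, `ErrorString()`, and the `errors.codes` mapping) can be seen working together in the minimal sketch below. The helper functions `collect_events` and `describe_error` and the sample documents are hypothetical and exist only for this illustration.

```python
import xml.parsers.expat
from xml.parsers.expat import errors


def collect_events(data):
    """Hypothetical helper: parse `data` and return the handler events seen."""
    events = []
    parser = xml.parsers.expat.ParserCreate()
    # Ask Expat to coalesce adjacent character data into single callbacks.
    parser.buffer_text = True
    parser.StartElementHandler = lambda name, attrs: events.append(
        ("start", name, attrs)
    )
    parser.EndElementHandler = lambda name: events.append(("end", name))
    parser.CharacterDataHandler = lambda text: events.append(("text", text))
    # The second argument marks this as the final chunk of input.
    parser.Parse(data, True)
    return events


def describe_error(data):
    """Hypothetical helper: return (code, line, message) for malformed input."""
    try:
        collect_events(data)
    except xml.parsers.expat.ExpatError as exc:
        return exc.code, exc.lineno, xml.parsers.expat.ErrorString(exc.code)
    return None


events = collect_events('<root><child name="a">hi</child></root>')
failure = describe_error("<root><child></root>")

for event in events:
    print(event)
print(failure)

# errors.codes maps an error description string to its numeric code.
print(failure[0] == errors.codes[errors.XML_ERROR_TAG_MISMATCH])
```

Comparing `exc.code` against `errors.codes[...]` avoids hard-coding numeric values, which is why the `XML_ERROR_*` constants are exposed as description strings rather than integers.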