- Method resolution order:
- CorrectLinks
- HTMLParser.HTMLParser
- markupbase.ParserBase
Methods defined here:
- handle_charref(self, name)
- handle_comment(self, name)
- handle_data(self, data)
- handle_decl(self, decl)
- handle_endtag(self, tag)
- handle_entityref(self, name)
- handle_pi(self, data)
- handle_startendtag(self, tag, attrs)
- handle_starttag(self, tag, attrs, startend=False)
- process(self, in_file, out_file)
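The listing gives only signatures for CorrectLinks, so its actual behaviour is not documented here. As a rough sketch of how a subclass with these overrides might work, the example below echoes every parser event back out and rewrites `href` values along the way. The class name `LinkRewriter`, the `fix_href` callback, and the reading of `process(in_file, out_file)` as "parse one file object, write to another" are assumptions, not the original implementation. The sketch also uses Python 3's `html.parser.HTMLParser` in place of the Python 2 `HTMLParser.HTMLParser` module named above.

```python
import io
from html import escape
from html.parser import HTMLParser


class LinkRewriter(HTMLParser):
    """Hypothetical stand-in for CorrectLinks: echo input, rewriting href values."""

    def __init__(self, fix_href):
        # convert_charrefs=False keeps handle_charref/handle_entityref active,
        # so character references can be written back unchanged.
        super().__init__(convert_charrefs=False)
        self.fix_href = fix_href  # assumed callback: old href -> new href
        self.out = None

    def handle_starttag(self, tag, attrs, startend=False):
        parts = [tag]
        for name, value in attrs:
            if value is None:
                parts.append(name)  # attribute without a value
                continue
            if name == "href":
                value = self.fix_href(value)
            # The parser hands over unescaped values, so escape them again.
            parts.append('%s="%s"' % (name, escape(value, quote=True)))
        self.out.write("<%s%s>" % (" ".join(parts), " /" if startend else ""))

    def handle_startendtag(self, tag, attrs):
        self.handle_starttag(tag, attrs, startend=True)

    def handle_endtag(self, tag):
        self.out.write("</%s>" % tag)

    def handle_data(self, data):
        self.out.write(data)

    def handle_charref(self, name):
        self.out.write("&#%s;" % name)

    def handle_entityref(self, name):
        self.out.write("&%s;" % name)

    def handle_comment(self, data):
        self.out.write("<!--%s-->" % data)

    def handle_decl(self, decl):
        self.out.write("<!%s>" % decl)

    def handle_pi(self, data):
        self.out.write("<?%s>" % data)

    def process(self, in_file, out_file):
        # Assumed semantics: read everything from in_file, write to out_file.
        self.out = out_file
        self.feed(in_file.read())
        self.close()


src = io.StringIO('<p>See <a href="old/page.html">A &amp; B</a><br/></p>')
dst = io.StringIO()
LinkRewriter(lambda href: href.replace("old/", "new/")).process(src, dst)
result = dst.getvalue()
```

Routing `handle_startendtag` through `handle_starttag` with `startend=True` mirrors the extra keyword in the signature listed above and keeps the attribute handling in one place.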
Methods inherited from HTMLParser.HTMLParser:
- __init__(self)
- Initialize and reset this instance.
- check_for_whole_start_tag(self, i)
- # Internal -- check to see if we have a complete starttag; return end
# or -1 if incomplete.
- clear_cdata_mode(self)
- close(self)
- Handle any buffered data.
- error(self, message)
- feed(self, data)
- Feed data to the parser.
Call this as often as you want, with as little or as much text
as you want (may include '\n').
- get_starttag_text(self)
- Return full source of start tag: '<...>'.
- goahead(self, end)
- # Internal -- handle data as far as reasonable. May leave state
# and data to be processed by a subsequent call. If 'end' is
# true, force handling all data as if followed by EOF marker.
- parse_endtag(self, i)
- # Internal -- parse endtag, return end or -1 if incomplete
- parse_pi(self, i)
- # Internal -- parse processing instr, return end or -1 if not terminated
- parse_starttag(self, i)
- # Internal -- handle starttag, return end or -1 if not terminated
- reset(self)
- Reset this instance. Loses all unprocessed data.
- set_cdata_mode(self)
- unescape(self, s)
- # Internal -- helper to remove special character quoting
- unknown_decl(self, data)
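The `feed`, `close` and `get_starttag_text` entries above can be illustrated with a small sketch, assuming Python 3's `html.parser.HTMLParser` rather than the Python 2 module in this listing. The `TagLogger` class and its `events` list are hypothetical names. The input is deliberately split in the middle of a start tag to show that an incomplete tag stays buffered until more text arrives.

```python
from html.parser import HTMLParser


class TagLogger(HTMLParser):
    """Hypothetical helper that records start tags and text as they are parsed."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        # get_starttag_text() returns the tag exactly as it appeared in the source.
        self.events.append(("start", tag, self.get_starttag_text()))

    def handle_data(self, data):
        self.events.append(("data", data))


parser = TagLogger()
parser.feed('<a hr')                 # incomplete start tag: nothing is reported yet
events_after_first_chunk = list(parser.events)
parser.feed('ef="x.html">link')      # the tag is now complete
parser.close()                       # handle anything still buffered
```

Calling `reset()` afterwards would discard any unprocessed input and let the same instance be reused for another document.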
Data and other attributes inherited from HTMLParser.HTMLParser:
- CDATA_CONTENT_ELEMENTS = ('script', 'style')
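As a hedged illustration of what `CDATA_CONTENT_ELEMENTS` means in practice: inside `script` and `style` elements the parser does not look for markup, so a stray `<` is passed through as data. The sketch assumes Python 3's `html.parser.HTMLParser`; `DataCollector` is a hypothetical name.

```python
from html.parser import HTMLParser


class DataCollector(HTMLParser):
    """Hypothetical helper that records tag names and raw text."""

    def __init__(self):
        super().__init__()
        self.tags = []
        self.data = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)

    def handle_data(self, data):
        self.data.append(data)


collector = DataCollector()
collector.feed("<script>if (a<b) {}</script>")
collector.close()
# Join the pieces: the parser may deliver text in more than one call.
script_text = "".join(collector.data)
```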
Methods inherited from markupbase.ParserBase:
- getpos(self)
- Return current line number and offset.
- parse_comment(self, i, report=1)
- # Internal -- parse comment, return length or -1 if not terminated
- parse_declaration(self, i)
- # Internal -- parse declaration (for use by subclasses).
- parse_marked_section(self, i, report=1)
- # Internal -- parse a marked section
# Override this to handle MS-word extension syntax <![if word]>content<![endif]>
- updatepos(self, i, j)
- # Internal -- update line number and offset. This should be
# called for each piece of data exactly once, in order -- in other
# words the concatenation of all the input strings to this
# function should be exactly the entire input.
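`getpos` is the one ParserBase method in this group that is not marked internal. A minimal sketch of using it from a handler, assuming Python 3's `html.parser.HTMLParser` (`PositionLogger` is a hypothetical name): the line number is 1-based and the offset is a 0-based column.

```python
from html.parser import HTMLParser


class PositionLogger(HTMLParser):
    """Hypothetical helper that records where each start tag begins."""

    def __init__(self):
        super().__init__()
        self.positions = []

    def handle_starttag(self, tag, attrs):
        # getpos() returns (line number, offset) of the construct being handled.
        self.positions.append((tag, self.getpos()))


logger = PositionLogger()
logger.feed("<html>\n  <body>\n")
logger.close()
```

This is handy for error messages that need to point back at the source document.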