Using the Python documentation, I found the HTML parser, but I have no idea which library to import to use it. How do I find this out (bearing in mind that it doesn't say on the page)?


HTML parser in Python [closed]

4 Answers

You probably really want BeautifulSoup; check the link for an example.

But in any case:

>>> import HTMLParser  # Python 2 module name (html.parser in Python 3)
>>> h = HTMLParser.HTMLParser()
>>> h.feed('<html></html>')
>>> h.get_starttag_text()
'<html>'
>>> h.close()
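
For Python 3, the same idea might look roughly like the sketch below. It assumes the renamed html.parser module; the TagCollector class and collect_tags function are hypothetical names made up for illustration, not part of the library.

```python
from html.parser import HTMLParser  # Python 3 name of the module


class TagCollector(HTMLParser):
    """Hypothetical helper: records every start tag the parser sees."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        # feed() calls this for each opening tag;
        # attrs is a list of (name, value) pairs.
        self.tags.append((tag, dict(attrs)))


def collect_tags(markup):
    """Parse markup and return the collected (tag, attributes) pairs."""
    parser = TagCollector()
    parser.feed(markup)
    parser.close()
    return parser.tags
```

Subclassing and overriding the handle_* methods is the usual way to get data out of this parser, since the base class ignores everything it parses.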

127

answered Oct 05 '22 23:10

Vinko Vrsalovic

Try:

import HTMLParser

In Python 3.0, the HTMLParser module was renamed to html.parser; you can read about this here.

Python 3.0 and above:

import html.parser

Python 2.2 through 2.7:

import HTMLParser
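
If the same script has to run on both versions, one possible approach is to try the new name first and fall back to the old one. This is only a sketch; last_start_tag_text is a hypothetical helper name used for illustration.

```python
try:
    from html.parser import HTMLParser  # Python 3
except ImportError:
    from HTMLParser import HTMLParser  # Python 2


def last_start_tag_text(markup):
    """Return the raw text of the most recently opened start tag."""
    parser = HTMLParser()
    parser.feed(markup)
    text = parser.get_starttag_text()
    parser.close()
    return text
```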

answered Oct 05 '22 23:10

1077

I would recommend using the Beautiful Soup module instead; it has good documentation.

answered Oct 06 '22 01:10

Swaroop C H

You may be interested in lxml. It is a separate package with C components, but it is the fastest option. It also has a very nice API that lets you easily list the links or forms in an HTML document, sanitize HTML, and more. It can also parse HTML that is not well-formed (this is configurable).

answered Oct 06 '22 00:10

Paweł Hajdan


Tags:

python

import

Teifion
