Can someone help me parse a html file to get the links for all the images in the file in python? Preferably with out a 3rd party module... Thanks!

You can use Beautiful Soup. I know you said without a 3rd party module. However, this is an ideal tool for parsing HTML. <pre class="prettyprint"><code>import urllib2 from BeautifulSoup import BeautifulSoup page = BeautifulSoup(urllib2.urlopen("http://www.url.com")) page.findAll('img') </code></pre>

Python - Getting all images from an html file

1 Answers

You can use Beautiful Soup. I know you said without a 3rd party module. However, this is an ideal tool for parsing HTML.

import urllib2
from BeautifulSoup import BeautifulSoup
page = BeautifulSoup(urllib2.urlopen("http://www.url.com"))
page.findAll('img')

answered Nov 22 '22 00:11

Russell Dias

Related questions
                            
                                How to convert a grayscale image to heatmap image with Python OpenCV
                            
                                None propagation in Python chained attribute access [duplicate]
                            
                                Is there a pythonic way to sample N consecutive elements from a list or numpy array
                            
                                How do I pass a python list in the post query?
                            
                                Using for...else in Python generators
                            
                                How do I resolve an SRV record in Python?
                            
                                Handling Windows-specific exceptions in platform-independent way
                            
                                Python: Set with only existence check?
                            
                                A more pythonic way of iterating a list while excluding an element each iteration
                            
                                UTF-8 and upper()
                            
                                Colorize PyLint Output?
                            
                                Concurrent downloads - Python
                            
                                Send an email using python script
                            
                                Inserting newline character after every 76 characters in a base64 string
                            
                                python regex: match a string with only one instance of a character
                            
                                Pass each element of a list to a function that takes multiple arguments in Python?
                            
                                To IDE or Not? A beginner developer's dilemma [closed]
                            
                                Comparing Python lists
                            
                                Is possible to know the path of the file of a subclass in python?
                            
                                How do I set the ident string when using logging.SysLogHandler in Python 2.6?

Python - Getting all images from an html file

Tags:

python

image

urllib

user377419

People also ask

1 Answers

Russell Dias

Recent Activity

Donate For Us