BeautifulSoup find only elements where an attribute contains a sub-string? Is this possible?


I have a call to find_all() in my BeautifulSoup code. It currently returns every image, but how can I restrict it to images whose src attribute contains the sub-string "placeholder"?

for t in soup.find_all('img'):  # WHERE img.src.contains("placeholder")


asked Jan 30 '15 17:01

Simon Kiely

1 Answer

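You can pass a function as the src keyword argument. It is called with each tag's src value, or None when the tag has no src attribute, which is why the `x and` guard is there: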

for t in soup.find_all('img', src=lambda x: x and 'placeholder' in x):

Or, a regular expression:

import re

for t in soup.find_all('img', src=re.compile(r'placeholder')):

Or, instead of find_all(), use select():

for t in soup.select('img[src*="placeholder"]'):
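
To compare the three approaches side by side, here is a minimal, self-contained sketch. The sample markup and the srcs() helper are invented for illustration and are not part of the question; the built-in "html.parser" backend is assumed so that no extra parser needs to be installed.

```python
import re

from bs4 import BeautifulSoup

# Hypothetical sample markup, made up for this illustration.
HTML = """
<img src="/static/placeholder-1.png">
<img src="/static/photo.jpg">
<img alt="no src attribute">
<img src="/img/PLACEHOLDER.gif">
<img src="/static/big_placeholder.png">
"""

soup = BeautifulSoup(HTML, "html.parser")


def srcs(tags):
    """Hypothetical helper: collect the src value of each matched tag."""
    return [t["src"] for t in tags]


# 1. Function filter: called with the src value, or None if src is missing.
by_function = srcs(soup.find_all("img", src=lambda x: x and "placeholder" in x))

# 2. Regular expression: matches anywhere inside the attribute value.
by_regex = srcs(soup.find_all("img", src=re.compile(r"placeholder")))

# 3. CSS selector: *= is the "contains sub-string" attribute operator.
by_selector = srcs(soup.select('img[src*="placeholder"]'))

# Variation: a case-insensitive match via the re.I flag.
case_insensitive = srcs(soup.find_all("img", src=re.compile(r"placeholder", re.I)))

print(by_function)
print(by_regex)
print(by_selector)
print(case_insensitive)
```

On this sample the first three calls select the same two images; the tag without a src and the upper-case PLACEHOLDER.gif are skipped. If the sub-string contains regex metacharacters such as "." or "?", wrapping it in re.escape() before compiling keeps the regex version a literal match.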

answered Oct 04 '22 22:10

alecxe


Tags: python, html, html-parsing, beautifulsoup
