Python BeautifulSoup give multiple tags to findAll

Tags:

python

beautifulsoup

I'm looking for a way to use findAll to get two tags, in the order they appear on the page.

Currently I have:

import requests import BeautifulSoup  def get_soup(url):     request = requests.get(url)     page = request.text     soup = BeautifulSoup(page)     get_tags = soup.findAll('hr' and 'strong')     for each in get_tags:         print each

If I use that on a page with only 'em' or 'strong' in it then it will get me all of those tags, if I use on one with both it will get 'strong' tags.

Is there a way to do this? My main concern is preserving the order in which the tags are found.

734

asked Dec 18 '13 02:12

DasSnipez

2 Answers

You could pass a list, to find any of the given tags:

tags = soup.find_all(['hr', 'strong'])

164

answered Sep 21 '22 18:09

jfs

Use regular expressions:

import re get_tags = soup.findAll(re.compile(r'(hr|strong)'))

The expression r'(hr|strong)' will find either hr tags or strong tags.

answered Sep 21 '22 18:09

TerryA

Related questions
                            
                                Converting a list to a string [duplicate]
                            
                                How to get ipywidgets working in Jupyter Lab?
                            
                                App created with PyInstaller has a slow startup
                            
                                Python list comprehension - want to avoid repeated evaluation
                            
                                Why does Python 3 need dict.items to be wrapped with list()?
                            
                                Debugging Apache/Django/WSGI Bad Request (400) Error
                            
                                How to check if DynamoDB table exists?
                            
                                Pandas: ValueError: cannot convert float NaN to integer
                            
                                recover dict from 0-d numpy array
                            
                                Jinja2 template not rendering if-elif-else statement properly
                            
                                Check if dataframe column is Categorical
                            
                                Get weekday/day-of-week for Datetime column of DataFrame
                            
                                Get POSIX/Unix time in seconds and nanoseconds in Python?
                            
                                Python: Converting string into decimal number
                            
                                Multiple assignments into a python dictionary
                            
                                can you write a str.replace() using dictionary values in Python?
                            
                                jinja2 how to remove trailing newline
                            
                                Why Java and Python garbage collection methods are different?
                            
                                Error handling in SQLAlchemy
                            
                                Replace part of a string in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With