BeautifulSoup getting href [duplicate]

People also ask

How do you get the href value in BeautifulSoup?

To get href with Python BeautifulSoup, we can use the find_all method. to create soup object with BeautifulSoup class called with the html string. Then we find the a elements with the href attribute returned by calling find_all with 'a' and href set to True .

How do I find all href?

Two ways to find all the anchor tags or href entries on the webpage are: soup. find_all() SoupStrainer class.

You can use find_all in the following way to find every a element that has an href attribute, and print each one:

from BeautifulSoup import BeautifulSoup

html = '''<a href="some_url">next</a>
<span class="class"><a href="another_url">later</a></span>'''

soup = BeautifulSoup(html)

for a in soup.find_all('a', href=True):
    print "Found the URL:", a['href']

The output would be:

Found the URL: some_url
Found the URL: another_url

Note that if you're using an older version of BeautifulSoup (before version 4) the name of this method is findAll. In version 4, BeautifulSoup's method names were changed to be PEP 8 compliant, so you should use find_all instead.

If you want all tags with an href, you can omit the name parameter:

href_tags = soup.find_all(href=True)

Related questions
                            
                                How to search and replace text in a file?
                            
                                Plot correlation matrix using pandas
                            
                                Display image as grayscale using matplotlib
                            
                                Django set default form values
                            
                                Initializing a list to a known number of elements in Python [duplicate]
                            
                                Detect and exclude outliers in a pandas DataFrame
                            
                                How to split a dataframe string column into two columns?
                            
                                What is the difference between Jupyter Notebook and JupyterLab?
                            
                                Python, Matplotlib, subplot: How to set the axis range?
                            
                                Why is 'x' in ('x',) faster than 'x' == 'x'?
                            
                                How to specify "nullable" return type with type hints
                            
                                How to override the [] operator in Python?
                            
                                Counting the number of distinct keys in a dictionary in Python
                            
                                How do I implement interfaces in python?
                            
                                Is generator.next() visible in Python 3?
                            
                                Is it not possible to define multiple constructors in Python? [duplicate]
                            
                                Error message: "'chromedriver' executable needs to be available in the path"
                            
                                How to execute raw SQL in Flask-SQLAlchemy app
                            
                                Apply pandas function to column to create multiple new columns?
                            
                                What's the u prefix in a Python string?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

BeautifulSoup getting href [duplicate]

Tags:

python

beautifulsoup

tags

People also ask

Recent Activity

Donate For Us