Getting all Links from a page Beautiful Soup

Tags:

I am using beautifulsoup to get all the links from a page. My code is:

import requests
from bs4 import BeautifulSoup


url = 'http://www.acontecaeventos.com.br/marketing-promocional-sao-paulo'
r = requests.get(url)
html_content = r.text
soup = BeautifulSoup(html_content, 'lxml')

soup.find_all('href')

All that I get is:

[]

How can I get a list of all the href links on that page?

209

asked Sep 29 '17 14:09

user1922364

1 Answers

You are telling the find_all method to find href tags, not attributes.

You need to find the <a> tags, they're used to represent link elements.

links = soup.find_all('a')

Later you can access their href attributes like this:

link = links[0]          # get the first link in the entire page
url  = link['href']      # get value of the href attribute
url  = link.get('href')  # or like this

146

answered Sep 20 '22 18:09

Anonta

Related questions
                            
                                How to enable python repl autocomplete and still allow new line tabs
                            
                                How to store a Python dictionary as an Environment Variable
                            
                                How to return data with 403 error in Django Rest Framework?
                            
                                subprocess call ffmpeg (command line)
                            
                                Where is Qt designer app on Mac + Anaconda?
                            
                                Count how many times each row is present in numpy.array
                            
                                How to get one number specific times in an array python
                            
                                Multiple threads writing to the same CSV in Python
                            
                                How to sort an array of objects by datetime in Python? [duplicate]
                            
                                Call another function and optionally keep default arguments
                            
                                How to round dates to week starts in Pandas
                            
                                Python "ValueError: incomplete format" upon print("stuff %" % "thingy")
                            
                                Ensure the gensim generate the same Word2Vec model for different runs on the same data
                            
                                Find local maximums in numpy array
                            
                                pandas.Series() Creation using DataFrame Columns returns NaN Data entries
                            
                                Filling dict with NA values to allow conversion to pandas dataframe
                            
                                When am I supposed to use del in python?
                            
                                requests.get returns 403 while the same url works in browser
                            
                                Is there a method like .replace() for list in python? [duplicate]
                            
                                How to get superuser details in Django?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Getting all Links from a page Beautiful Soup

Tags:

python

html-parsing

beautifulsoup

web-scraping

user1922364

People also ask

1 Answers

Anonta

Recent Activity

Donate For Us