Scraper in Python gives "Access Denied"

I'm trying to write a scraper in Python to get some information from a page, such as the titles of the offers that appear on this page:
https://www.justdial.com/Panipat/Saree-Retailers/nct-10420585

So far I am using this code:

import bs4
import requests

def extract_source(url):
    # Download the page and return its HTML as text
    source = requests.get(url).text
    return source

def extract_data(source):
    # Parse the HTML and print every <title> element
    soup = bs4.BeautifulSoup(source, 'html.parser')
    names = soup.find_all('title')
    for i in names:
        print(i)

extract_data(extract_source('https://www.justdial.com/Panipat/Saree-Retailers/nct-10420585'))

But when I run this code, it prints the title of an error page instead of the real one:

<title>Access Denied</title>

What can I do to solve this?

asked Feb 01 '17 14:02

duca

1 Answer

As mentioned in the comments, you need to specify a User-Agent that the site accepts and pass it in the request headers:

import requests

def extract_source(url):
    # Send a browser-like User-Agent instead of the requests default
    headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64; rv:50.0) Gecko/20100101 Firefox/50.0'}
    source = requests.get(url, headers=headers).text
    return source
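
To see why the header matters, it can help to look at what requests sends when no User-Agent is given. The sketch below is only an illustration, not part of the original answer: `make_session` and `BROWSER_UA` are hypothetical names, the 10-second timeout is an arbitrary choice, and it assumes the site answers a blocked request with a 4xx status (if it returns 200 with an "Access Denied" page, `raise_for_status()` will not catch it).

```python
import requests

# User-Agent string taken from the answer above; any current browser
# string could be substituted (assumption: the site only checks this header).
BROWSER_UA = ('Mozilla/5.0 (Windows NT 6.1; WOW64; rv:50.0) '
              'Gecko/20100101 Firefox/50.0')


def make_session(user_agent=BROWSER_UA):
    # Hypothetical helper: a Session sends its headers with every request,
    # so the User-Agent only has to be set once.
    session = requests.Session()
    session.headers.update({'User-Agent': user_agent})
    return session


def extract_source(url, session=None):
    session = session or make_session()
    response = session.get(url, timeout=10)
    # Raise requests.HTTPError on a 4xx/5xx status instead of
    # silently returning the text of an error page.
    response.raise_for_status()
    return response.text


# The default User-Agent, of the form 'python-requests/<version>',
# which some sites reject outright.
print(requests.utils.default_user_agent())
```

Calling `extract_source(url)` then works as in the answer, with the difference that a rejected request surfaces as an exception rather than as an unexpected `<title>`.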

answered Sep 19 '22 11:09

Andersson

Tags: python, beautifulsoup, python-requests
