How to get round the HTTP Error 403: Forbidden with urllib.request using Python 3

Tags:

Hi not every time but sometimes when trying to gain access to the LSE code I am thrown the every annoying HTTP Error 403: Forbidden message.

Anyone know how I can overcome this issue only using standard python modules (so sadly no beautiful soup).

import urllib.request

url = "http://www.londonstockexchange.com/exchange/prices-and-markets/stocks/indices/ftse-indices.html"
infile = urllib.request.urlopen(url) # Open the URL
data = infile.read().decode('ISO-8859-1') # Read the content as string decoded with ISO-8859-1

print(data) # Print the data to the screen

However every now and then this is the error I am shown:

Traceback (most recent call last):
  File "/home/ubuntu/workspace/programming_practice/Assessment/Summative/removingThe403Error.py", line 5, in <module>
    webpage = urlopen(req).read().decode('ISO-8859-1')
  File "/usr/lib/python3.4/urllib/request.py", line 161, in urlopen
    return opener.open(url, data, timeout)
  File "/usr/lib/python3.4/urllib/request.py", line 469, in open
    response = meth(req, response)
  File "/usr/lib/python3.4/urllib/request.py", line 579, in http_response
    'http', request, response, code, msg, hdrs)
  File "/usr/lib/python3.4/urllib/request.py", line 507, in error
    return self._call_chain(*args)
  File "/usr/lib/python3.4/urllib/request.py", line 441, in _call_chain
    result = func(*args)
  File "/usr/lib/python3.4/urllib/request.py", line 587, in http_error_default
    raise HTTPError(req.full_url, code, msg, hdrs, fp)
urllib.error.HTTPError: HTTP Error 403: Forbidden


Process exited with code: 1

Link to a list of all the modules that are okay: https://docs.python.org/3.4/py-modindex.html

Many thanks in advance.

654

asked Mar 17 '17 16:03

JoeTilsed

1 Answers

This is probably due to mod_security. You need to spoof by opening the URL as a browser, not as python urllib.

Here, I corrected your code:

import urllib.request

url = "http://www.londonstockexchange.com/exchange/prices-and-markets/stocks/indices/ftse-indices.html"

# Open the URL as Browser, not as python urllib
page=urllib.request.Request(url,headers={'User-Agent': 'Mozilla/5.0'}) 
infile=urllib.request.urlopen(page).read()
data = infile.decode('ISO-8859-1') # Read the content as string decoded with ISO-8859-1

print(data) # Print the data to the screen

Next, you can use BeautifulSoup to scrape the HTML.

160

answered Oct 15 '22 09:10

Kardi Teknomo

Related questions
                            
                                matplotlib add_subplot odd number of plots
                            
                                Pandas: AttributeError: 'module' object has no attribute 'getLogger'
                            
                                extract hash seed in unit testing
                            
                                Why is 10/3 equal to 3.3333333333333335 instead of ...332 or ..334?
                            
                                Weighted bins in a distribution hist plot
                            
                                Detect a changed password in Django
                            
                                using best params from gridsearchcv
                            
                                sudo and pip not on the same path
                            
                                Python selenium not work with WebDriverWait
                            
                                Considerations for using ReLU as activation function
                            
                                How to rearrange one list based on a second list of indices [duplicate]
                            
                                python & postgresql: reliably check for updates in a specific table
                            
                                Adding global attribute using xarray
                            
                                Difference between Tensorflow convolution and numpy convolution
                            
                                Escape analysis
                            
                                Pandas - Counting quantity of commas in character field
                            
                                I deleted my dict, but my dict_keys don't mind, why is that?
                            
                                Get the inverse function of a polyfit in numpy
                            
                                error using gmail api tuto using python 3 "except errors.HttpError, error:"
                            
                                Nested merges in pandas with suffixes

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to get round the HTTP Error 403: Forbidden with urllib.request using Python 3

Tags:

python

http-status-code-403

python-3.x

urllib

urllib3

JoeTilsed

People also ask

1 Answers

Kardi Teknomo

Recent Activity

Donate For Us