HTTP Error 406: Not Acceptable Python urllib2

Tags: python, urllib2, python-2.7

I get the following error with the code below.

HTTP Error 406: Not Acceptable Python urllib2

This is my first step before I use beautifulsoup to parse the page.

import urllib2
opener = urllib2.build_opener()
opener.addheaders = [('User-agent', 'Mozilla/5.0')]
url = "http://www.choicemoney.us/retail.php"
response = opener.open(url)

All help greatly appreciated.

296

asked Jan 16 '16 22:01

cflanagan17

2 Answers

The resource identified by the request is only capable of generating response entities which have content characteristics not acceptable according to the accept headers sent in the request. [RFC2616]

Based on the code and what the RFC describes, I assume the User-Agent header needs a complete, correctly formed value; the bare Mozilla/5.0 sent by your code is apparently not accepted by this server.

These are examples of complete values:

Mozilla/5.0 (X11; U; Linux i686) Gecko/20071127 Firefox/2.0.0.11
Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2228.0 Safari/537.36
Mozilla/5.0 (Macintosh; Intel Mac OS X 10_9_3) AppleWebKit/537.75.14 (KHTML, like Gecko) Version/7.0.3 Safari/7046A194A

Replace the opener.addheaders line in your code with the following:

opener.addheaders = [('User-agent', 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_9_3) AppleWebKit/537.75.14 (KHTML, like Gecko) Version/7.0.3 Safari/7046A194A')]
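For reference, here is a minimal sketch of the whole request with that header in place, plus handling for the HTTPError that the question ran into. The names make_opener, fetch and BROWSER_UA are hypothetical helpers introduced for illustration, and the Python 3 fallback import is an assumption added so the sketch runs on either version. Whether this particular server accepts the header has not been verified here.

```python
try:
    import urllib2 as urlreq              # Python 2, as in the question
    from urllib2 import HTTPError
except ImportError:
    import urllib.request as urlreq       # Python 3 fallback (assumption, not in the original)
    from urllib.error import HTTPError

# One of the complete User-Agent strings listed above.
BROWSER_UA = ("Mozilla/5.0 (Macintosh; Intel Mac OS X 10_9_3) "
              "AppleWebKit/537.75.14 (KHTML, like Gecko) "
              "Version/7.0.3 Safari/7046A194A")


def make_opener(user_agent=BROWSER_UA):
    """Return an opener that sends the given User-Agent on every request."""
    opener = urlreq.build_opener()
    opener.addheaders = [("User-agent", user_agent)]
    return opener


def fetch(url, opener=None):
    """Return (status, body); on an HTTP error return (error code, None)."""
    if opener is None:
        opener = make_opener()
    try:
        response = opener.open(url)
        return response.getcode(), response.read()
    except HTTPError as err:
        # err.code holds the HTTP status, e.g. 406 for the error in the question.
        return err.code, None


# Usage (performs a real network request, so it is left commented out):
# status, html = fetch("http://www.choicemoney.us/retail.php")
```

Returning the status code instead of letting the exception propagate makes it easy to check whether the header change had any effect before passing the body on to a parser.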

184

answered Sep 30 '22 15:09

ipinak

I believe @ipinak's answer is correct.

urllib2 actually sends a default User-Agent of its own (Python-urllib/ followed by the Python version), and that works here. So if you delete opener.addheaders = [('User-agent', 'Mozilla/5.0')], the response should have status code 200.
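To see which User-Agent urllib2 sends when addheaders is left untouched, the opener's default headers can be inspected without making any request. This is a minimal sketch; the Python 3 fallback import is an assumption added so it runs on either version, and default_headers is just an illustrative name.

```python
try:
    import urllib2 as urlreq             # Python 2, as in the question
except ImportError:
    import urllib.request as urlreq      # Python 3 fallback (assumption, not in the original)

# A freshly built opener already carries one default header.
opener = urlreq.build_opener()
default_headers = dict(opener.addheaders)

# Shows a single 'User-agent' entry of the form 'Python-urllib/<version>'.
print(default_headers)
```

Assigning a new list to opener.addheaders replaces this default rather than adding to it, which is why the question's code ends up sending only the bare Mozilla/5.0 value.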

I also recommend the popular requests library for jobs like this, as its API is much easier to use.

import requests

url = "http://www.choicemoney.us/retail.php"
resp = requests.get(url)
print(resp.status_code)  # 200
print(resp.content)      # raw HTML; pass this to BeautifulSoup

answered Sep 30 '22 16:09

ohw
