I have this program that check a website, and I want to know how can I check it via proxy in Python... this is the code, just for example <pre class="prettyprint"><code>while True: try: h = urllib.urlopen(website) break except: print '['+time.strftime('%Y/%m/%d %H:%M:%S')+'] '+'ERROR. Trying again in a few seconds...' time.sleep(5) </code></pre>

By default, <code>urlopen</code> uses the environment variable <code>http_proxy</code> to determine which HTTP proxy to use: <pre class="prettyprint"><code>$ export http_proxy='http://myproxy.example.com:1234' $ python myscript.py # Using http://myproxy.example.com:1234 as a proxy </code></pre> If you instead want to specify a proxy inside your application, you can give a <code>proxies</code> argument to <code>urlopen</code>: <pre class="prettyprint"><code>proxies = {'http': 'http://myproxy.example.com:1234'} print("Using HTTP proxy %s" % proxies['http']) urllib.urlopen("http://www.google.com", proxies=proxies) </code></pre> Edit: If I understand your comments correctly, you want to try several proxies and print each proxy as you try it. How about something like this? <pre class="prettyprint"><code>candidate_proxies = ['http://proxy1.example.com:1234', 'http://proxy2.example.com:1234', 'http://proxy3.example.com:1234'] for proxy in candidate_proxies: print("Trying HTTP proxy %s" % proxy) try: result = urllib.urlopen("http://www.google.com", proxies={'http': proxy}) print("Got URL using proxy %s" % proxy) break except: print("Trying next proxy in 5 seconds") time.sleep(5) </code></pre>

How can I open a website with urllib via proxy in Python?

Tags:

python

proxy

I have this program that check a website, and I want to know how can I check it via proxy in Python...

this is the code, just for example

while True:     try:         h = urllib.urlopen(website)         break     except:         print '['+time.strftime('%Y/%m/%d %H:%M:%S')+'] '+'ERROR. Trying again in a few seconds...'         time.sleep(5)

821

asked Jul 02 '10 18:07

Bruno 'Shady'

2 Answers

By default, urlopen uses the environment variable http_proxy to determine which HTTP proxy to use:

$ export http_proxy='http://myproxy.example.com:1234' $ python myscript.py  # Using http://myproxy.example.com:1234 as a proxy

If you instead want to specify a proxy inside your application, you can give a proxies argument to urlopen:

proxies = {'http': 'http://myproxy.example.com:1234'} print("Using HTTP proxy %s" % proxies['http']) urllib.urlopen("http://www.google.com", proxies=proxies)

Edit: If I understand your comments correctly, you want to try several proxies and print each proxy as you try it. How about something like this?

candidate_proxies = ['http://proxy1.example.com:1234',                      'http://proxy2.example.com:1234',                      'http://proxy3.example.com:1234'] for proxy in candidate_proxies:     print("Trying HTTP proxy %s" % proxy)     try:         result = urllib.urlopen("http://www.google.com", proxies={'http': proxy})         print("Got URL using proxy %s" % proxy)         break     except:         print("Trying next proxy in 5 seconds")         time.sleep(5)

answered Sep 21 '22 13:09

Pär Wieslander

Python 3 is slightly different here. It will try to auto detect proxy settings but if you need specific or manual proxy settings, think about this kind of code:

#!/usr/bin/env python3 import urllib.request  proxy_support = urllib.request.ProxyHandler({'http' : 'http://user:pass@server:port',                                               'https': 'https://...'}) opener = urllib.request.build_opener(proxy_support) urllib.request.install_opener(opener)  with urllib.request.urlopen(url) as response:     # ... implement things such as 'html = response.read()'

Refer also to the relevant section in the Python 3 docs

answered Sep 23 '22 13:09

DomTomCat

Related questions
                            
                                How can I get the screen size in Tkinter?
                            
                                Imputation of missing values for categories in pandas
                            
                                Compute pairwise distance in a batch without replicating tensor in Tensorflow?
                            
                                Merge a list of pandas dataframes
                            
                                Difference between "__method__" and "method" [duplicate]
                            
                                get script directory name - Python [duplicate]
                            
                                Temporarily Disabling Django Caching
                            
                                sklearn : TFIDF Transformer : How to get tf-idf values of given words in document
                            
                                Write a Pandas DataFrame to Google Cloud Storage or BigQuery
                            
                                Is it possible to list all functions in a module? [duplicate]
                            
                                How do I do dependency parsing in NLTK?
                            
                                Python - Use 'set' to find the different items in list
                            
                                Can't find Python executable "python"
                            
                                Prevent TensorFlow from accessing the GPU? [duplicate]
                            
                                Add leading Zero Python [duplicate]
                            
                                pandas replace multiple values one column
                            
                                Protected method in python [duplicate]
                            
                                PySerial non-blocking read loop
                            
                                Renaming multiple files in a directory using Python
                            
                                Change Django ModelChoiceField to show users' full names rather than usernames

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With