What should I use to open a url instead of urlopen in urllib3

Tags:

python

beautifulsoup

web-scraping

urllib3

I wanted to write a piece of code like the following:

from bs4 import BeautifulSoup
import urllib2

url = 'http://www.thefamouspeople.com/singers.php'
html = urllib2.urlopen(url)
soup = BeautifulSoup(html)

But I found that I now have to install the urllib3 package.

Moreover, I couldn't find any tutorial or example showing how to rewrite the above code; for example, urllib3 does not have urlopen.

Any explanation or example, please?!

P.S. I'm using Python 3.4.

402

asked Apr 09 '16 11:04

niloofar

3 Answers

urllib3 is a different library from urllib and urllib2. It offers many features beyond those of the urllib modules in the standard library, such as connection re-use, should you need them. The documentation is here: https://urllib3.readthedocs.org/
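For comparison, in Python 3 the functionality of urllib2 lives in the standard library's urllib.request module, which still provides urlopen. The following is a minimal sketch of that route, not a definitive implementation; the helper name fetch_html and the 10-second timeout are assumptions made for illustration.

```python
from urllib.request import urlopen


def fetch_html(url, timeout=10):
    """Return the raw response body as bytes.

    fetch_html is a hypothetical helper name; the timeout value is an
    arbitrary choice for this sketch.
    """
    # urlopen returns a response object that works as a context manager,
    # so the connection is closed once the body has been read.
    with urlopen(url, timeout=timeout) as response:
        return response.read()


# Usage (needs network access and beautifulsoup4 installed):
#   from bs4 import BeautifulSoup
#   html = fetch_html('http://www.thefamouspeople.com/singers.php')
#   soup = BeautifulSoup(html, 'html.parser')
```

This needs no extra installation, which may be enough for a one-off page fetch.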

If you'd like to use urllib3, you'll need to pip install urllib3. A basic example looks like this:

from bs4 import BeautifulSoup
import urllib3

http = urllib3.PoolManager()

url = 'http://www.thefamouspeople.com/singers.php'
response = http.request('GET', url)
# Name the parser explicitly so BeautifulSoup does not have to guess one
soup = BeautifulSoup(response.data, 'html.parser')

155

answered Oct 11 '22 07:10

shazow

You do not have to install urllib3. You can choose any HTTP library that fits your needs and feed the response to BeautifulSoup. The usual choice, though, is requests, because of its rich feature set and convenient API. You can install requests by entering pip install requests on the command line. Here is a basic example:

from bs4 import BeautifulSoup
import requests

url = "url"
response = requests.get(url)

soup = BeautifulSoup(response.content, "html.parser")
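The point that any HTTP library can feed BeautifulSoup can be sketched by keeping the fetching and the parsing separate. The names make_soup and fetch_with_requests, and the 10-second timeout, are hypothetical choices for this illustration; treat it as a sketch under those assumptions.

```python
from bs4 import BeautifulSoup


def fetch_with_requests(url, timeout=10):
    """Fetch a URL with requests and return the body as bytes."""
    # Imported here so the parsing half below works even where requests
    # is not installed (a design choice of this sketch).
    import requests

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # raise on 4xx/5xx instead of parsing an error page
    return response.content


def make_soup(url, fetch=fetch_with_requests):
    """Build a BeautifulSoup tree from whatever bytes the fetcher returns."""
    return BeautifulSoup(fetch(url), "html.parser")
```

Any callable that takes a URL and returns the page bytes can be passed as fetch, so switching from requests to urllib3 or the standard library only changes that one function.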

answered Oct 11 '22 07:10

alecxe

The new urllib3 library has nice documentation here.
To get your desired result, you should do the following:

import urllib3
from bs4 import BeautifulSoup

url = 'http://www.thefamouspeople.com/singers.php'

http = urllib3.PoolManager()
response = http.request('GET', url)
# Name the parser explicitly so BeautifulSoup does not have to guess one
soup = BeautifulSoup(response.data.decode('utf-8'), 'html.parser')

The "decode utf-8" part is optional. It worked without it when I tried, but I included the option anyway.
Source: User Guide
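The decode step is optional because BeautifulSoup accepts bytes and works out the encoding itself. If you do want to decode by hand, hard-coding 'utf-8' can garble pages served in another encoding. One option is to read the charset from the Content-Type header. The sketch below uses only the standard library; the helper names charset_from_content_type and decode_body are hypothetical, and falling back to UTF-8 is an assumption.

```python
from email.message import Message


def charset_from_content_type(content_type, default="utf-8"):
    """Return the charset named in a Content-Type header value, or the default."""
    if not content_type:
        return default
    message = Message()
    message["Content-Type"] = content_type
    # get_content_charset returns the charset parameter in lower case,
    # or the fallback when the header carries no charset.
    return message.get_content_charset(default)


def decode_body(data, content_type=None, default="utf-8"):
    """Decode response bytes using the declared charset, else the default."""
    charset = charset_from_content_type(content_type, default)
    try:
        return data.decode(charset, errors="replace")
    except LookupError:
        # The server named a codec Python does not know.
        return data.decode(default, errors="replace")


# Usage with the urllib3 response from the answer above:
#   html = decode_body(response.data, response.headers.get("Content-Type"))
#   soup = BeautifulSoup(html, "html.parser")
```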

answered Oct 11 '22 07:10

Lan Vukušič
