The following works when I paste it on the browser: <pre class="prettyprint"><code>http://www.somesite.com/details.pl?urn=2344 </code></pre> But when I try reading the URL with Python nothing happens: <pre class="prettyprint"><code> link = 'http://www.somesite.com/details.pl?urn=2344' f = urllib.urlopen(link) myfile = f.readline() print myfile </code></pre> Do I need to encode the URL, or is there something I'm not seeing?

To answer your question: <pre class="prettyprint"><code>import urllib link = "http://www.somesite.com/details.pl?urn=2344" f = urllib.urlopen(link) myfile = f.read() print(myfile) </code></pre> You need to <code>read()</code>, not <code>readline()</code> EDIT (2018-06-25): Since Python 3, the legacy <code>urllib.urlopen()</code> was replaced by <code>urllib.request.urlopen()</code> (see notes from https://docs.python.org/3/library/urllib.request.html#urllib.request.urlopen for details). If you're using Python 3, see answers by Martin Thoma or i.n.n.m within this question: https://stackoverflow.com/a/28040508/158111 (Python 2/3 compat) https://stackoverflow.com/a/45886824/158111 (Python 3) Or, just get this library here: http://docs.python-requests.org/en/latest/ and seriously use it :) <pre class="prettyprint"><code>import requests link = "http://www.somesite.com/details.pl?urn=2344" f = requests.get(link) print(f.text) </code></pre>

How can I read the contents of an URL with Python?

Tags:

python

The following works when I paste it on the browser:

http://www.somesite.com/details.pl?urn=2344

But when I try reading the URL with Python nothing happens:

 link = 'http://www.somesite.com/details.pl?urn=2344'  f = urllib.urlopen(link)             myfile = f.readline()    print myfile

Do I need to encode the URL, or is there something I'm not seeing?

765

asked Feb 28 '13 14:02

Helen Neely

2 Answers

To answer your question:

import urllib  link = "http://www.somesite.com/details.pl?urn=2344" f = urllib.urlopen(link) myfile = f.read() print(myfile)

You need to read(), not readline()

EDIT (2018-06-25): Since Python 3, the legacy urllib.urlopen() was replaced by urllib.request.urlopen() (see notes from https://docs.python.org/3/library/urllib.request.html#urllib.request.urlopen for details).

If you're using Python 3, see answers by Martin Thoma or i.n.n.m within this question: https://stackoverflow.com/a/28040508/158111 (Python 2/3 compat) https://stackoverflow.com/a/45886824/158111 (Python 3)

Or, just get this library here: http://docs.python-requests.org/en/latest/ and seriously use it :)

import requests  link = "http://www.somesite.com/details.pl?urn=2344" f = requests.get(link) print(f.text)

answered Oct 04 '22 22:10

woozyking

For python3 users, to save time, use the following code,

from urllib.request import urlopen  link = "https://docs.scipy.org/doc/numpy/user/basics.broadcasting.html"  f = urlopen(link) myfile = f.read() print(myfile)

I know there are different threads for error: Name Error: urlopen is not defined, but thought this might save time.

answered Oct 05 '22 00:10

i.n.n.m

Related questions
                            
                                How to merge multiple dicts with same key or different key?
                            
                                Multiprocessing a for loop?
                            
                                Sklearn, gridsearch: how to print out progress during the execution?
                            
                                Is there an analysis speed or memory usage advantage to using HDF5 for large array storage (instead of flat binary files)?
                            
                                Is there a visual profiler for Python? [closed]
                            
                                How can I split my Click commands, each with a set of sub-commands, into multiple files?
                            
                                Copy directory contents into a directory with python [duplicate]
                            
                                ModuleNotFoundError: No module named 'tools.nnwrap'
                            
                                imploding a list for use in a python MySQLDB IN clause
                            
                                Any way to properly pretty-print OrderedDict?
                            
                                How do I run a Python program?
                            
                                Python get current time in right timezone [duplicate]
                            
                                What is the most pythonic way to pop a random element from a list?
                            
                                Pipe subprocess standard output to a variable [duplicate]
                            
                                How to convert a file into a dictionary?
                            
                                Activate a virtualenv with a Python script
                            
                                Check if all values in list are greater than a certain number
                            
                                Generating permutations with repetitions
                            
                                List comprehension: Returning two (or more) items for each item
                            
                                How to add a custom CA Root certificate to the CA Store used by pip in Windows?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With