The goal is to download a file from the internet, and create from it a file object, or a file like object without ever having it touch the hard drive. This is just for my knowledge, wanting to know if its possible or practical, particularly because I would like to see if I can circumvent having to code a file deletion line. This is how I would normally download something from the web, and map it to memory: <pre class="prettyprint"><code>import requests import mmap u = requests.get("http://www.pythonchallenge.com/pc/def/channel.zip") with open("channel.zip", "wb") as f: # I want to eliminate this, as this writes to disk f.write(u.content) with open("channel.zip", "r+b") as f: # and his as well, because it reads from disk mm = mmap.mmap(f.fileno(), 0) mm.seek(0) print mm.readline() mm.close() # question: if I do not include this, does this become a memory leak? </code></pre>

Your answer is <code>u.content</code>. The content is in the memory. Unless you write it to a file, it won’t be stored on disk.

Python - Download File Using Requests, Directly to Memory

Tags:

The goal is to download a file from the internet, and create from it a file object, or a file like object without ever having it touch the hard drive. This is just for my knowledge, wanting to know if its possible or practical, particularly because I would like to see if I can circumvent having to code a file deletion line.

This is how I would normally download something from the web, and map it to memory:

import requests import mmap  u = requests.get("http://www.pythonchallenge.com/pc/def/channel.zip")  with open("channel.zip", "wb") as f: # I want to eliminate this, as this writes to disk     f.write(u.content)  with open("channel.zip", "r+b") as f: # and his as well, because it reads from disk     mm = mmap.mmap(f.fileno(), 0)     mm.seek(0)     print mm.readline()     mm.close() # question: if I do not include this, does this become a memory leak?

963

asked Mar 12 '14 01:03

Anon

2 Answers

r.raw (HTTPResponse) is already a file-like object (just pass stream=True):

#!/usr/bin/env python import sys import requests # $ pip install requests from PIL import Image # $ pip install pillow  url = sys.argv[1] r = requests.get(url, stream=True) r.raw.decode_content = True # Content-Encoding im = Image.open(r.raw) #NOTE: it requires pillow 2.8+ print(im.format, im.mode, im.size)

In general if you have a bytestring; you could wrap it as f = io.BytesIO(r.content), to get a file-like object without touching the disk:

#!/usr/bin/env python import io import zipfile from contextlib import closing import requests # $ pip install requests  r = requests.get("http://www.pythonchallenge.com/pc/def/channel.zip") with closing(r), zipfile.ZipFile(io.BytesIO(r.content)) as archive:     print({member.filename: archive.read(member) for member in archive.infolist()})

You can't pass r.raw to ZipFile() directly because the former is a non-seekable file.

I would like to see if I can circumvent having to code a file deletion line

tempfile can delete files automatically f = tempfile.SpooledTemporaryFile(); f.write(u.content). Until .fileno() method is called (if some api requires a real file) or maxsize is reached; the data is kept in memory. Even if the data is written on disk; the file is deleted as soon as it closed.

187

answered Sep 17 '22 22:09

jfs

Your answer is u.content. The content is in the memory. Unless you write it to a file, it won’t be stored on disk.

answered Sep 17 '22 22:09

poke

Related questions
                            
                                How to get an app name using python in django
                            
                                Sort a string in lexicographic order python
                            
                                how to export HDF5 file to NumPy using H5PY?
                            
                                What does "variable or 0" mean in python?
                            
                                getting dynamic attribute in python [duplicate]
                            
                                How can I set a DateField format in django from the model?
                            
                                Append item to MongoDB document array in PyMongo without re-insertion
                            
                                Faithfully Preserve Comments in Parsed XML
                            
                                How to check if a key-value pair is present in a dictionary?
                            
                                Difference between "raise" and "raise e"?
                            
                                how to install tensorflow on anaconda python 3.6
                            
                                installing urllib in Python3.6
                            
                                2D arrays in Python
                            
                                How can I place a table on a plot in Matplotlib?
                            
                                Why is matplotlib plotting my circles as ovals?
                            
                                Check if list items contains substrings from another list
                            
                                Celery task schedule (Ensuring a task is only executed one at a time)
                            
                                How do I convert an array to string using the jinja template engine?
                            
                                Scrapy - Silently drop an item
                            
                                Get array elements from index to end

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python - Download File Using Requests, Directly to Memory

Tags:

python

python-requests

mmap

Anon

People also ask

2 Answers

jfs

poke

Recent Activity

Donate For Us