Python writing binary files, bytes

Tags:

Python 3. I'm using QT's file dialog widget to save PDFs downloaded from the internet. I've been reading the file using 'open', and attempting to write it using the file dialog widget. However, I've been running into a"TypeError: '_io.BufferedReader' does not support the buffer interface" error.

Example code:

with open('file_to_read.pdf', 'rb') as f1: 
    with open('file_to_save.pdf', 'wb') as f2:
        f2.write(f1)

This logic works properly with text files when not using the 'b' designator, or when reading a file from the web, like with urllib or requests. These are of the 'bytes' type, which I think I need to be opening the file as. Instead, it's opening as a Buffered Reader. I tried bytes(f1), but get "TypeError: 'bytes' object cannot be interpreted as an integer." Any ideaas?

919

asked May 19 '13 02:05

Turtles Are Cute

1 Answers

If your intent is to simply make a copy of the file, you could use shutil

>>> import shutil
>>> shutil.copyfile('file_to_read.pdf','file_to_save.pdf')

Or if you need to access byte by byte, similar to your structure, this works:

>>> with open('/tmp/fin.pdf','rb') as f1:
...    with open('/tmp/test.pdf','wb') as f2:
...       while True:
...          b=f1.read(1)
...          if b: 
...             # process b if this is your intent   
...             n=f2.write(b)
...          else: break

But byte by byte is potentially really slow.

Or, if you want a buffer that will speed this up (without taking the risk of reading an unknown file size completely into memory):

>>> with open('/tmp/fin.pdf','rb') as f1:
...    with open('/tmp/test.pdf','wb') as f2:
...       while True:
...          buf=f1.read(1024)
...          if buf: 
...              for byte in buf:
...                 pass    # process the bytes if this is what you want
...                         # make sure your changes are in buf
...              n=f2.write(buf)
...          else:
...              break

With Python 2.7+ or 3.1+ you can also use this shortcut (rather than using two with blocks):

with open('/tmp/fin.pdf','rb') as f1,open('/tmp/test.pdf','wb') as f2:
    ...

100

answered Sep 28 '22 12:09

dawg

Related questions
                            
                                What does [...] (an ellipsis) in a list mean in Python? [duplicate]
                            
                                Checking for nan in Cython
                            
                                Why does importing a python module not import nested modules?
                            
                                Retrieve browser headers in Python
                            
                                Flask-Babel how to use translation in Jinja template file
                            
                                Accurate binary image classification
                            
                                Django - Is storing objects in session a good practice?
                            
                                Matplotlib suptitle prints over old title
                            
                                Weird closure behavior in python
                            
                                Parallel construction of a distance matrix
                            
                                Append to a dict of lists with a dict comprehension
                            
                                Change &#39 into normal character
                            
                                python tkinter return value from function used in command
                            
                                Extract Google Scholar results using Python (or R)
                            
                                MongoDB: Find the minimum element in array and delete it
                            
                                Numpy error: Singular matrix
                            
                                beautifulSoup html csv
                            
                                How to monitor events from workers in a Celery-Django application?
                            
                                Matplotlib half black and half white circle
                            
                                TypeError: type object argument after * must be a sequence, not generator

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python writing binary files, bytes

Tags:

python

io

python-3.x

buffer

bufferedreader

Turtles Are Cute

People also ask

1 Answers

dawg

Recent Activity

Donate For Us