Handing conversion from bytes to string when not explicitly opening a file in Python 3

Tags:

I am using the Requests module to authorise and then pull csv content from a web API and have it running fine in Python 2.7. I now want to write the same script in Python 3.5 but experiencing some issues:

"iterator should return strings, not bytes (did you open the file in text mode?)"

The requests.get seems to return bytes and not a string, which seems to be related to the encoding issues seen when moving to Python 3.x. The error is raised on the 3rd from last line: next(reader). In Python 2.7 this was not an issue because the csv functions were handled in 'wb' mode.

This article is very similar, but as I'm not opening a csv file directly, I cant seem to force the response text to be encoded this way: csv.Error: iterator should return strings, not bytes

countries = ['UK','US','CA']
datelist = [1,2,3,4]
baseurl = 'https://somewebsite.com/exporttoCSV.php'

#--- For all date/cc combinations
for cc in countries:
    for d in datelist:

        #---Build API String with variables
        url = (baseurl + '?data=chart&output=csv' +
               '&dataset=' + d + 
               '&cc=' + cc)

        #---Run API Call and create reader object
        r = requests.get(url, auth=(username, password))
        text = r.iter_lines()
        reader = csv.reader(text,delimiter=',')

        #---Write csv output to csv file with territory and date columns
        with open(cc + '_'+ d +'.csv','wt', newline='') as file:
            a = csv.writer(file)
            a.writerow(['position','id','title','kind','peers','territory','date']) #---Write header line
            next(reader) #---Skip original headers
            for i in reader:
                a.writerow(i +[countrydict[cc]] + [datevalue])

727

asked Jul 08 '16 11:07

Steve

2 Answers

Without being able to test your exact scenario, I believe this should be solved by changing text = r.iter_lines() to:

text = (line.decode('utf-8') for line in r.iter_lines())

This should decode each line read in by r.iter_lines() from a byte string to a string usable by csv.reader

My test case is as follows:

>>> iter_lines = [b'1,2,3,4',b'2,3,4,5',b'3,4,5,6']
>>> text = (line.decode('utf-8') for line in iter_lines)
>>> reader = csv.reader(text, delimiter=',')
>>> next(reader)
['1', '2', '3', '4']
>>> for i in reader:
...     print(i)
...
['2', '3', '4', '5']
['3', '4', '5', '6']

113

answered Oct 07 '22 17:10

Bamcclur

Some files have to be read in as bytes, for example from Django SimpleUploadedFile, which is a testing class only uses bytes. Here is some example code from my test suite on how I got it working:

test_code.py

import os
from django.core.files.uploadedfile import SimpleUploadedFile
from django.test import TestCase

class ImportDataViewTests(TestCase):

    def setUp(self):
        self.path = "test_in/example.csv"
        self.filename = os.path.split(self.file)[1]

    def test_file_upload(self):
        with open(self.path, 'rb') as infile:
            _file = SimpleUploadedFile(self.filename, infile.read())

        # now an `InMemoryUploadedFile` exists, so test it as you shall!

prod_code.py

import csv

def import_records(self, infile):
    csvfile = (line.decode('utf8') for line in infile)
    reader = csv.DictReader(csvfile)

    for row in reader:
        # loop through file and do stuff!

answered Oct 07 '22 16:10

Aaron Lelevier

Related questions
                            
                                SQL Query results in tkinter
                            
                                running django python 3.4 on mod_wsgi with apache2
                            
                                get lastweek dates using python?
                            
                                How do you find a unique and constant ID of a widget?
                            
                                Why can't you reference modules that appear to be automatically loaded by the interpreter without an additional `import` statement?
                            
                                Abstract base class is not enforcing function implementation
                            
                                Can't install modules 'os' and 'os.path'
                            
                                What misspellings / typos are supported in Python?
                            
                                How to avoid new line in readline() function in python 3x? [duplicate]
                            
                                How do I get the index of an item in tkinter.Listbox?
                            
                                Installed Anaconda for python 2 and 3. Can't run 2
                            
                                Python: why is zip(*) used instead of unzip()? [closed]
                            
                                no module named fuzzywuzzy
                            
                                How Does String Conversion Between PyUnicode String and C String Work? [closed]
                            
                                Embedding Python in C: Error in linking - undefined reference to PyString_AsString
                            
                                linux centos 6.7 pip3 install
                            
                                Python : Behaviour of send() in generators
                            
                                star unpacking for own classes
                            
                                Can't import pygal_maps_world.World
                            
                                How to install PyRTF with Python3?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Handing conversion from bytes to string when not explicitly opening a file in Python 3

Tags:

python-3.x

csv

python-requests

Steve

People also ask

2 Answers

Bamcclur

Aaron Lelevier

Recent Activity

Donate For Us