Remove 'b' flag in CSV output

Tags: python, csv

I am trying to write output to a CSV file in Python 3.4, but the file always contains 'b' flags, for example b'The text output1', b'The text output2', ... Is there a way to get rid of the 'b' flags? I understand that this is not an issue in Python 2.x.

Here is the code I used:

with open('test.csv', 'w') as f:
    writer = csv.DictWriter(f, ['field'], extrasaction='ignore')
    writer.writeheader()
    test_text = mongo.test.find({'text': text})
    for t in test_text:
        writer.writerow({i:v.encode('utf') for i,v in t.items()})

Thanks very much

------Updates-----------

Thanks very much to Tim Pietzcker, John Zwinck, and Warren Weckesser for the helpful comments and answers. Per Warren's suggestion, if I change my code to

import csv

data = [chr(0x03d5) + 'oo', 'b' + chr(0x0101) + 'r']

with open('test.csv', 'w') as f:
    writer = csv.writer(f)
    for item in data:
        writer.writerow([item])

I get the error message

UnicodeEncodeError: 'charmap' codec can't encode character '\u03d5' in position 0: character maps to <undefined>

If I change my code to

import csv

data = [chr(0x03d5) + 'oo', 'b' + chr(0x0101) + 'r']

with open('test.csv', 'w') as f:
    writer = csv.writer(f)
    for item in data:
        writer.writerow([item.encode('utf')])

I get output with 'b' flags

b'\xcf\x95oo'
b'b\xc4\x81r'

Any thoughts on how this is happening and how I might be able to fix it? Thanks again.

------Updates 2-----------

Thanks very much for Warren's solution. The following code worked!

import csv

data = [chr(0x03d5) + 'oo', 'b' + chr(0x0101) + 'r']

with open('test.csv', 'w', encoding='utf8') as f:
    writer = csv.writer(f)
    for item in data:
        writer.writerow([item])

asked Oct 07 '14 05:10

frostman

1 Answer

Don't explicitly encode the strings yourself; let the writer take care of it. For example, this code:

import csv

data = [chr(0x03d5) + 'oo', 'b' + chr(0x0101) + 'r']

with open('test.csv', 'w') as f:
    writer = csv.writer(f)
    for item in data:
        writer.writerow([item])

writes the file

ϕoo
bār

with UTF-8 encoding (at least it does on my system, where locale.getpreferredencoding(False) returns 'UTF-8'). To make the encoding explicit, you can set the encoding in the call to open:

    with open('test.csv', 'w', encoding='utf8') as f:
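As a minimal sketch of the explicit-encoding version, the following writes the two strings and then reads the raw bytes back to check what actually landed on disk. The temporary path is only for illustration, and `newline=''` is what the csv documentation recommends when opening a file for a writer. The last few lines rest on an assumption about the question: that the 'charmap' error comes from a default encoding such as the Windows code page cp1252, which has no mapping for ϕ.

```python
import csv
import os
import tempfile

data = [chr(0x03d5) + 'oo', 'b' + chr(0x0101) + 'r']

# Hypothetical location for the demo; any writable path works.
path = os.path.join(tempfile.mkdtemp(), 'test.csv')

# Pass str objects to the writer and let the file object do the encoding.
with open(path, 'w', encoding='utf-8', newline='') as f:
    writer = csv.writer(f)
    for item in data:
        writer.writerow([item])

# Read the raw bytes back to confirm the on-disk encoding.
with open(path, 'rb') as f:
    raw = f.read()
print(raw)

# Assumption: a default encoding like cp1252 is what triggers the
# 'charmap' error, because it cannot represent U+03D5.
try:
    chr(0x03d5).encode('cp1252')
    cp1252_failed = False
except UnicodeEncodeError as exc:
    cp1252_failed = True
    print(exc)
```

Naming the encoding in `open` removes the dependence on the platform default, so the same script should behave the same way on systems whose preferred encoding differs.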

If the last line of the first example (the writer.writerow call) is changed to writer.writerow([item.encode('utf')]) (which converts the strings to bytes), the file contains

b'\xcf\x95oo'
b'b\xc4\x81r'
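The reason for that output, as far as the csv documentation describes it, is that the writer stringifies any non-string value with `str()` before writing it, and `str()` of a `bytes` object is its repr, `b` prefix included. A small sketch using an in-memory `io.StringIO` buffer (chosen here only so the example needs no file) shows the same thing:

```python
import csv
import io

text = chr(0x03d5) + 'oo'
encoded = text.encode('utf-8')  # bytes, not str

# str() of a bytes object gives its repr, including the b'...' wrapper.
as_written = str(encoded)

# The csv writer applies the same conversion to non-string fields.
buf = io.StringIO()
csv.writer(buf).writerow([encoded])
row = buf.getvalue()

print(as_written)
print(row)
```

So the 'b' flags are not added by the file or by the encoding step; they are just the text form of the bytes objects that were handed to the writer.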

In your example, try changing this line:

        writer.writerow({i:v.encode('utf') for i,v in t.items()})

to this:

        writer.writerow(t)

Then if that works, you could replace this:

    for t in test_text:
        writer.writerow({i:v.encode('utf') for i,v in t.items()})

with

    writer.writerows(test_text)
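Here is a rough sketch of that last form. The `documents` list is a hypothetical stand-in for whatever `mongo.test.find(...)` returns (I don't have your database, so the keys and values are made up), and the in-memory buffer replaces the real file only to keep the example self-contained:

```python
import csv
import io

# Hypothetical stand-in for the documents yielded by mongo.test.find(...).
documents = [
    {'_id': 1, 'field': 'The text output1'},
    {'_id': 2, 'field': 'The text output2'},
]

# In-memory text buffer used in place of open('test.csv', 'w', ...).
buf = io.StringIO()

# extrasaction='ignore' drops keys (such as '_id') not listed in fieldnames.
writer = csv.DictWriter(buf, ['field'], extrasaction='ignore')
writer.writeheader()
writer.writerows(documents)  # str values are passed through without encoding

output = buf.getvalue()
print(output)
```

With a real file you would open it with an explicit `encoding` as shown earlier and pass the cursor straight to `writerows`.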

answered Oct 20 '22 06:10

Warren Weckesser
