I have several CSV files that look like this: <pre class="prettyprint lang-none prettyprint-override"><code>Input Name Code blackberry 1 wineberry 2 rasberry 1 blueberry 1 mulberry 2 </code></pre> I would like to add a new column to all CSV files so that it would look like this: <pre class="prettyprint lang-none prettyprint-override"><code>Output Name Code Berry blackberry 1 blackberry wineberry 2 wineberry rasberry 1 rasberry blueberry 1 blueberry mulberry 2 mulberry </code></pre> The script I have so far is this: <pre class="prettyprint"><code>import csv with open(input.csv,'r') as csvinput: with open(output.csv, 'w') as csvoutput: writer = csv.writer(csvoutput) for row in csv.reader(csvinput): writer.writerow(row+['Berry']) </code></pre> (Python 3.2) But in the output, the script skips every line and the new column has only Berry in it: <pre class="prettyprint lang-none prettyprint-override"><code>Output Name Code Berry blackberry 1 Berry wineberry 2 Berry rasberry 1 Berry blueberry 1 Berry mulberry 2 Berry </code></pre>

I'm surprised no one suggested Pandas. Although using a set of dependencies like Pandas might seem more heavy-handed than is necessary for such an easy task, it produces a very short script and Pandas is a great library for doing all sorts of CSV (and really all data types) data manipulation. Can't argue with 4 lines of code: <pre class="prettyprint"><code>import pandas as pd csv_input = pd.read_csv('input.csv') csv_input['Berries'] = csv_input['Name'] csv_input.to_csv('output.csv', index=False) </code></pre> Check out Pandas Website for more information! Contents of <code>output.csv</code>: <pre class="prettyprint"><code>Name,Code,Berries blackberry,1,blackberry wineberry,2,wineberry rasberry,1,rasberry blueberry,1,blueberry mulberry,2,mulberry </code></pre>

How to add a new column to a CSV file?

Tags:

python

python-3.x

csv

I have several CSV files that look like this:

Input Name        Code blackberry  1 wineberry   2 rasberry    1 blueberry   1 mulberry    2

I would like to add a new column to all CSV files so that it would look like this:

Output Name        Code    Berry blackberry  1   blackberry wineberry   2   wineberry rasberry    1   rasberry blueberry   1   blueberry mulberry    2   mulberry

The script I have so far is this:

import csv with open(input.csv,'r') as csvinput:     with open(output.csv, 'w') as csvoutput:         writer = csv.writer(csvoutput)         for row in csv.reader(csvinput):             writer.writerow(row+['Berry'])

(Python 3.2)

But in the output, the script skips every line and the new column has only Berry in it:

Output Name        Code    Berry blackberry  1   Berry  wineberry   2   Berry  rasberry    1   Berry  blueberry   1   Berry  mulberry    2   Berry

902

asked Jun 17 '12 10:06

fairyberry

2 Answers

This should give you an idea of what to do:

>>> v = open('C:/test/test.csv') >>> r = csv.reader(v) >>> row0 = r.next() >>> row0.append('berry') >>> print row0 ['Name', 'Code', 'berry'] >>> for item in r: ...     item.append(item[0]) ...     print item ...      ['blackberry', '1', 'blackberry'] ['wineberry', '2', 'wineberry'] ['rasberry', '1', 'rasberry'] ['blueberry', '1', 'blueberry'] ['mulberry', '2', 'mulberry'] >>>

Edit, note in py3k you must use next(r)

Thanks for accepting the answer. Here you have a bonus (your working script):

import csv  with open('C:/test/test.csv','r') as csvinput:     with open('C:/test/output.csv', 'w') as csvoutput:         writer = csv.writer(csvoutput, lineterminator='\n')         reader = csv.reader(csvinput)          all = []         row = next(reader)         row.append('Berry')         all.append(row)          for row in reader:             row.append(row[0])             all.append(row)          writer.writerows(all)

Please note

the lineterminator parameter in csv.writer. By default it is set to '\r\n' and this is why you have double spacing.
the use of a list to append all the lines and to write them in one shot with writerows. If your file is very, very big this probably is not a good idea (RAM) but for normal files I think it is faster because there is less I/O.
As indicated in the comments to this post, note that instead of nesting the two with statements, you can do it in the same line:

with open('C:/test/test.csv','r') as csvinput, open('C:/test/output.csv', 'w') as csvoutput:

134

answered Oct 18 '22 04:10

joaquin

I'm surprised no one suggested Pandas. Although using a set of dependencies like Pandas might seem more heavy-handed than is necessary for such an easy task, it produces a very short script and Pandas is a great library for doing all sorts of CSV (and really all data types) data manipulation. Can't argue with 4 lines of code:

import pandas as pd csv_input = pd.read_csv('input.csv') csv_input['Berries'] = csv_input['Name'] csv_input.to_csv('output.csv', index=False)

Check out Pandas Website for more information!

Contents of output.csv:

Name,Code,Berries blackberry,1,blackberry wineberry,2,wineberry rasberry,1,rasberry blueberry,1,blueberry mulberry,2,mulberry

answered Oct 18 '22 02:10

Blairg23

Related questions
                            
                                Which is the best way to check for the existence of an attribute? [duplicate]
                            
                                What is the difference between int() and floor() in Python 3?
                            
                                Turn string into operator
                            
                                Round integers to the nearest 10
                            
                                Checking if a website is up via Python
                            
                                Escape string Python for MySQL
                            
                                How to conditionally update DataFrame column in Pandas
                            
                                Writing a __init__ function to be used in django model
                            
                                Access request in django custom template tags
                            
                                Using Colormaps to set color of line in matplotlib
                            
                                Using setattr() in python
                            
                                What's the best way to return multiple values from a function?
                            
                                Using Python to execute a command on every file in a folder
                            
                                Python parse comma-separated number into int [duplicate]
                            
                                Getting the first non None value from list
                            
                                Unresolved attribute reference 'objects' for class '' in PyCharm
                            
                                How can I find the missing value more concisely?
                            
                                for loop in Python
                            
                                number of values in a list greater than a certain number
                            
                                How to add clickable links to a field in Django admin?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With