I'm writing a script to reduce a large .xlsx file with headers into a csv, and then write a new csv file with only the required columns based on header name. <pre class="prettyprint"><code>import pandas import csv df = pandas.read_csv('C:\\Python27\\Work\\spoofing.csv') time = df["InviteTime (Oracle)"] orignum = df["Orig Number"] origip = df["Orig IP Address"] destnum = df["Dest Number"] df.to_csv('output.csv', header=[time,orignum,origip,destnum]) </code></pre> The error I'm getting is with that last bit of code, and it says <pre class="prettyprint"><code>ValueError: Writing 102 cols but got 4 aliases </code></pre> I'm sure i'm overlooking something stupid, but I've read over the to_csv documentation on the pandas website and I'm still at a loss. I know I'm using the to_csv parameters incorrectly but I can't seem to get my head around the documentation I guess. Any help is appreciated, thanks!

The way to select specific columns is this - <pre class="prettyprint"><code>header = ["InviteTime (Oracle)", "Orig Number", "Orig IP Address", "Dest Number"] df.to_csv('output.csv', columns = header) </code></pre>

Pandas Writing Dataframe Columns to csv

Tags:

python

pandas

csv

I'm writing a script to reduce a large .xlsx file with headers into a csv, and then write a new csv file with only the required columns based on header name.

import pandas import csv  df = pandas.read_csv('C:\\Python27\\Work\\spoofing.csv')  time = df["InviteTime (Oracle)"] orignum = df["Orig Number"] origip = df["Orig IP Address"] destnum = df["Dest Number"]  df.to_csv('output.csv', header=[time,orignum,origip,destnum])

The error I'm getting is with that last bit of code, and it says

ValueError: Writing 102 cols but got 4 aliases

I'm sure i'm overlooking something stupid, but I've read over the to_csv documentation on the pandas website and I'm still at a loss. I know I'm using the to_csv parameters incorrectly but I can't seem to get my head around the documentation I guess.

Any help is appreciated, thanks!

932

asked Feb 25 '14 16:02

Harrison Boles

1 Answers

The way to select specific columns is this -

header = ["InviteTime (Oracle)", "Orig Number", "Orig IP Address", "Dest Number"] df.to_csv('output.csv', columns = header)

160

answered Sep 19 '22 23:09

user1827356

Related questions
                            
                                How to use Python decorators to check function arguments?
                            
                                Python vectorizing nested for loops
                            
                                Is it possible to change the model name in the django admin site?
                            
                                Slicing a list into n nearly-equal-length partitions [duplicate]
                            
                                Django and query string parameters
                            
                                Why doesn't Python evaluate constant number arithmetic before compiling to bytecode?
                            
                                Appending two dataframes with same columns, different order
                            
                                Overloading __dict__() on python class
                            
                                pandas groupby and join lists
                            
                                Reading input sound signal using Python
                            
                                How to install dependencies from a copied pipfile inside a virtual environment?
                            
                                How to exit when viewing python help like help(os.listdir)
                            
                                Function with varying number of For Loops (python)
                            
                                Saving numpy array to txt file row wise
                            
                                python regex first/shortest match
                            
                                How do I test dictionary-equality with Python's doctest-package?
                            
                                setting up s3 for logs in airflow
                            
                                How to output CDATA using ElementTree
                            
                                Creating dummy variables in pandas for python
                            
                                set very low values to zero in numpy

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With