I would like to name the columns when I import a CSV into a dataframe with Dask in Python. The code I use looks like this:

for i in range(1, files + 1):
    filename = str(i) + 'GlobalActorsHeatMap.csv'
    runs[i] = dd.read_csv(filename, header=None)
I would like to use an array with names for each column:
names = ['tribute', 'percent_countries_active', 'num_wars', 'num_tributes', 'war', 'war_to_tribute_ratio', 'US_wealth', 'UK_wealth', 'NZ_wealth' ]
Is this possible to do directly?
Call pandas.read_csv(filepath_or_buffer, names=None) with filepath_or_buffer set to the filename of the .csv and names set to the list of column names. The column names will be assigned to the columns of the resulting DataFrame in the order they appear in names. Dask's dd.read_csv accepts the same parameters.
The names parameter of read_csv defines the column names. If you pass an extra name in this list, an additional column with that name is added and filled with NaN values. header=None tells the parser the file has no header row, so the first line is read as data rather than as column names.
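A minimal sketch of that behaviour, using a made-up two-column CSV and an extra hypothetical column name 'c':

```python
import io

import pandas as pd

# A two-column CSV with no header row
csv_data = "1,2\n3,4\n"

# Pass three names for two columns of data: 'a' and 'b' receive the values,
# and the extra 'c' column is created and filled with NaN
df = pd.read_csv(io.StringIO(csv_data), header=None, names=['a', 'b', 'c'])
print(df.columns.tolist())
```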
Dask runs this query faster than pandas, even when the least efficient column type is used, because it parallelizes the computation. pandas uses only one CPU core; my machine has 4 cores, and Dask uses all of them.
Just use the names argument of read_csv:

names = [...]
dd.read_csv(filename, header=None, names=names)

Read more in the dd.read_csv documentation.