Saving DataFrame names as .csv file names in Pandas

Tags:

pandas

In [37]: blue = pd.DataFrame({'A': ['foo','foo','foo','bar','bar'], 'B': [4.0, 4.0, 5.0, 8.0, 8.0]})

In [38]: blue
Out[38]: 
     A  B
0  foo  4
1  foo  4
2  foo  5
3  bar  8
4  bar  8

In [39]: red = pd.DataFrame({'A': ['foo','foo','foo','bar','bar'], 'B': [np.nan, np.nan, np.nan, np.nan, np.nan]})

In [40]: red
Out[40]: 
     A   B
0  foo NaN
1  foo NaN
2  foo NaN
3  bar NaN
4  bar NaN

In [41]: for df in [blue, red]:
   ....:     df.to_csv(str(df))
   ....:     

In [42]: !ls
     A  B?0  foo  4?1  foo  4?2  foo  5?3  bar  8?4  bar  8       A   B?0  foo NaN?1  foo NaN?2  foo NaN?3  bar NaN?4  bar NaN  postinstall.sh  vagrant

I have some DataFrames. I loop over each DataFrame to work on them. At the end of the loop I want to save each DataFrame as a .csv file named after the DataFrame. I know that it's generally difficult to stringify the name of a variable in Python, but I have to think that I'm missing something obvious here. There is no "name" attribute for DataFrames, so what do I do?

642

asked Aug 15 '14 19:08

verbsintransit

1 Answers

You can just add an attribute to the df, same as any other python object that has a __dict__ attribute and use it later:

In [2]:

blue.name = 'blue'
red.name = 'red'
df_list = [blue, red]
for df in df_list:
    print(df.name)
    df.to_csv(df.name + '.csv')
blue
red

Even better, for convenience you can store the csv name and use it later too:

In [5]:

blue.name = 'blue'
blue.csv_path = 'blue.csv'
red.name = 'red'
red.csv_path = 'red.csv'
df_list = [blue, red]
for df in df_list:
    print(df.name)
    print(df.csv_path)
    df.to_csv(df.csv_path)
blue
blue.csv
red
red.csv

EDIT As @Jeff has pointed out, the attributes will not persist across most operations on the df as a copy of the df is returned and these attributes are not copied across so be aware of this.

147

answered Sep 22 '22 21:09

EdChum

Related questions
                            
                                Asynchronous multiprocessing with a worker pool in Python: how to keep going after timeout?
                            
                                Python Subprocess Security
                            
                                Use a metaclass only for subclasses
                            
                                search criteria of IMAP protocol search command
                            
                                pip executes the wrong python library versions inside virtual env
                            
                                Check if value is between pair of values in a tuple?
                            
                                Python - extending properties like you'd extend a function
                            
                                Trailing underscore in `np.ix_`
                            
                                Cannot list all of my fields in list_editable without causing errors
                            
                                Python Regex Sub - Use Match as Dict Key in Substitution
                            
                                Python mock method call arguments display the last state of a list
                            
                                SQLAlchemy, PostgreSQL and array_agg: How to select items from array_agg?
                            
                                pandas.Series.interpolate() does nothing. Why?
                            
                                Drawing polygon with n number of sides in Python 3.2
                            
                                Timeout on tests with nosetests
                            
                                Extract text between tags with XPath including markup
                            
                                How to truncate html without breaking the tags?
                            
                                How to generate a list of antonyms for adjectives in WordNet using Python
                            
                                Using Labels in HAVING() Clause in SQLAlchemy
                            
                                Dynamic arguments for Python's argparse

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With