Write to StringIO object using Pandas Excelwriter?

Tags: python, pandas, excel, stringio, xlsxwriter

I can pass a StringIO object to DataFrame.to_csv() just fine:

io = StringIO.StringIO()
pd.DataFrame().to_csv(io)

But when using the Excel writer, I am having a lot more trouble:

io = StringIO.StringIO()
writer = pd.ExcelWriter(io)
pd.DataFrame().to_excel(writer,"sheet name")
writer.save()

This raises:

AttributeError: StringIO instance has no attribute 'rfind'

I'm trying to create an ExcelWriter object without calling pd.ExcelWriter() but am having some trouble. This is what I've tried so far:

from xlsxwriter.workbook import Workbook
writer = Workbook(io)
pd.DataFrame().to_excel(writer,"sheet name")
writer.save()

But now I am getting an AttributeError: 'Workbook' object has no attribute 'write_cells'

How can I save a pandas dataframe in excel format to a StringIO object?

asked Jan 21 '15 02:01

A User

3 Answers

Pandas expects a file path in the ExcelWriter constructors, although each of the writer engines supports StringIO. Perhaps that should be raised as a bug/feature request in Pandas.

In the meantime here is a workaround example using the Pandas xlsxwriter engine:

import pandas as pd
import StringIO

io = StringIO.StringIO()

# Use a temp filename to keep pandas happy.
writer = pd.ExcelWriter('temp.xlsx', engine='xlsxwriter')

# Set the filename/file handle in the xlsxwriter.workbook object.
writer.book.filename = io

# Write the data frame to the StringIO object.
pd.DataFrame().to_excel(writer, sheet_name='Sheet1')
writer.save()
xlsx_data = io.getvalue()

Update: As of Pandas 0.17 it is now possible to do this more directly:

# Note, Python 2 example. For Python 3 use: output = io.BytesIO().
output = StringIO.StringIO()

# Use the StringIO object as the filehandle.
writer = pd.ExcelWriter(output, engine='xlsxwriter')

See also Saving the Dataframe output to a string in the XlsxWriter docs.
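
For Python 3 and recent pandas versions, the same idea can be sketched as follows. This is a minimal sketch rather than the answer's original code: it assumes a pandas version where ExcelWriter can be used as a context manager and where either xlsxwriter or openpyxl is installed (no engine is named, so pandas picks whichever is available). The sample data and sheet names are made up for illustration.

```python
import io
import zipfile

import pandas as pd

# Hypothetical sample data, for illustration only.
df = pd.DataFrame({"domain": ["example.com", "example.org"], "rank": [1, 2]})

# xlsx is a binary format, so on Python 3 the buffer must be a BytesIO.
output = io.BytesIO()

# Leaving the with-block closes the writer, which flushes the workbook
# into the buffer. No engine is named, so pandas picks an installed one.
with pd.ExcelWriter(output) as writer:
    df.to_excel(writer, sheet_name="Sheet1", index=False)
    df.to_excel(writer, sheet_name="Copy", index=False)

xlsx_data = output.getvalue()

# An .xlsx file is a zip archive, so the bytes can be sanity-checked.
archive_names = zipfile.ZipFile(io.BytesIO(xlsx_data)).namelist()
print(len(xlsx_data) > 0, "xl/workbook.xml" in archive_names)
```

Using the context manager avoids calling save() or close() by hand, which matters because the name of that method has changed between pandas versions.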

answered Sep 22 '22 20:09

jmcnamara

None of these worked for me. I had a Django view from which I wanted to return an Excel workbook. I found my solution in the pandas documentation.

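import io
import pandas as pd
from django.http import HttpResponse

# df is an existing DataFrame
bio = io.BytesIO()
writer = pd.ExcelWriter(bio, engine='xlsxwriter')
df.to_excel(writer, sheet_name='Sheet1')
writer.save()  # removed in pandas 2.0; call writer.close() there instead
bio.seek(0)  # rewind so the response reads from the start of the buffer

# BONUS CONTENT
# ... because I wanted to return the workbook from an API
response = HttpResponse(bio, content_type='application/vnd.openxmlformats-officedocument.spreadsheetml.sheet')
response['Content-Disposition'] = 'attachment; filename=myfile.xlsx'
return response # returned from a view here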

Note: I used that value for the content type because it is the MIME type listed for ".xlsx" in the Mozilla docs at the following link. Replace as needed. https://developer.mozilla.org/en-US/docs/Web/HTTP/Basics_of_HTTP/MIME_types/Common_types
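
The same pattern can be wrapped in a small helper that does not depend on the web framework. The sketch below is an illustration under assumptions, not Django API: `xlsx_download` and `XLSX_MIME` are hypothetical names, the helper returns the body bytes plus a plain dict of headers, and it relies on an Excel engine (xlsxwriter or openpyxl) being installed.

```python
import io

import pandas as pd

XLSX_MIME = "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"


def xlsx_download(df, filename="myfile.xlsx"):
    """Hypothetical helper: return (body_bytes, headers) for an xlsx download."""
    bio = io.BytesIO()
    # to_excel accepts a file-like object and finalizes the workbook itself.
    df.to_excel(bio, sheet_name="Sheet1", index=False)
    headers = {
        "Content-Type": XLSX_MIME,
        "Content-Disposition": f"attachment; filename={filename}",
    }
    return bio.getvalue(), headers


# Made-up sample frame, for illustration only.
body, headers = xlsx_download(pd.DataFrame({"a": [1, 2], "b": [3, 4]}))
```

In a Django view, `body` could then be passed to `HttpResponse(body, content_type=headers["Content-Type"])`, with the Content-Disposition header set on the response as in the answer above.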

answered Sep 24 '22 20:09

Nick Brady

Glancing at the pandas.io.excel source, it looks like this shouldn't be too much of a problem if you don't mind using xlwt as your writer. The other engines may not be all that difficult either, but xlwt jumps out as easy since its save method takes either a stream or a file path.

You need to pass in a filename initially just to make pandas happy, as it checks the filename extension against the engine to make sure it's a supported format. In the case of the xlwt engine, though, it just stuffs the filename into the object's path attribute and then uses it in the save method. If you change the path attribute to your stream, it'll happily save to that stream when you call the save method.

Here's an example:

# Python 2 example, written against an older pandas with the xlwt engine
import pandas as pd
import StringIO
import base64

df = pd.DataFrame.from_csv('http://moz.com/top500/domains/csv')
xlwt_writer = pd.io.excel.get_writer('xlwt')
my_writer = xlwt_writer('whatever.xls')  # dummy filename to make pandas happy
xl_out = StringIO.StringIO()
my_writer.path = xl_out  # swap the path for the in-memory stream
df.to_excel(my_writer)
my_writer.save()
print base64.b64encode(xl_out.getvalue())

That's the quick, easy and slightly dirty way to do it. BTW, a cleaner way is to subclass ExcelWriter (or one of its existing subclasses, e.g. _XlwtWriter), but honestly there's so little involved in updating the path attribute that I opted to show you the easy way rather than go the slightly longer route.

answered Sep 20 '22 20:09

clockwatcher
