Putting many Python pandas DataFrames into one Excel worksheet

Tags: python, pandas, dataframe, excel, xlsxwriter

It is quite easy to add many pandas DataFrames to an Excel workbook as long as each one goes on a different worksheet. But it is somewhat tricky to get many DataFrames onto one worksheet if you want to use pandas' built-in df.to_excel functionality.

    # Creating Excel Writer Object from Pandas
    writer = pd.ExcelWriter('test.xlsx', engine='xlsxwriter')
    workbook = writer.book
    worksheet = workbook.add_worksheet('Validation')
    df.to_excel(writer, sheet_name='Validation', startrow=0, startcol=0)
    another_df.to_excel(writer, sheet_name='Validation', startrow=20, startcol=0)

The above code won't work. You will get this error:

 Sheetname 'Validation', with case ignored, is already in use.

After some experimenting, I found a way to make it work.

    # Creating Excel Writer Object from Pandas
    writer = pd.ExcelWriter('test.xlsx', engine='xlsxwriter')
    workbook = writer.book
    df.to_excel(writer, sheet_name='Validation', startrow=0, startcol=0)
    another_df.to_excel(writer, sheet_name='Validation', startrow=20, startcol=0)

This will work. So my purpose in posting this question on Stack Overflow is twofold. First, I hope this will help anyone who is trying to put many DataFrames into a single Excel worksheet.

Second, can someone help me understand the difference between those two blocks of code? They appear to be pretty much the same, except that the first block creates the worksheet called "Validation" in advance while the second does not. I get that part.

What I don't understand is why that should make any difference. Even if I don't create the worksheet in advance, this line, the one right before the last,

    df.to_excel(writer, sheet_name='Validation', startrow=0, startcol=0)

will create a worksheet anyway. Consequently, by the time we reach the last line, the worksheet "Validation" has already been created in the second block of code as well. So my question is basically: why does the second block of code work while the first doesn't?

Please also share if there is another way to put many DataFrames into Excel using the built-in df.to_excel functionality!

asked Oct 05 '15 20:10

nyan314sn

2 Answers

To create the Worksheet in advance, you need to add the created sheet to the sheets dict:

writer.sheets['Validation'] = worksheet

Using your original code:

    # Creating Excel Writer Object from Pandas
    writer = pd.ExcelWriter('test.xlsx', engine='xlsxwriter')
    workbook = writer.book
    worksheet = workbook.add_worksheet('Validation')
    writer.sheets['Validation'] = worksheet
    df.to_excel(writer, sheet_name='Validation', startrow=0, startcol=0)
    another_df.to_excel(writer, sheet_name='Validation', startrow=20, startcol=0)

Explanation

If we look at the pandas function to_excel, it uses the writer's write_cells function:

excel_writer.write_cells(formatted_cells, sheet_name, startrow=startrow, startcol=startcol)

So looking at the write_cells function for xlsxwriter:

    def write_cells(self, cells, sheet_name=None, startrow=0, startcol=0):
        # Write the frame cells using xlsxwriter.
        sheet_name = self._get_sheet_name(sheet_name)
        if sheet_name in self.sheets:
            wks = self.sheets[sheet_name]
        else:
            wks = self.book.add_worksheet(sheet_name)
            self.sheets[sheet_name] = wks

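Here we can see that it checks whether sheet_name is in self.sheets. A worksheet created directly on the workbook is not in that dict, so in the pandas version shown here write_cells calls add_worksheet again and the workbook rejects the duplicate name. That is why the sheet needs to be added to self.sheets as well.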
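The lookup can be reproduced without pandas. The sketch below is a toy model, not the real pandas or xlsxwriter code: ToyWorkbook and ToyWriter are hypothetical stand-ins, and ValueError is used only for illustration. It assumes the workbook keeps its own case-insensitive record of sheet names while the writer keeps a separate sheets dict, which is the situation the write_cells excerpt describes.

```python
class ToyWorkbook:
    """Hypothetical stand-in for the workbook: it tracks sheet names itself."""

    def __init__(self):
        self._names = set()

    def add_worksheet(self, name):
        # Excel sheet names are case-insensitive, so compare lowercased names.
        if name.lower() in self._names:
            raise ValueError(
                f"Sheetname '{name}', with case ignored, is already in use."
            )
        self._names.add(name.lower())
        return object()  # placeholder for a worksheet object


class ToyWriter:
    """Hypothetical stand-in for the writer: it has a separate sheets dict."""

    def __init__(self):
        self.book = ToyWorkbook()
        self.sheets = {}

    def write_cells(self, sheet_name):
        # Same lookup as in the write_cells excerpt.
        if sheet_name in self.sheets:
            return self.sheets[sheet_name]
        wks = self.book.add_worksheet(sheet_name)
        self.sheets[sheet_name] = wks
        return wks


# Case 1: the sheet is created on the workbook only, so the writer's dict
# does not know about it and write_cells tries to create it a second time.
writer = ToyWriter()
writer.book.add_worksheet("Validation")
try:
    writer.write_cells("Validation")
    outcome = "ok"
except ValueError as exc:
    outcome = str(exc)
print(outcome)

# Case 2: registering the sheet in the dict makes write_cells reuse it.
fixed = ToyWriter()
ws = fixed.book.add_worksheet("Validation")
fixed.sheets["Validation"] = ws
print(fixed.write_cells("Validation") is ws)
```

In this model the first case fails with the same message as in the question, and the second case reuses the pre-created sheet because the dict lookup succeeds.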

answered Oct 06 '22 07:10

Adrian

user3817518: "Please also share if there is another way to put many dataframes into excel using the built-in df.to_excel functionality !!"

Here's my attempt:

An easy way to put a lot of dataframes on just one sheet or across multiple tabs. Let me know if this works!

To test, run the sample dataframes first, then the second and third portions of code.

Sample dataframes

    import pandas as pd
    import numpy as np

    # Sample dataframes
    randn = np.random.randn
    df = pd.DataFrame(randn(15, 20))
    df1 = pd.DataFrame(randn(10, 5))
    df2 = pd.DataFrame(randn(5, 10))

Put multiple dataframes into one xlsx sheet

    # function
    def multiple_dfs(df_list, sheets, file_name, spaces):
        # The context manager saves and closes the file on exit
        # (writer.save() was removed in pandas 2.0).
        with pd.ExcelWriter(file_name, engine='xlsxwriter') as writer:
            row = 0
            for dataframe in df_list:
                dataframe.to_excel(writer, sheet_name=sheets, startrow=row, startcol=0)
                # skip past the header row, the data rows, and the spacer rows
                row = row + len(dataframe.index) + spaces + 1

    # list of dataframes
    dfs = [df, df1, df2]

    # run function
    multiple_dfs(dfs, 'Validation', 'test1.xlsx', 1)

Put multiple dataframes across separate tabs/sheets

    # function
    def dfs_tabs(df_list, sheet_list, file_name):
        # The context manager saves and closes the file on exit
        # (writer.save() was removed in pandas 2.0).
        with pd.ExcelWriter(file_name, engine='xlsxwriter') as writer:
            for dataframe, sheet in zip(df_list, sheet_list):
                dataframe.to_excel(writer, sheet_name=sheet, startrow=0, startcol=0)

    # list of dataframes and sheet names
    dfs = [df, df1, df2]
    sheets = ['df', 'df1', 'df2']

    # run function
    dfs_tabs(dfs, sheets, 'multi-test.xlsx')

answered Oct 06 '22 06:10

TomDobbs
