Merging multiple CSV files into separate tabs of a spreadsheet in Python

Tags:

I have a code which generates multiple CSV files in a directory. I want to generate a report in excel which will consist of the CSV files as separate tabs. I have used the below code for the same:

import pandas as pd
import os
import csv
import glob    
path = "/MyScripts"
all_files = glob.glob(os.path.join(path, "*.csv"))
df_from_each_file = (pd.read_csv(f) for f in all_files)
df_from_each_file.to_excel(writer, sheet_name='ReturnData.csv')
writer.save()

But it gives below error: AttributeError: 'generator' object has no attribute 'to_excel' Not sure where i am going wrong. Do i need to import any specific library to solve the issue?

Python Version is 2.7

688

asked Aug 22 '18 09:08

Soubhik Banerjee

Video Answer

2 Answers

There are two issues here:

Your generator expression allows you to lazily iterate dataframe objects. You can't export a generator expression to an Excel file.
Your sheet_name parameter is a constant. To export to multiple worksheets, you need to specify a different name for each worksheet.

You can use a simple for loop for this purpose:

writer = pd.ExcelWriter('out.xlsx', engine='xlsxwriter')
df_from_each_file = (pd.read_csv(f) for f in all_files)

for idx, df in enumerate(df_from_each_file):
    df.to_excel(writer, sheet_name='data{0}.csv'.format(idx))

writer.save()

Your worksheets will be named data0.csv, data1.csv, etc. If you need the filename as your sheet name, you can restructure your logic and use the os module to extract the filename from path:

import os

writer = pd.ExcelWriter('out.xlsx', engine='xlsxwriter')

for f in all_files:
    df = pd.read_csv(f)
    df.to_excel(writer, sheet_name=os.path.basename(f))

writer.save()

answered Oct 19 '22 07:10

jpp

Here is the complete source code from jpp solution:

import os
import pandas as pd
import glob

path = './'
all_files = glob.glob(os.path.join(path, "*.csv"))

writer = pd.ExcelWriter('out.xlsx', engine='xlsxwriter')

for f in all_files:
    df = pd.read_csv(f)
    df.to_excel(writer, sheet_name=os.path.splitext(os.path.basename(f))[0], index=False)

writer.save()

answered Oct 19 '22 05:10

Dan

Related questions
                            
                                Making a graphQL mutation from my python code, getting error
                            
                                How to create a function object from an ast.FunctionDef node?
                            
                                Python Pandas Fillna Median not working
                            
                                Subclassing type vs object in Python3 [duplicate]
                            
                                PyInstaller stuck on "Building PKG ..." when exporting a single .exe
                            
                                NumPy equivalent to Keras function utils.to_categorical
                            
                                Django Rest Framework how to disable authentication and authorization
                            
                                Issues importing mlxtend python
                            
                                doing "nothing" in else command of if-else clause [duplicate]
                            
                                How to use flask context with concurrent.futures.ThreadPoolExecutor
                            
                                Drop duplicates, but ignore nulls
                            
                                adding static() to urlpatterns only work by appending to the list
                            
                                Unable to print names in the right way in another function
                            
                                Dividing each row by the previous one
                            
                                Merge two columns into one within the same data frame in pandas/python
                            
                                How to increase process speed using read_excel in pandas?
                            
                                Change color of individual boxes in pandas boxplot subplots
                            
                                Run bash script with Django
                            
                                PipEnv: How to handle locally installed .whl packages
                            
                                Python - matplotlib - how do I plot a plane from equation?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Merging multiple CSV files into separate tabs of a spreadsheet in Python

Tags:

python

python-3.x

pandas

excel

python-2.7

Soubhik Banerjee

People also ask

Video Answer

2 Answers

jpp

Dan

Recent Activity

Donate For Us