How do I add a column to my dataframe that says what sheet name each row is from? Python

Tags:

I am working with a Dataframe that has five sheets and I want to use four of them. So I can load it in:

df = pd.read_excel('***.xls', sheet_name=['a', 'b', 'c', 'd'])

But now I would like to add a column that says what sheet each row was in, and I am not sure how to do this. I tried something like this

for name, frame in df.items():
        frame['Sheet'] = name
        df = df.append(frame, ignore_index=True)

but I was getting the following error:

AttributeError: 'collections.OrderedDict' object has no attribute 'append'

Any help would be greatly appreciated. Thank you in advance!

Let's say this is what my data looks like after I concat the sheets:

df = pd.concat(pd.read_excel(***.xls, sheet_name=['a', 'b', 'c', 'd'],
                          header=1), ignore_index=True, sort=False)

Concat data

My goal is to add a column that says what sheet each row was from, like so...

Concat data with sheet name row

Hopefully that helps you understand what I am trying to go for.

(Edit) I would also like to know how to do this if I wanted to use all the sheets in a dataframe, but didn't want to list the individual names of each sheet. Thanks!

236

asked Dec 13 '19 19:12

jpk

1 Answers

IIUC, try DataFrame.assign in a list comprehension:

sheets = ['a', 'b', 'c', 'd']

df = pd.concat([pd.read_excel('***.xls', sheet_name=s)
                .assign(sheet_name=s) for s in sheets])

Update

If you want to use all sheets and assign a column of sheetname, you could do:

workbook = pd.ExcelFile('***.xls')
sheets = workbook.sheet_names

df = pd.concat([pd.read_excel(workbook, sheet_name=s)
                .assign(sheet_name=s) for s in sheets])

answered Sep 27 '22 22:09

Chris Adams

Related questions
                            
                                Space complexity of split() function in python
                            
                                TypeError: 'JavaPackage' object is not callable (spark._jvm)
                            
                                Multiple if conditions, without nesting
                            
                                Is there a way to remove a layer from a JSON object?
                            
                                How to join subquery results to function results
                            
                                Plotly Dash table callback
                            
                                Converting Audio files between Pydub and Librosa
                            
                                How can I write an IF condition for my decision variable for Mixed Integer Linear Programming (MILP) using PuLP GLPK on Python?
                            
                                What's the big deal of pick a random element from a big stream?
                            
                                How to replace certain parts of a tensor on the condition in keras?
                            
                                How to see the loss of the best epoch from early stopping in Keras?
                            
                                Mypy doesn't throw an error when mixing booleans with integers
                            
                                Displaying a file (image) from S3 via Flask & BytesIO
                            
                                Count the number of connected non-zeros along rows and columns but not diagonaly in a Matrix in shell script
                            
                                KeyError: 'url_encoded_fmt_stream_map'
                            
                                Import python file from another folder that is not a child
                            
                                Python Pandas slicing with various datatypes
                            
                                How to add a new class to an existing classifier in deep learning?
                            
                                Why is os.scandir() as slow as os.listdir()?
                            
                                Set and verify SSL/TLS version used in Python MySQL connection

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do I add a column to my dataframe that says what sheet name each row is from? Python

Tags:

python

python-3.x

pandas

excel

jpk

People also ask

1 Answers

Update

Chris Adams

Recent Activity

Donate For Us