Pandas - How to check if multi index column exists

Tags:

python

pandas

My question is similar to How to check if a column exists in Pandas but for the multi-index column case.

I'm trying to process values in a multi index column dataframe using column names originating in another file - hence the need to check if the column exists. A representative example is below:

import pandas as pd
from numpy.random import randint,randn

df = pd.DataFrame({ 'A': [randint(0,3) for p in range(0,12)],'B': [0.1* randint(0,3) for p in range(0,12)],
      'C': [0.1*randint(0,3) for p in range(0,12)],'D': randn(12),
    })

df1 = df.groupby(['A','B','C']).D.sum().unstack(-1)
df1 = df1.T
df1
A           0                   1                             2          
B         0.0       0.2       0.0       0.1       0.2       0.0       0.1
C                                                                        
0.0       NaN       NaN       NaN  0.845316       NaN  0.555513       NaN
0.1       NaN  0.139371       NaN       NaN       NaN       NaN -0.260868
0.2  5.002509       NaN  0.637353  0.438863  0.943098       NaN       NaN

df1[1][0.1]
C
0.0    0.845316
0.1         NaN
0.2    0.438863

Accessing df1[0][0.1] in the above example will result in a key error. How do I check if a multi index column exists, so that non-existent columns can be skipped during processing?

Thanks!

740

asked May 22 '16 22:05

kip6000

1 Answers

You can think of a multi index like an array of tuples, so can access like:

df1[(0, 0.1)]

and test like:

(0, 0.1) in df1.columns:

124

answered Sep 27 '22 16:09

Mr.F

Related questions
                            
                                Using Py_BuildValue() to create a list of tuples in C
                            
                                How to make a type hint forward reference
                            
                                python BeautifulSoup get all href in Children of div
                            
                                sklearn SVM fit() "ValueError: setting an array element with a sequence"
                            
                                Iterate over a dict except for x item items
                            
                                Flask app wrapped with DispatcherMiddleware no longer has test_client
                            
                                How to (properly) use external credentials in an AWS Lambda function?
                            
                                No handlers could be found for logger "__main__"
                            
                                Open BytesIO (xlsx) with xlrd
                            
                                Understanding Matplotlib's quiver plotting
                            
                                Dictionary Comprehension for list values
                            
                                How to assign and use column headers in Spark?
                            
                                Specifying default dtype for np.array(1.)
                            
                                How to erase line from text file in Python?
                            
                                How do you merge the master branch into a feature branch with GitPython?
                            
                                Add an autoincrementing ID column to an existing table with Sqlite
                            
                                How can I implement a recursive neural network in TensorFlow?
                            
                                DefaultRouter class not creating API root view for all apps in python
                            
                                Creating log directory in tensorboard
                            
                                How to download images from BeautifulSoup?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With