How do you update the levels of a pandas MultiIndex after slicing its DataFrame?

Tags:

I have a Dataframe with a pandas MultiIndex:

In [1]: import pandas as pd In [2]: multi_index = pd.MultiIndex.from_product([['CAN','USA'],['total']],names=['country','sex']) In [3]: df = pd.DataFrame({'pop':[35,318]},index=multi_index) In [4]: df Out[4]:                pop country sex CAN     total   35 USA     total  318

Then I remove some rows from that DataFrame:

In [5]: df = df.query('pop > 100')  In [6]: df Out[6]:                pop country sex USA     total  318

But when I consult the MutliIndex, it still has both countries in its levels.

In [7]: df.index.levels[0] Out[7]: Index([u'CAN', u'USA'], dtype='object')

I can fix this myself in a rather strange way:

In [8]: idx_names = df.index.names  In [9]: df = df.reset_index(drop=False)  In [10]: df = df.set_index(idx_names)  In [11]: df Out[11]:                pop country sex USA     total  318  In [12]: df.index.levels[0] Out[12]: Index([u'USA'], dtype='object')

But this seems rather messy. Is there a better way I'm missing?

525

asked Feb 27 '15 19:02

Kyle Heuton

1 Answers

From version pandas 0.20.0+ use MultiIndex.remove_unused_levels:

print (df.index) MultiIndex(levels=[['CAN', 'USA'], ['total']],            labels=[[1], [0]],            names=['country', 'sex'])  df.index = df.index.remove_unused_levels()  print (df.index) MultiIndex(levels=[['USA'], ['total']],            labels=[[0], [0]],            names=['country', 'sex'])

179

answered Oct 12 '22 10:10

jezrael

Related questions
                            
                                Cannot import requests.packages.urllib3.util 'Retry'
                            
                                How do you put environmental variables in web.config?
                            
                                Template specialization and enable_if problems [duplicate]
                            
                                What is the difference between String initializations by new String() and new String("") in Java?
                            
                                Check whether domain is registered
                            
                                RStudio: git add --all from the UI
                            
                                Should Django migrations live in source control?
                            
                                HTML-Email with inline attachments and non-inline attachments
                            
                                Cancel button is not shown in UISearchBar
                            
                                Pandas error - invalid value encountered
                            
                                Why can't I move the std::unique_ptr inside lambda in C++14?
                            
                                Why does git log not show anything new after git fetch?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With