How to iterate over MultiIndex levels in Pandas?

Tags:

I often have MultiIndex indices and I'd like to iterate over groups where higher level indices are equal. It basically looks like

from random import choice
import pandas as pd
N = 100
df = pd.DataFrame([choice([1, 2, 3]) for _ in range(N)],
                  columns=["A"],
                  index=pd.MultiIndex.from_tuples([(choice("ab"), choice("cd"), choice("de")) 
                                                   for _ in range(N)]))

for idx in zip(df.index.get_level_values(0), df.index.get_level_values(1)):
    df_select = df.ix[idx]

Is there a way to do the for loop iteration more neatly?

721

asked Dec 07 '15 17:12

Gerenuk

2 Answers

Use groupby. The index of the df_select view includes the first two level values, but otherwise is similar to your example.

for idx, df_select in df.groupby(level=[0, 1]):
    ...

170

answered Sep 21 '22 22:09

Mzzzzzz

Alternatively to groupby logic you can use a lambda function, which has the advantage of not having to specify the number of levels, i.e. it will pick all levels except the very last one:

for idx in df.index.map(lambda x: x[:-1]):
 df_select=df.ix[idx]

answered Sep 23 '22 22:09

rstreppa

Related questions
                            
                                python socket send immediately
                            
                                python axhline label not showing up in plot
                            
                                Python equivalent of Ruby's .select
                            
                                Apache Thrift Python 3 support
                            
                                What makes an element eligible for a set membership test in Python? [duplicate]
                            
                                What does `{...}` mean in the print output of a python variable?
                            
                                TMUX Session Won't Import Python Module
                            
                                How to get the visual length of a text string in python
                            
                                Correct way of unit testing __repr__ with dict
                            
                                Filter Pandas DataFrame for elements in list [duplicate]
                            
                                How to make a for loop either increasing or decreasing?
                            
                                compare two floats for equality in Python [duplicate]
                            
                                Fitting data to multimodal distributions with scipy, matplotlib
                            
                                PyQt5 error during "python3 configure.py": fatal error: 'qgeolocation.h' file not found
                            
                                How to create argument of type "list of pairs" with argparse?
                            
                                Dynamic default value setting for a flask form field
                            
                                Multiprocessing writing to pandas dataframe
                            
                                IPython Notebook - ShimWarning: The `IPython.kernel` package has been deprecated
                            
                                Pip hangs in Windows 7
                            
                                Field name `username` is not valid for model

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to iterate over MultiIndex levels in Pandas?

Tags:

python

pandas

dataframe

multi-index

Gerenuk

People also ask

2 Answers

Mzzzzzz

rstreppa

Recent Activity

Donate For Us