If I have a multiindex dataframe: <pre class="prettyprint"><code>import pandas as pd df = pd.DataFrame([[1,2,3],[4,5,6],[7,8,9]],columns=['a','b','c']).set_index(['a','b']) </code></pre> I can simply filter the dataframe on a column, for example: <pre class="prettyprint"><code>df[df.c>4] </code></pre> But to do the same on the level of an index, say "b", I can't do: <pre class="prettyprint"><code>df[df.b>4] </code></pre> Instead I can do: <pre class="prettyprint"><code>df[df.index.get_level_values('b')>4] </code></pre> But is there a less verbose way to do this?

You can use <code>query</code> for better readability <pre class="prettyprint"><code>In [795]: df.query('b > 4') Out[795]: c a b 4 5 6 7 8 9 </code></pre>

Filtering on index levels in a pandas.DataFrame

Tags:

python

pandas

If I have a multiindex dataframe:

import pandas as pd
df = pd.DataFrame([[1,2,3],[4,5,6],[7,8,9]],columns=['a','b','c']).set_index(['a','b'])

I can simply filter the dataframe on a column, for example:

df[df.c>4]

But to do the same on the level of an index, say "b", I can't do:

df[df.b>4]

Instead I can do:

df[df.index.get_level_values('b')>4]

But is there a less verbose way to do this?

374

asked Jul 13 '17 18:07

dlm

1 Answers

You can use query for better readability

In [795]: df.query('b > 4')
Out[795]:
     c
a b
4 5  6
7 8  9

answered Oct 03 '22 22:10

Zero

Related questions
                            
                                Modifying timestamps in pandas to make index unique
                            
                                Django how to get multiple context_object_name for multiple queryset from single view to single template
                            
                                Python how to get value from argparse from variable, but not the name of the variable?
                            
                                Create a matrix from a vector where each row is a shifted version of the vector
                            
                                Deploying asgi and wsgi on Heroku
                            
                                How to play mp3 from bytes?
                            
                                cbind (R function) equivalent in numpy
                            
                                How to import and call a Python function in a Jinja template? [closed]
                            
                                Get keys of pandas.Series.value_counts
                            
                                How can I display the test name *after* the test using pytest?
                            
                                Convert array into percentiles
                            
                                why is that people use sqlalchemy CORE to save data and use sqlalchemy ORM to query data
                            
                                what is the difference between scipy.stats module and numpy.random module, between similar methods that both modules have?
                            
                                How to get list of values in ImageDataGenerator.flow_from_directory Keras?
                            
                                Unresolved reference when calling a global variable?
                            
                                Use scrapy to get list of urls, and then scrape content inside those urls
                            
                                Convert PyQt5 QPixmap to numpy ndarray
                            
                                Best Algorithm to make correction typos in text
                            
                                Expanding/Zooming in a numpy array
                            
                                Memory Sharing among workers in gunicorn using --preload

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With