Filtering a multi-index

Tags: python, pandas, multi-index

C1	C2	C3	C4
A	12	True	89
	9	False	77
	5	True	23
B	9	True	45
	5	True	45
	2	False	78
C	11	True	10
	8	False	08
	12	False	09

C1 & C2 are the multi-index. I want to keep only the C1 groups that contain at least one C2 value lower than 10 and at least one C2 value greater than or equal to 10.

So in the table above, C1 = B should be dropped, and the final result should look like this:

C1	C2	C3	C4
A	12	True	89
	9	False	77
	5	True	23
C	11	True	10
	8	False	08
	12	False	09

I tried df.loc[(df.C2 < 10 ) & (df.C2 >= 10)] but this didn't work.

I also tried:

filter1 = df.index.get_level_values('C2') < 10
filter2 = df.index.get_level_values('C2') >= 10

df.iloc[filter1 & filter2]

I saw this suggested on another post, but it also didn't work. Does anyone know how to solve this? Thanks

asked Dec 14 '21 10:12

Wagsforever

1 Answer

Use GroupBy.transform with GroupBy.any to test whether at least one row in each group matches each condition, then filter the DataFrame with the resulting mask m:

filter1 = df.index.get_level_values('C2') < 10 
filter2 = df.index.get_level_values('C2') >= 10

m = (df.assign(filter1= filter1, filter2=filter2)
       .groupby(level=0)[['filter1','filter2']]
       .transform('any'))

print (m)
       filter1  filter2
C1 C2                  
A  12     True     True
   9      True     True
   5      True     True
B  9      True    False
   5      True    False
   2      True    False
C  11     True     True
   8      True     True
   12     True     True

df = df[m.filter1 & m.filter2]
print (df)
          C3  C4
C1 C2           
A  12   True  89
   9   False  77
   5    True  23
C  11   True  10
   8   False   8
   12  False   9
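
For anyone who wants to run this end to end, here is a minimal self-contained sketch of the same transform('any') idea. The DataFrame is rebuilt from the table in the question, and the names below, at_or_above and result are only illustrative, not part of the original answer:

```python
import pandas as pd

# Rebuild the sample data from the question; C1 and C2 form the MultiIndex.
df = pd.DataFrame(
    {
        "C1": list("AAABBBCCC"),
        "C2": [12, 9, 5, 9, 5, 2, 11, 8, 12],
        "C3": [True, False, True, True, True, False, True, False, False],
        "C4": [89, 77, 23, 45, 45, 78, 10, 8, 9],
    }
).set_index(["C1", "C2"])

c2 = df.index.get_level_values("C2")

# Row-wise conditions, wrapped in Series that share df's index.
below = pd.Series(c2 < 10, index=df.index)
at_or_above = pd.Series(c2 >= 10, index=df.index)

# Broadcast "does any row in this C1 group match?" back to every row.
has_below = below.groupby(level="C1").transform("any")
has_at_or_above = at_or_above.groupby(level="C1").transform("any")

# Keep only groups where both conditions are met somewhere in the group.
result = df[has_below & has_at_or_above]
print(result)
```

The original attempt returns nothing because no single row can be both below 10 and at least 10; the conditions have to be evaluated per C1 group rather than per row.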

Alternative solution:

filter1 = df.index.get_level_values('C2') < 10 
filter2 = df.index.get_level_values('C2') >= 10

lvl1 = df.index[filter1].remove_unused_levels().levels[0]
lvl2 = df.index[filter2].remove_unused_levels().levels[0]

# Index.intersection returns an Index; recent pandas versions reject a set as an indexer
df1 = df.loc[lvl1.intersection(lvl2)]
print (df1)
          C3  C4
C1 C2           
A  12   True  89
   9   False  77
   5    True  23
C  11   True  10
   8   False   8
   12  False   9
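
Because the two conditions here are complementary (< 10 and >= 10), a group qualifies exactly when its smallest C2 is below 10 and its largest C2 is at least 10. A rough sketch of that shortcut, assuming the same sample data as in the question (the names stats, keep and result are made up for illustration):

```python
import pandas as pd

# Rebuild the sample data from the question; C1 and C2 form the MultiIndex.
df = pd.DataFrame(
    {
        "C1": list("AAABBBCCC"),
        "C2": [12, 9, 5, 9, 5, 2, 11, 8, 12],
        "C3": [True, False, True, True, True, False, True, False, False],
        "C4": [89, 77, 23, 45, 45, 78, 10, 8, 9],
    }
).set_index(["C1", "C2"])

# Per-group minimum and maximum of the C2 index level.
stats = df.reset_index().groupby("C1")["C2"].agg(["min", "max"])

# Groups that have a value below 10 and a value at or above 10.
keep = stats.index[(stats["min"] < 10) & (stats["max"] >= 10)]

# Select rows whose C1 label is in the kept groups; row order is preserved.
result = df[df.index.get_level_values("C1").isin(keep)]
print(result)
```

This only works because the thresholds split the values into two halves; for unrelated conditions the transform('any') approach is the more general one.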

answered Nov 03 '22 14:11

jezrael
