Remove group if contains record with status 300

Question

I would like to group records by ID from df and delete group if any of records has STATUS = 300.

import pandas as pd


df1 = pd.DataFrame(
    {
        "ID": ["A0", "A0", "A0", "A1", "A1", "A1", "A2", "A2", "A2"],
        "STATUS": [100, 100, 300, 100, 100, 100, 300, 100, 100],
    },
    index=[0, 1, 2, 3, 4, 5, 6, 7, 8],
)

output:

   ID  STATUS
0  A0     100
1  A0     100
2  A0     300
3  A1     100
4  A1     100
5  A1     100
6  A2     300
7  A2     100
8  A2     100

I would like to get:

   ID  STATUS
0  A1     100
1  A1     100
2  A1     100

I tried: dfnew = df1.groupby('ID').filter(lambda x: x['STATUS'] != 300)

But I got the error: TypeError: filter function returned a Series, but expected a scalar bool

RJ Adriaansen · Accepted Answer

df1.groupby('ID').filter(lambda x: 300 not in x['STATUS'].to_list())

mozway · Answer

An efficient method to match any value from a list (see OP's comment) is to use isin coupled with groupby+transform:

df1[~df1['STATUS'].isin([300, 500]).groupby(df1['ID']).transform('any')]

output:

   ID  STATUS
3  A1     100
4  A1     100
5  A1     100

Remove group if contains record with status 300

Tags:

python

pandas

datasciencebegginer

2 Answers

RJ Adriaansen

mozway

Recent Activity

Donate For Us

Remove group if contains record with status 300

Tags:

python

pandas

datasciencebegginer

2 Answers

RJ Adriaansen

mozway

Related questions

Recent Activity

Donate For Us