How to drop row at certain index in every group in GroupBy object?

Question

I'm trying to drop a row at certain index in every group inside a GroupBy object.

The best I have been able to manage is:

import pandas as pd
    
x_train = x_train.groupby('ID')
x_train.apply(lambda x: x.drop([0], axis=0))

However, this doesn't work. I have spent a whole day on this to no solution, so have turned to stack.

Edit: A solution for any index value is needed as well

BENY · Accepted Answer

You can do it with cumcount

idx= x_train.groupby('ID').cumcount()
x_train = x_train[idx!=0]

Shaido · Answer

The problem with using drop inside the groupby is the index numbers are still the same as before the groupby. So when using drop([0]), only the row that originally had 0 as index will be dropped. In the other groups, there will not be any row with index 0 as long as the index is unique.

If you want to use drop then what you can do is to first use reset_index inside the grouped data:

x_train.groupby('ID').apply(lambda x: x.reset_index().drop([0]))

If you want to use drop then what you can do is to first use reset_index inside the grouped data:

x_train.groupby('ID').apply(lambda x: x.reset_index().drop([0]))

How to drop row at certain index in every group in GroupBy object?

Tags:

python

pandas

dataframe

pandas-groupby

Aayush Panda

2 Answers

BENY

Shaido

Recent Activity

Donate For Us

How to drop row at certain index in every group in GroupBy object?

Tags:

python

pandas

dataframe

pandas-groupby

Aayush Panda

2 Answers

BENY

Shaido

Related questions

Recent Activity

Donate For Us