Pandas DataFrame - delete rows that have same value at a particular column as a previous row

Question

I have a pandas dataframe, I want to check for each row if it has the same value at a particular column(let's call it porduct_type), and if it does, delete it. In other words, out of a group of consecutive rows with the same value at a particular column, I want to keep only one.

Example, if column A is the one on which we don't want consecutive duplicates:

DSM · Accepted Answer

It's a little tricky, but you could do something like

>>> df.groupby((df["A"] != df["A"].shift()).cumsum().values).first()
   A   B    C
1  0   1    1
2  2   1   10
3  0  11  100
4  5   2  200

Pandas DataFrame - delete rows that have same value at a particular column as a previous row

Tags:

python

pandas

dataframe

Baron Yugovich

1 Answers

DSM

Recent Activity

Donate For Us

Pandas DataFrame - delete rows that have same value at a particular column as a previous row

Tags:

python

pandas

dataframe

Baron Yugovich

1 Answers

DSM

Related questions

Recent Activity

Donate For Us