Pandas Dataframe: For loop that adds a fixed integer if the value exists in previous rows

Question

for the following dataframe

df = pd.DataFrame({'Rounds':[1000,1000,1000,1000,3000,3000,4000,5000,6000,6000]})

I would like to have a for loop that if the value already exists in previous rows, a fixed int, in this case 25, is added to the value and creates:

df = pd.DataFrame({'Rounds':[1000,1025,1050,1075,3000,3025,4000,5000,6000,6025]})

Initially I tried

for i in df.index:
    if df.iat[i,1] == df.iloc[i-1,1]:
        df.iat[i,1] = df.iat[i-1,1]+25

The problem is that it doesn't work for more than two similar values in a column and I would like to give column name "Rounds" instead of the index of column.

mozway · Accepted Answer

You need groupby.cumcount:

df['Rounds'] += df.groupby('Rounds').cumcount()*25

output:

intermediate:

df.groupby('Rounds').cumcount()

0    0
1    1
2    2
3    3
4    0
5    1
6    0
7    0
8    0
9    1
dtype: int64

Pandas Dataframe: For loop that adds a fixed integer if the value exists in previous rows

Tags:

python

loops

for-loop

pandas

dataframe

kokoroko

1 Answers

mozway

Recent Activity

Donate For Us

Pandas Dataframe: For loop that adds a fixed integer if the value exists in previous rows

Tags:

python

loops

for-loop

pandas

dataframe

kokoroko

1 Answers

mozway

Related questions

Recent Activity

Donate For Us