In pandas, how to find the row/index where the cumulative sum is greater than a threshold?

Question

I would like find the row (index) where the cumulative sum of the values in some column exceeds a threshold.

I can, and do, find this location using a simple loop, like below:

def sum_to(df, col, threshold):
    s = 0
    for r in df.iterrows():
        if s + r[1][col] > threshold:
            return r[0]
        else:
            s += r[1][col]

    return len(df)

However, I would like to know if there is a better/nicer way to achieve this in Pandas.

Isaac · Accepted Answer

The simplest way is probably

df[col].cumsum().searchsorted(threshold)

but this assumes that you have no negative numbers in your column.

In pandas, how to find the row/index where the cumulative sum is greater than a threshold?

Tags:

python

pandas

mibm

1 Answers

Isaac

Recent Activity

Donate For Us

In pandas, how to find the row/index where the cumulative sum is greater than a threshold?

Tags:

python

pandas

mibm

1 Answers

Isaac

Related questions

Recent Activity

Donate For Us