Rolling grouped cumulative sum

Tags:

pandas

pandas-groupby

I'm looking to create a rolling grouped cumulative sum. I can get the result via iteration, but wanted to see if there was a more intelligent way.

Here's what the source data looks like:

Here is the desired result:

982

asked Mar 04 '18 11:03

decipher

1 Answers

This is a very interesting problem. Try below to see if it works for you.

(
    pd.concat([df.loc[df.Per<=i][['C','V']].assign(Per=i) for i in df.Per.unique()])
    .groupby(by=['Per','C'])
    .sum()
    .reset_index()
)

Out[197]: 
    Per  C   V
0     1  a   4
1     1  c   4
2     2  a  10
3     2  b   5
4     2  c   4
5     3  a  10
6     3  b   5
7     3  c   4
8     3  j   7
9     4  a  19
10    4  b   5
11    4  c   4
12    4  j   7
13    4  x  11
14    5  a  21
15    5  b   5
16    5  c   4
17    5  j   7
18    5  x  11
19    6  a  21
20    6  b   5
21    6  c   7
22    6  j   7
23    6  k   6
24    6  x  11

164

answered Oct 05 '22 02:10

Allen

Related questions
                            
                                random sampling with Pandas data frame disjoint groups
                            
                                Python - easy way to "comparison" map one array to another
                            
                                Pandas: Resample dataframe column, get discrete feature that corresponds to max value
                            
                                Pandas Crosstabulation and counting
                            
                                pandas display: truncate column display rather than wrapping
                            
                                Find annual average of pandas dataframe with date column
                            
                                Add extra column as the cumulative time difference
                            
                                Finding duplicate rows in a Pandas Dataframe then Adding a column in the Dataframe that states if the row is a duplicate
                            
                                Consecutive NaN larger than threshold in Pandas DataFrame
                            
                                Replicating SAS' first and last functionality with Python
                            
                                Python Pandas: Denormalize data from one data frame into another
                            
                                Pandas Get Day of Week from date type column
                            
                                Group python pandas dataframe per weeks (starting on Monday)
                            
                                How to append new dataframe rows to a csv using pandas?
                            
                                Pandas Loc select by index as well as boolean condition in single expression
                            
                                How can I use a custom function within an expression using the eval dataframe method?
                            
                                Python Replace Whole Values in Dataframe String and Not Substrings
                            
                                REGEX filter with Pandas (any numeric combination followed by 'plus' sign)
                            
                                Calculating event based on the continuous timestep
                            
                                Count each group sequentially pandas

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With