pandas rolling cumsum over the trailing n elements

Tags: pandas, cumsum
Using pandas, what is the easiest way to calculate a rolling sum over the trailing n elements, for instance to calculate trailing three-day sales?

import numpy
import pandas

df = pandas.Series(numpy.random.randint(0, 10, 10), index=pandas.date_range('2020-01', periods=10))
df
2020-01-01    8
2020-01-02    4
2020-01-03    1
2020-01-04    0
2020-01-05    5
2020-01-06    8
2020-01-07    3
2020-01-08    8
2020-01-09    9
2020-01-10    0
Freq: D, dtype: int64

Desired output:

2020-01-01     8
2020-01-02    12
2020-01-03    13
2020-01-04     5
2020-01-05     6
2020-01-06    13
2020-01-07    16
2020-01-08    19
2020-01-09    20
2020-01-10    17
Freq: D, dtype: int64

805

asked May 27 '17 21:05

CarlosE

1 Answer

You need rolling.sum:

df.rolling(3, min_periods=1).sum()
Out: 
2020-01-01     8.0
2020-01-02    12.0
2020-01-03    13.0
2020-01-04     5.0
2020-01-05     6.0
2020-01-06    13.0
2020-01-07    16.0
2020-01-08    19.0
2020-01-09    20.0
2020-01-10    17.0
dtype: float64

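min_periods=1 ensures the first two elements are calculated too, using whatever values are available. With an integer window size of 3, min_periods defaults to the window size, so the first two elements would otherwise be NaN. The result is float64 because rolling.sum returns floats; since no NaN remains here, it can be cast back with .astype(int) to match the desired output.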
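To make the effect of min_periods concrete, here is a minimal sketch that hard-codes the values from the question so the result is reproducible. The variable names (sales, strict, trailing, by_days) are illustrative only, and the "3D" offset window is an extra variant that is not part of the original answer; it assumes the index is a DatetimeIndex with one row per day.

```python
import pandas as pd

# Values copied from the question so the output does not depend on random data.
sales = pd.Series(
    [8, 4, 1, 0, 5, 8, 3, 8, 9, 0],
    index=pd.date_range("2020-01-01", periods=10),
)

# Integer window with the default min_periods (equal to the window size):
# the first two entries have fewer than 3 observations and come out as NaN.
strict = sales.rolling(3).sum()

# min_periods=1 sums whatever is available in the window.
# No NaN remains, so casting back to int is safe.
trailing = sales.rolling(3, min_periods=1).sum().astype(int)

# Variant (assumption: daily DatetimeIndex): an offset window of three days.
# For offset windows, min_periods defaults to 1.
by_days = sales.rolling("3D").sum().astype(int)

print(trailing.tolist())
# [8, 12, 13, 5, 6, 13, 16, 19, 20, 17]
```

The integer window counts rows, while the offset window counts elapsed time, so the two only agree when there is exactly one row per day. If some days are missing, rolling(3) still sums the last three rows, whereas rolling("3D") sums only the rows that fall within the last three days.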

129

answered Oct 28 '22 06:10

ayhan
