Pandas monthly rolling operation

Problem

Suppose we have a DataFrame, df, containing this data.

import pandas as pd from io import StringIO  data = StringIO( """\ date          spendings  category 2014-03-25    10         A 2014-04-05    20         A 2014-04-15    10         A 2014-04-25    10         B 2014-05-05    10         B 2014-05-15    10         A 2014-05-25    10         A """ )  df = pd.read_csv(data,sep="\s+",parse_dates=True,index_col="date")

Goal

For each row, sum the spendings over every row that is within one month of it, ideally using DataFrame.rolling as it's a very clean syntax.

What I have tried

df = df.rolling("M").sum()

But this throws an exception

ValueError: <MonthEnd> is a non-fixed frequency

version: pandas==0.19.2

204

asked Apr 22 '17 07:04

Filip Kilibarda

1 Answers

Use the "D" offset rather than "M" and specifically use "30D" for 30 days or approximately one month.

df = df.rolling("30D").sum()

Initially, I intuitively jumped to using "M" as I figured it stands for one month, but now it's clear why that doesn't work.

128

answered Sep 24 '22 22:09

Filip Kilibarda

Related questions
                            
                                Using ^ to match beginning of line in Python regex
                            
                                Printing the loss during TensorFlow training
                            
                                Reading a pickle file (PANDAS Python Data Frame) in R
                            
                                python equivalent of functools 'partial' for a class / constructor
                            
                                How to use await in a python lambda
                            
                                Google Colab is very slow compared to my PC
                            
                                How to mock os.walk in python with a temporary filesystem?
                            
                                Loading model with custom loss + keras
                            
                                WARNING: Ignoring invalid distribution -ip (c:\python39\lib\site-packages) How do I fix this and what does it mean? [duplicate]
                            
                                What is the way data is stored in *.npy?
                            
                                How do I use xml namespaces with find/findall in lxml?
                            
                                How to use custom AdminSite class?
                            
                                How do I see the Django debug toolbar?
                            
                                uwsgi throws IO error caused by uwsgi_response_write_body_do broken pipe
                            
                                Internal Server Error when using Flask session
                            
                                Python unittest.TestCase object has no attribute 'runTest'
                            
                                Nested validation with the flask-restful RequestParser
                            
                                when to use if vs elif in python
                            
                                Hide some maybe-no-member Pylint errors
                            
                                sklearn doesn't have attribute 'datasets'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas monthly rolling operation

Tags:

python

pandas

Problem

Goal

What I have tried

Filip Kilibarda

People also ask

1 Answers

Filip Kilibarda

Recent Activity

Donate For Us