Running sum in pandas (without loop)

Tags:

python

pandas

I'd like to build a running sum over a pandas dataframe. I have something like:

10/10/2012:  50,  0
10/11/2012: -10, 90
10/12/2012: 100, -5

And I would like to get:

10/10/2012:  50,  0
10/11/2012:  40, 90
10/12/2012: 140, 85

So every cell should be the sum of itself and all previous cells, how should I do this without using a loop.

628

asked Dec 14 '12 12:12

leo

1 Answers

As @JonClements mentions, you can do this using the cumsum DataFrame method:

from pandas import DataFrame
df = DataFrame({0: {'10/10/2012': 50, '10/11/2012': -10, '10/12/2012': 100}, 1: {'10/10/2012': 0, '10/11/2012': 90, '10/12/2012': -5}})

In [3]: df
Out[3]: 
              0   1
10/10/2012   50   0
10/11/2012  -10  90
10/12/2012  100  -5

In [4]: df.cumsum()
Out[4]: 
              0   1
10/10/2012   50   0
10/11/2012   40  90
10/12/2012  140  85

answered Oct 21 '22 06:10

Andy Hayden

Related questions
                            
                                google.auth.exceptions.DefaultCredentialsError:
                            
                                Seaborn lineplot high cpu; very slow compared to matplotlib
                            
                                selenium.common.exceptions.WebDriverException: Message: invalid session id using Selenium with ChromeDriver and Chrome through Python
                            
                                Filter data in pytorch tensor
                            
                                pdb step into a function when already in pdb mode
                            
                                Why do I get an 'Unhandled exception in event loop' error on ipython
                            
                                pd.NA vs np.nan for pandas
                            
                                HTML to IMAGE using Python
                            
                                Reading numeric Excel data as text using xlrd in Python
                            
                                Paramiko SSH exec_command (shell script) returns before completion
                            
                                Plotting mplot3d / axes3D xyz surface plot with log scale?
                            
                                Matplotlib svg as string and not a file
                            
                                What's difference between a simple webserver and Apache server?
                            
                                How would you properly break this line to match pep8 rules?
                            
                                When are objects garbage collected in python?
                            
                                How do you see the return value from a function in the Python debugger, without an intermediate?
                            
                                Is there a Spock-like testing library for Python
                            
                                Goodness of fit tests in SciPy
                            
                                New to flask and Flask-Login - ImportError: No module named login
                            
                                Array elementwise operations

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With