How can I use cumsum within a group in Pandas?

Tags:

I have

df = pd.DataFrame.from_dict({'id': ['A', 'B', 'A', 'C', 'D', 'B', 'C'], 'val': [1,2,-3,1,5,6,-2], 'stuff':['12','23232','13','1234','3235','3236','732323']})    id   stuff  val 0  A      12    1 1  B   23232    2 2  A      13   -3 3  C    1234    1 4  D    3235    5 5  B    3236    6 6  C  732323   -2

I'd like to get running some of val for each id, so the desired output looks like this:

  id   stuff  val  cumsum 0  A      12    1   1 1  B   23232    2   2 2  A      13   -3   -2 3  C    1234    1   1 4  D    3235    5   5 5  B    3236    6   8 6  C  732323   -2  -1

This is what I tried:

df['cumsum'] = df.groupby('id').cumsum(['val'])

and

df['cumsum'] = df.groupby('id').cumsum(['val'])

This is the error I got:

ValueError: Wrong number of items passed 0, placement implies 1

950

asked Sep 29 '15 15:09

Baron Yugovich

1 Answers

You can call transform and pass the cumsum function to add that column to your df:

In [156]: df['cumsum'] = df.groupby('id')['val'].transform(pd.Series.cumsum) df  Out[156]:   id   stuff  val  cumsum 0  A      12    1       1 1  B   23232    2       2 2  A      13   -3      -2 3  C    1234    1       1 4  D    3235    5       5 5  B    3236    6       8 6  C  732323   -2      -1

With respect to your error, you can't call cumsum on a Series groupby object, secondly you're passing the name of the column as a list which is meaningless.

So this works:

In [159]: df.groupby('id')['val'].cumsum()  Out[159]: 0    1 1    2 2   -2 3    1 4    5 5    8 6   -1 dtype: int64

answered Oct 03 '22 00:10

EdChum

Related questions
                            
                                How to remove trailing whitespace in PyDev plugin for Eclipse
                            
                                Django Passing data between views
                            
                                selecting across multiple columns with python pandas?
                            
                                Python csv.DictReader: parse string?
                            
                                How to import functions from other projects in Python?
                            
                                How to limit mongo query in python
                            
                                Is there an easy way to convert ISO 8601 duration to timedelta?
                            
                                What does " -r " do in pip install -r requirements.txt
                            
                                python's webbrowser launches IE, instead of default browser, on Windows relative path
                            
                                Calling private function within the same class python
                            
                                Pandas: Combining Two DataFrames Horizontally [duplicate]
                            
                                Python - Flask Default Route possible?
                            
                                Python debugger tells me value of Numpy array is "*** Newest frame"
                            
                                What is the difference between "a is b" and "id(a) == id(b)" in Python?
                            
                                Python Implementation of Viterbi Algorithm
                            
                                Testing for positive infinity, or negative infinity, individually in Python
                            
                                How to avoid overlapping of labels & autopct in a matplotlib pie chart?
                            
                                Can't find msguniq. Make sure you have GNU gettext tools 0.15 or newer installed. (Django 1.8 and OSX ElCapitan)
                            
                                Django template filters, tags, simple_tags, and inclusion_tags
                            
                                moment.calendar() without the time

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I use cumsum within a group in Pandas?

Tags:

python

pandas

dataframe

group-by

cumsum

Baron Yugovich

People also ask

1 Answers

EdChum

Recent Activity

Donate For Us