I have a pandas DataFrame like this: <pre class="prettyprint"><code>>>> df = pd.DataFrame({'MONTREGL':[10,10,2222,35,200,56,5555],'SINID':['aaa','aaa','aaa','bbb','bbb','ccc','ccc'],'EXTRA':[400,400,400,500,500,333,333]}) >>> df MONTREGL SINID EXTRA 0 10 aaa 400 1 10 aaa 400 2 2222 aaa 400 3 35 bbb 500 4 200 bbb 500 5 56 ccc 333 6 5555 ccc 333 </code></pre> I want to sum the column <code>MONTREGL</code> for each groupby <code>SINID</code>... So I get 2242 for aaa and so on... ALSO I want to keep the value of column <code>EXTRA</code>. This is the expected result: <pre class="prettyprint"><code> MONTREGL SINID EXTRA 0 2242 aaa 400 1 235 bbb 500 2 5611 ccc 333 </code></pre> Thanks for your help in advance!

I ended up using this script: <pre class="prettyprint"><code>dff = df.groupby(["SINID","EXTRA"]).MONTREGL.sum().reset_index() </code></pre> And it works in this test and production.

Sum column based on another column in Pandas DataFrame

Tags:

python

pandas

dataframe

I have a pandas DataFrame like this:

>>> df = pd.DataFrame({'MONTREGL':[10,10,2222,35,200,56,5555],'SINID':['aaa','aaa','aaa','bbb','bbb','ccc','ccc'],'EXTRA':[400,400,400,500,500,333,333]})
>>> df
   MONTREGL SINID EXTRA
0        10   aaa   400
1        10   aaa   400
2      2222   aaa   400
3        35   bbb   500
4       200   bbb   500
5        56   ccc   333
6      5555   ccc   333

I want to sum the column MONTREGL for each groupby SINID...

So I get 2242 for aaa and so on... ALSO I want to keep the value of column EXTRA.

This is the expected result:

   MONTREGL SINID EXTRA
0      2242   aaa   400
1       235   bbb   500
2      5611   ccc   333

Thanks for your help in advance!

371

asked May 29 '19 12:05

Soufiane Sabiri

1 Answers

I ended up using this script:

dff = df.groupby(["SINID","EXTRA"]).MONTREGL.sum().reset_index()

And it works in this test and production.

106

answered Oct 12 '22 19:10

Soufiane Sabiri

Related questions
                            
                                compare a list with values in dictionary
                            
                                Modify seaborn line relplot legend title
                            
                                Dataflow/apache beam - how to access current filename when passing in pattern?
                            
                                Rename the less frequent categories by "OTHER" python
                            
                                Python error when building Python package Docker Image
                            
                                Percentage of array between values
                            
                                AttributeError: 'int' object has no attribute 'lower' in TFIDF and CountVectorizer
                            
                                Parallel loading of Input Files in Pandas Dataframe
                            
                                How to execute file.py on HTML button press using Django?
                            
                                sort Persian strings for python [duplicate]
                            
                                convert Dataframe to 2d Array
                            
                                More efficient method of finding minimum sum after k operations
                            
                                How To Call Postgres 11 Stored Procedure From Python
                            
                                Could not find a version that satisfies the requirement flask (from versions: ) No matching distribution found for flask
                            
                                Sum only numeric columns in pandas
                            
                                What is the process "python3 unattended upgrade shutdown"?
                            
                                Storing OAuth Token in Python Library
                            
                                Is it possible to sort a list with reduce?
                            
                                `try ... except not` construction
                            
                                COCO api evaluation for subset of classes

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Sum column based on another column in Pandas DataFrame

Tags:

python

pandas

dataframe

Soufiane Sabiri

People also ask

1 Answers

Soufiane Sabiri

Recent Activity

Donate For Us