Group by one columns and find sum and max value for another in pandas

Tags:

I have a dataframe like this:

Name  id  col1  col2  col3  cl4 
PL    252  0     747   3     53  
PL2   252  1     24    2     35 
PL3   252  4     75    24    13 
AD    889  53    24    0     95 
AD2   889  23    2     0     13  
AD3   889  0     24    3     6  
BG    024  12    89    53    66 
BG1   024  43    16    13    0   
BG2   024  5     32    101   4

And now I need to group by ID, and for columns col1 and col4 find the sum for each id and put that into a new column near to parent column (example: col3(sum)) But for col2 and col3 find max value. Desired output:

Name  id  col1 col1(sum) col2 col2(max) col3 col(max) col4 col4(sum)
PL    252  0       5      747    747     3     24    6    18
PL2   252  1       5      24     747     2     24    12   18
PL3   252  4       5      75     747     24    24    0    18
AD    889  53      76     24     24      95    95    23   33
AD2   889  23      76     2      24      13    95    5    33
AD3   889  0       76     24     24      6     95    5    33
BG    024  12      60     89     89      66    66    0    67   
BG1   024  43      60     16     89      0     66    63   67    
BG2   024  5       60     32     89      4     66    4    67

What is the easiest and fastest way to calculate this?

992

asked Jun 23 '17 14:06

jovicbg

2 Answers

The most (pandas) native way to do this, is to use the .agg() method that allows you to specify the aggregation function you want to apply per column (just like you would do in SQL).

Sample from the documentation:

df.groupby('A').agg({'B': ['min', 'max'], 'C': 'sum'})

138

answered Oct 16 '22 19:10

Maresh

You can use merge when you have groupby and sum on id :

pd.merge(df,df.groupby("id").sum().reset_index(), on='id',how='outer')

output

enter image description here

answered Oct 16 '22 18:10

Tbaki

Related questions
                            
                                How to create all tables defined in models using peewee
                            
                                Removing backslashes from string
                            
                                Min-max normalisation of a NumPy array
                            
                                Extracting specific columns from pandas.dataframe
                            
                                How to cleanly loop over two files in parallel in Python
                            
                                Where is my local App Engine datastore?
                            
                                How to display index during list iteration with Django?
                            
                                break two for loops [duplicate]
                            
                                Flask: IOError when saving uploaded files
                            
                                R, Python: install packages on rpy2
                            
                                How to start daemon process from python on windows?
                            
                                Why accessing to class variable from within the class needs "self." in Python? [duplicate]
                            
                                Get attribute names and values from ElementTree
                            
                                How to read an array of integers from single line of input in python3
                            
                                What's the advantage of a trailing underscore in Python naming?
                            
                                Split large text file(around 50GB) into multiple files
                            
                                Disable images in Selenium Python
                            
                                How to find all ordered pairs of elements in array of integers whose sum lies in a given range of value
                            
                                Django 1.6: How to access static files in view
                            
                                Find all locations / cities / places in a text

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Group by one columns and find sum and max value for another in pandas

Tags:

python

pandas

dataframe

group-by

jovicbg

People also ask

2 Answers

Maresh

Tbaki

Recent Activity

Donate For Us