Python pandas equivalent to R groupby mutate

Tags:

So in R when I have a data frame consisting of say 4 columns, call it df and I want to compute the ratio by sum product of a group, I can it in such a way:

// generate data df = data.frame(a=c(1,1,0,1,0),b=c(1,0,0,1,0),c=c(10,5,1,5,10),d=c(3,1,2,1,2)); | a   b   c    d | | 1   1   10   3 | | 1   0   5    1 | | 0   0   1    2 | | 1   1   5    1 | | 0   0   10   2 | // compute sum product ratio df = df%>% group_by(a,b) %>%       mutate(           ratio=c/sum(c*d)       ); | a   b   c    d  ratio | | 1   1   10   3  0.286 | | 1   1   5    1  0.143 | | 1   0   5    1  1     | | 0   0   1    2  0.045 | | 0   0   10   2  0.454 |

But in python I need to resort to loops. I know there should be a more elegant way than raw loops in python, anyone got any ideas?

436

asked Dec 02 '16 01:12

asosnovsky

1 Answers

It can be done with similar syntax with groupby() and apply():

df['ratio'] = df.groupby(['a','b'], group_keys=False).apply(lambda g: g.c/(g.c * g.d).sum())

enter image description here

132

answered Sep 25 '22 08:09

Psidom

Related questions
                            
                                Pandas: Filtering multiple conditions
                            
                                Python Keep other columns when using sum() with groupby
                            
                                Python 3.6 Installation failed
                            
                                Django upload_to outside of MEDIA_ROOT
                            
                                Sequence find function in Python
                            
                                Python Remove last char from string and return it
                            
                                Sorting A List Comprehension In One Statement
                            
                                Groupby Pandas DataFrame and calculate mean and stdev of one column and add the std as a new column with reset_index
                            
                                Understanding the Python 'with' statement
                            
                                How to add space between two widgets placed in grid in tkinter ~ python?
                            
                                How to check if float pandas column contains only integer numbers?
                            
                                In vscode using Python, ctrl+F5 always asks for "select environment"
                            
                                How to seed Django project ? - insert a bunch of data into the project for initialization
                            
                                Defining constants in python class, is self really needed?
                            
                                Is there a convenient way to map a file uri to os.path?
                            
                                why is xrange able to go back to beginning in Python?
                            
                                How do I import a pre-existing python project into Eclipse?
                            
                                CSV read specific row
                            
                                Matplotlib with annotation cut off from the saved figure
                            
                                Matplotlib : display array values with imshow

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python pandas equivalent to R groupby mutate

Tags:

python

pandas

r

dplyr

asosnovsky

People also ask

1 Answers

Psidom

Recent Activity

Donate For Us