Python: How to add specific columns of .mean to dataframe

Tags:

How can I add the means of b and c to my dataframe? I tried a merge but it didn't seem to work. So I want two extra columns b_mean and c_mean added to my dataframe with the results of df.groupBy('date').mean()

DataFrame

  a  b  c  date
0  2  3  5     1
1  5  9  1     1
2  3  7  1     1

I have the following code

import pandas as pd

a = [{'date': 1,'a':2, 'b':3, 'c':5}, {'date':1, 'a':5, 'b':9, 'c':1}, {'date':1, 'a':3, 'b':7, 'c':1}]

df = pd.DataFrame(a)

x =  df.groupby('date').mean()

Edit:

Desired output would be the following df.groupby('date').mean() returns:

             a         b         c
date                              
1     3.333333  6.333333  2.333333

My desired result would be the following data frame

   a  b  c  date  a_mean   b_mean
0  2  3  5     1  3.3333   6.3333
1  5  9  1     1  3.3333   6.3333 
2  3  7  1     1  3.3333   6.3333

824

asked Mar 26 '17 22:03

John Decker

1 Answers

As @ayhan mentioned, you can use pd.groupby.transform() for this. Transform is like apply, but it uses the same index as the original dataframe instead of the unique values in the column(s) grouped on.

df['a_mean'] = df.groupby('date')['a'].transform('mean')
df['b_mean'] = df.groupby('date')['b'].transform('mean')

>>> df
   a  b  c  date    b_mean    a_mean
0  2  3  5     1  6.333333  3.333333
1  5  9  1     1  6.333333  3.333333
2  3  7  1     1  6.333333  3.333333

137

answered Sep 23 '22 12:09

3novak

Related questions
                            
                                How can I remove a widget in kivy?
                            
                                NumPy boolean array warning?
                            
                                portable way to write csv file in python 2 or python 3
                            
                                Difference between Python 2 and 3 for shuffle with a given seed
                            
                                Multiple stacked bar plot with pandas
                            
                                How to check if character exists in DataFrame cell
                            
                                pandas convert text feature to numeric value
                            
                                type conversion in python from float to int
                            
                                Problems with updating anaconda and installing new packages
                            
                                Write Python OrderedDict to CSV
                            
                                Python- Why is my Paho Mqtt Message Different Than When I Sent It?
                            
                                Add text next to vertical line in matplotlib
                            
                                Generate random numbers from lognormal distribution in python
                            
                                How to make action logging in Django with Django Rest Framework
                            
                                appending values to dictionary in for loop
                            
                                matplotlib scatterplot with legend
                            
                                numpy savetxt is not adding comma delimiter
                            
                                Iterate over numpy with index (numpy equivalent of python enumerate)
                            
                                seaborn heatmap color scheme based on row values
                            
                                Why is checking isinstance(something, Mapping) so slow?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python: How to add specific columns of .mean to dataframe

Tags:

python

pandas

dataframe

John Decker

People also ask

1 Answers

3novak

Recent Activity

Donate For Us