Group a Pandas DataFrame, calculate the mean and std of one column, and add the std as a new column with reset_index

Tags:

pandas

I have a Pandas DataFrame as below:

   a      b      c      d
0  Apple  3      5      7
1  Banana 4      4      8
2  Cherry 7      1      3
3  Apple  3      4      7

I would like to group the rows by column 'a', replace the values in column 'c' with the mean of each group, and add another column holding the standard deviation of the column 'c' values that went into that mean. The values in columns 'b' and 'd' are constant within each group. The desired output is:

   a      b      c      d      e
0  Apple  3      4.5    7      0.707107
1  Banana 4      4      8      0
2  Cherry 7      1      3      0

What is the best way to achieve this?


asked Oct 28 '14 01:10

kkhatri99

1 Answer

You could use a groupby-agg operation:

In [38]: result = df.groupby(['a'], as_index=False).agg(
   ....:     {'c': ['mean', 'std'], 'b': 'first', 'd': 'first'})

and then rename and reorder the columns:

In [39]: result.columns = ['a', 'c', 'e', 'b', 'd']

In [40]: result.reindex(columns=sorted(result.columns))
Out[40]:
        a  b    c  d         e
0   Apple  3  4.5  7  0.707107
1  Banana  4  4.0  8       NaN
2  Cherry  7  1.0  3       NaN
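On pandas 0.25 and later, named aggregation can assign the final column names directly, which avoids the manual renaming step. The following is a minimal sketch rather than part of the original answer; it assumes the DataFrame from the question is rebuilt by hand and uses reset_index instead of as_index=False:

```python
import pandas as pd

# Rebuild the example DataFrame from the question.
df = pd.DataFrame({
    "a": ["Apple", "Banana", "Cherry", "Apple"],
    "b": [3, 4, 7, 3],
    "c": [5, 4, 1, 4],
    "d": [7, 8, 3, 7],
})

# Named aggregation: each keyword is an output column,
# each value is (input column, aggregation function).
result = (
    df.groupby("a")
      .agg(
          b=("b", "first"),
          c=("c", "mean"),
          d=("d", "first"),
          e=("c", "std"),   # sample std (ddof=1), NaN for single-row groups
      )
      .reset_index()
)
print(result)
```

The keyword order determines the column order, so no reindex call is needed here.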

Pandas computes the sample std (ddof=1) by default, which is why the single-row groups show NaN above. To compute the population std (ddof=0) instead:

def pop_std(x):
    return x.std(ddof=0)

result = df.groupby(['a'], as_index=False).agg(
    {'c': ['mean', pop_std], 'b': 'first', 'd': 'first'})

result.columns = ['a', 'c', 'e', 'b', 'd']
result.reindex(columns=sorted(result.columns))

yields

        a  b    c  d    e
0   Apple  3  4.5  7  0.5
1  Banana  4  4.0  8  0.0
2  Cherry  7  1.0  3  0.0
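The desired output in the question keeps the sample value 0.707107 for Apple but shows 0 for the single-row groups, which neither the sample nor the population std produces on its own. One possible way to get that exact table, assuming it is acceptable to treat an undefined sample std as 0, is to fill the NaN values after aggregating. This is a sketch, not part of the original answer:

```python
import pandas as pd

# Rebuild the example DataFrame from the question.
df = pd.DataFrame({
    "a": ["Apple", "Banana", "Cherry", "Apple"],
    "b": [3, 4, 7, 3],
    "c": [5, 4, 1, 4],
    "d": [7, 8, 3, 7],
})

result = (
    df.groupby("a")
      .agg(b=("b", "first"), c=("c", "mean"), d=("d", "first"), e=("c", "std"))
      .reset_index()
)

# Assumption: a single-row group should report 0 instead of NaN.
result["e"] = result["e"].fillna(0)
print(result)
```

Whether 0 is a meaningful stand-in for an undefined sample std depends on how column 'e' is used afterwards.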


answered Sep 19 '22 14:09

unutbu
