pandas get average of a groupby

Tags:

I am trying to find the average monthly cost per user_id but i am only able to get average cost per user or monthly cost per user.

Because i group by user and month, there is no way to get the average of the second groupby (month) unless i transform the groupby output to something else.

This is my df:

     df = { 'id' : pd.Series([1,1,1,1,2,2,2,2]),
            'cost' : pd.Series([10,20,30,40,50,60,70,80]),
            'mth': pd.Series([3,3,4,5,3,4,4,5])}

   cost  id  mth
0    10   1    3
1    20   1    3
2    30   1    4
3    40   1    5
4    50   2    3
5    60   2    4
6    70   2    4
7    80   2    5

I can get monthly sum but i want the average of the months for each user_id.

df.groupby(['id','mth'])['cost'].sum()

id  mth
1   3       30
    4       30
    5       40
2   3       50
    4      130
    5       80

i want something like this:

id average_monthly
1 (30+30+40)/3
2 (50+130+80)/3

348

asked Oct 16 '16 04:10

jxn

1 Answers

Resetting the index should work. Try this:

In [19]: df.groupby(['id', 'mth']).sum().reset_index().groupby('id').mean()  
Out[19]: 
    mth       cost
id                
1   4.0  33.333333
2   4.0  86.666667

You can just drop mth if you want. The logic is that after the sum part, you have this:

In [20]: df.groupby(['id', 'mth']).sum()
Out[20]: 
        cost
id mth      
1  3      30
   4      30
   5      40
2  3      50
   4     130
   5      80

Resetting the index at this point will give you unique months.

In [21]: df.groupby(['id', 'mth']).sum().reset_index()
Out[21]: 
   id  mth  cost
0   1    3    30
1   1    4    30
2   1    5    40
3   2    3    50
4   2    4   130
5   2    5    80

It's just a matter of grouping it again, this time using mean instead of sum. This should give you the averages.

Let us know if this helps.

164

answered Sep 25 '22 22:09

NullDev

Related questions
                            
                                How do you interpolate from an array containing datetime objects?
                            
                                Backup Odoo db from within odoo
                            
                                Why is it recommended to derive from Exception instead of BaseException class in Python?
                            
                                OrderedDict: are values ordered, too? [duplicate]
                            
                                How to map a series of conditions as keys in a dictionary?
                            
                                'numpy.ndarray' object has no attribute 'remove'
                            
                                Dictionary comprehension with inline functions
                            
                                How to print function arguments in sys.settrace?
                            
                                Spark using PySpark read images
                            
                                pandas create a series with n elements (sequential or randbetween)
                            
                                Tensorflow error using my own data
                            
                                Reconcile np.fromiter and multidimensional arrays in Python
                            
                                Testing matplotlib-based plots in Travis CI
                            
                                Python: format string with custom delimiters [duplicate]
                            
                                Can we have Django DateTimeField without timezone?
                            
                                python double colon with -1 as third parameter [duplicate]
                            
                                Keyboard shortcuts with tkinter in Python 3
                            
                                Django REST Framework - Set request in serializer test?
                            
                                python subprocess.Popen hanging
                            
                                Is there a Python equivalent to the C# ?. and ?? operators?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

pandas get average of a groupby

Tags:

python

pandas

dataframe

group-by

jxn

People also ask

1 Answers

NullDev

Recent Activity

Donate For Us