Is there a way to write an aggregation function as is used in <code>DataFrame.agg</code> method, that would have access to more than one column of the data that is being aggregated? Typical use cases would be weighted average, weighted standard deviation funcs. I would like to be able to write something like <pre class="prettyprint"><code>def wAvg(c, w): return ((c * w).sum() / w.sum()) df = DataFrame(....) # df has columns c and w, i want weighted average # of c using w as weight. df.aggregate ({"c": wAvg}) # and somehow tell it to use w column as weights ... </code></pre>

Yes; use the <code>.apply(...)</code> function, which will be called on each sub-<code>DataFrame</code>. For example: <pre class="prettyprint"><code>grouped = df.groupby(keys) def wavg(group): d = group['data'] w = group['weights'] return (d * w).sum() / w.sum() grouped.apply(wavg) </code></pre>

Pandas DataFrame aggregate function using multiple columns

Tags:

python

pandas

Is there a way to write an aggregation function as is used in DataFrame.agg method, that would have access to more than one column of the data that is being aggregated? Typical use cases would be weighted average, weighted standard deviation funcs.

I would like to be able to write something like

def wAvg(c, w):     return ((c * w).sum() / w.sum())  df = DataFrame(....) # df has columns c and w, i want weighted average                      # of c using w as weight. df.aggregate ({"c": wAvg}) # and somehow tell it to use w column as weights ...

507

asked Jun 08 '12 15:06

user1444817

1 Answers

Yes; use the .apply(...) function, which will be called on each sub-DataFrame. For example:

grouped = df.groupby(keys)  def wavg(group):     d = group['data']     w = group['weights']     return (d * w).sum() / w.sum()  grouped.apply(wavg)

191

answered Sep 19 '22 02:09

Wes McKinney

Related questions
                            
                                IndexError: too many indices for array
                            
                                Django model manager objects.create where is the documentation?
                            
                                Why does map return a map object instead of a list in Python 3?
                            
                                Why use Django on Google App Engine?
                            
                                How to get stable results with TensorFlow, setting random seed
                            
                                Can I add custom methods/attributes to built-in Python types?
                            
                                How to get reproducible results in keras
                            
                                numpy division with RuntimeWarning: invalid value encountered in double_scalars
                            
                                Is there special significance to 16331239353195370.0?
                            
                                Understanding time.perf_counter() and time.process_time()
                            
                                str performance in python
                            
                                Why is the Borg pattern better than the Singleton pattern in Python
                            
                                Python - TypeError: 'int' object is not iterable
                            
                                Most suitable python library for Github API v3 [closed]
                            
                                Python Equivalent of setInterval()?
                            
                                Call a Python method by name
                            
                                Why is bool a subclass of int?
                            
                                How can I troubleshoot Python "Could not find platform independent libraries <prefix>"
                            
                                Mock attributes in Python mock?
                            
                                Converting a float to a string without rounding it

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With