Pandas Groupby apply function to count values greater than zero I am using groupby and agg in the following manner: <pre class="prettyprint"><code>df.groupby('group')['a'].agg({'mean' : np.mean, 'std' : np.std}) </code></pre> and I would like to also count the values above zero in the same column ['a'] the following line does the count as I want, <pre class="prettyprint"><code>sum(x > 0 for x in df['a']) </code></pre> but I can't get it work when applying to groupby. Following an example for applying a pandas calculation to a groupby I tried: <pre class="prettyprint"><code>df.groupby('group')['a'].apply(sum(x > 0 for x in df['a'])) </code></pre> but I get an error message: AttributeError: 'numpy.int32' object has no attribute 'module' Can anybody please suggest how this might be done?

Answer from the comments: <pre class="prettyprint"><code> .agg({'pos':lambda ts: (ts > 0).sum()}) # – behzad.nouri Mar 31 at 0:00 </code></pre> This is my contribution to the backlog of unanswered questions :) Credits to behzad.nouri Update 2020 In the latest pandas version, you need to do the following: <pre class="prettyprint"><code> .agg(pos=lambda ts: (ts > 0).sum()) </code></pre> otherwise it will result in the following error: <pre class="prettyprint"><code>SpecificationError: nested renamer is not supported </code></pre>

Pandas Groupby apply function to count values greater than zero

df.groupby('group')['a'].agg({'mean' : np.mean, 'std' : np.std})

and I would like to also count the values above zero in the same column ['a']

the following line does the count as I want,

sum(x > 0 for x in df['a'])

but I can't get it work when applying to groupby.

Following an example for applying a pandas calculation to a groupby I tried:

df.groupby('group')['a'].apply(sum(x > 0 for x in df['a']))

but I get an error message: AttributeError: 'numpy.int32' object has no attribute 'module'

Can anybody please suggest how this might be done?

957

asked Mar 30 '14 23:03

rdh9

1 Answers

Answer from the comments:

 .agg({'pos':lambda ts: (ts > 0).sum()}) # –  behzad.nouri Mar 31 at 0:00

This is my contribution to the backlog of unanswered questions :) Credits to behzad.nouri

Update 2020 In the latest pandas version, you need to do the following:

 .agg(pos=lambda ts: (ts > 0).sum())

otherwise it will result in the following error:

SpecificationError: nested renamer is not supported

123

answered Sep 23 '22 08:09

Reblochon Masque

Related questions
                            
                                How can I find all the possible combinations of a list of lists (in Python)?
                            
                                trouble installing rpy2 on win7 (R 2.12, Python 2.5)
                            
                                How to animate a time-ordered sequence of matplotlib plots
                            
                                Mocking before importing a module
                            
                                how to generate numbers given their prime factors, but with unknown exponents? [duplicate]
                            
                                Drawing Histogram in OpenCV-Python
                            
                                Django Generic Views: When to use ListView vs. DetailView
                            
                                Control Charts in Python [closed]
                            
                                Is there a way to step into decorated functions, skipping decorator code
                            
                                Should email "Date" header be sender's local time or UTC?
                            
                                PySide/PyQt - Starting a CPU intensive thread hangs the whole application
                            
                                Is it safe to use python's -S option?
                            
                                How to set up Python server side with javascript client side
                            
                                How to define a mutually exclusive group of two positional arguments?
                            
                                Resample a time series with the index of another time series
                            
                                Plot a 3D surface from {x,y,z}-scatter data in python
                            
                                SqlAlchemy select with max, group_by and order_by
                            
                                override class variable in python?
                            
                                Is a generator the callable? Which is the generator?
                            
                                Concatenate custom features with CountVectorizer

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas Groupby apply function to count values greater than zero

Tags:

python

python-3.x

pandas

rdh9

People also ask

1 Answers

Reblochon Masque

Recent Activity

Donate For Us