I have a column of sites: ['Canada', 'USA', 'China' ....] Each site occurs many times in the SITE column and next to each instance is a true or false value. <pre class="prettyprint"><code>INDEX | VALUE | SITE 0 | True | Canada 1 | False | Canada 2 | True | USA 3 | True | USA </code></pre> And it goes on. Goal 1: I want to find, for each site, what percent of the VALUE column is True. Goal 2: I want to return a list of sites where % True in the VALUE column is greater than 10%. How do I use groupby to achieve this? I only know how to use groupby to find the mean for each site which won't help me here.

Something like this: <pre class="prettyprint"><code>In [13]: g = df.groupby('SITE')['VALUE'].mean() In [14]: g[g > 0.1] Out[14]: SITE Canada 0.5 USA 1.0 </code></pre>

Pandas groupby to find percent True and False

Tags:

python

pandas

python-2.7

I have a column of sites: ['Canada', 'USA', 'China' ....]

Each site occurs many times in the SITE column and next to each instance is a true or false value.

INDEX | VALUE | SITE

0     | True  | Canada
1     | False | Canada
2     | True  | USA
3     | True  | USA

And it goes on.

Goal 1: I want to find, for each site, what percent of the VALUE column is True.

Goal 2: I want to return a list of sites where % True in the VALUE column is greater than 10%.

How do I use groupby to achieve this? I only know how to use groupby to find the mean for each site which won't help me here.

454

asked May 18 '15 19:05

pythanaconda

1 Answers

Something like this:

In [13]: g = df.groupby('SITE')['VALUE'].mean()
In [14]: g[g > 0.1]
Out[14]: 
SITE
Canada    0.5
USA       1.0

answered Oct 22 '22 20:10

Roman Pekar

Related questions
                            
                                FuncAnimation goes past the frames argument
                            
                                HTMLParser for Python 3.4
                            
                                Print unicode string in python regardless of environment
                            
                                Send SIGINT in python to os.system
                            
                                Best way to permute contents of each column in numpy
                            
                                pandas: iterating over DataFrame index with loc
                            
                                Add a tuple to a specific cell of a pandas dataframe
                            
                                Determining whether a word is a noun or not
                            
                                Why is object.__getattr__ missing?
                            
                                Multiple pipes in subprocess
                            
                                Opening file in append mode and seeking to start
                            
                                Error in python-igraph 'module' object has no attribute 'Graph'
                            
                                How to include .pyx file in python package
                            
                                VideoCapture Does Not Work in Anaconda
                            
                                Convert superclass instance to subclass instance
                            
                                Modify Python/PIP to automatically install modules when failed to import
                            
                                Find and replace multiple values in python
                            
                                How to get Asynchronous Javascript responses from Selenium Webdriver
                            
                                How to Download PDFs from Scraped Links [Python]?
                            
                                Creating class instance from dictionary?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With