I am creating a <code>groupby</code> object from a Pandas <code>DataFrame</code> and want to select all groups whose size is greater than 1. Example: <pre class="prettyprint"><code>     A  B
0  foo  0
1  bar  1
2  foo  2
3  foo  3
</code></pre> The following doesn't seem to work: <pre class="prettyprint"><code>grouped = df.groupby('A')
grouped[grouped.size > 1]
</code></pre> Expected Result: <pre class="prettyprint"><code>A
foo  0
     2
     3
</code></pre>

As of pandas 0.12 you can do: <pre class="prettyprint"><code>>>> grouped.filter(lambda x: len(x) > 1)
     A  B
0  foo  0
2  foo  2
3  foo  3
</code></pre>
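For anyone who wants to try this end to end, here is a minimal self-contained sketch built on the example frame from the question. The names <code>df</code>, <code>grouped</code>, <code>sizes</code> and <code>result</code> are only illustrative. It also shows one likely reason the original attempt fails: <code>size</code> is a method on the groupby object, so it has to be called before it can be compared.

```python
import pandas as pd

# Example frame from the question.
df = pd.DataFrame({"A": ["foo", "bar", "foo", "foo"], "B": [0, 1, 2, 3]})
grouped = df.groupby("A")

# size is a method: calling it returns a Series of group sizes indexed by key.
sizes = grouped.size()
large_keys = sizes[sizes > 1].index.tolist()

# filter keeps the rows of every group for which the function returns True.
result = grouped.filter(lambda g: len(g) > 1)
print(result)
```

The rows come back in their original order with their original index, so <code>result</code> can be used as a drop-in subset of <code>df</code>.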

I have found <code>transform</code> to be much more efficient than <code>filter</code> for very large dataframes: <pre class="prettyprint"><code>element_group_sizes = df['A'].groupby(df['A']).transform('size')
df[element_group_sizes>1]
</code></pre> Or, in one line: <pre class="prettyprint"><code>df[df['A'].groupby(df['A']).transform('size')>1]
</code></pre>
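A small sketch of the same idea on the question's example frame, with a check that it selects the same rows as <code>filter</code>. The variable names are illustrative. The speed difference is not measured here; a plausible explanation is that <code>filter</code> calls a Python function once per group, while <code>transform('size')</code> uses a built-in aggregation and the final selection is a single boolean mask.

```python
import pandas as pd

# Example frame from the question.
df = pd.DataFrame({"A": ["foo", "bar", "foo", "foo"], "B": [0, 1, 2, 3]})

# Size of each row's group, aligned with df's index.
group_sizes = df.groupby("A")["A"].transform("size")

# Boolean mask: keep rows whose group has more than one row.
via_transform = df[group_sizes > 1]

# The filter-based approach, for comparison.
via_filter = df.groupby("A").filter(lambda g: len(g) > 1)
print(via_transform.equals(via_filter))
```

Because <code>group_sizes</code> is an ordinary Series, it can also be combined with other row-level conditions using <code>&amp;</code> and <code>|</code> before indexing.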

Filtering grouped DataFrame in Pandas


asked Oct 31 '12 21:10

Abhi

2 Answers


answered Sep 22 '22 08:09

elyase


answered Sep 20 '22 08:09

Sealander
