I have a dataframe with different values in column <code>x</code>. I want to drop values that appear only once in a column. So this: <pre class="prettyprint"><code> x 1 10 2 30 3 30 4 40 5 40 6 50 </code></pre> Should turn into this: <pre class="prettyprint"><code> x 2 30 3 30 4 40 5 40 </code></pre> I was wondering if there is a way to do that.

You can easily get this by using <code>groupby</code> and <code>transform</code> : <pre class="prettyprint"><code>In [1]: import pandas as pd In [2]: df = pd.DataFrame([10, 30, 30, 40, 40, 50], columns=['x']) In [3]: df = df[df.groupby('x').x.transform(len) > 1] In [4]: df Out[4]: x 1 30 2 30 3 40 4 40 </code></pre>

You can use <code>groupby</code> and then <code>filter</code> it: <pre class="prettyprint"><code>In [9]: df = pd.DataFrame([10, 30, 30, 40, 40, 50], columns=['x']) df = df.groupby('x').filter(lambda x: len(x) > 1) df Out[9]: x 1 30 2 30 3 40 4 40 </code></pre>

Remove values that appear only once in a DataFrame column

Should turn into this:

I was wondering if there is a way to do that.

910

asked Oct 11 '15 23:10

Francisco García

2 Answers

You can easily get this by using groupby and transform :

In [1]: import pandas as pd

In [2]: df = pd.DataFrame([10, 30, 30, 40, 40, 50], columns=['x'])

In [3]: df = df[df.groupby('x').x.transform(len) > 1]

In [4]: df
Out[4]: 
    x
1  30
2  30
3  40
4  40

answered Oct 07 '22 00:10

Dimitris Fasarakis Hilliard

You can use groupby and then filter it:

In [9]:    
df = pd.DataFrame([10, 30, 30, 40, 40, 50], columns=['x'])
df = df.groupby('x').filter(lambda x: len(x) > 1)
df

Out[9]:
    x
1  30
2  30
3  40
4  40

answered Oct 06 '22 22:10

EdChum

Related questions
                            
                                Replacing Filename characters with python
                            
                                Adding a string to a list using augmented assignment
                            
                                Admin interface for SQLAlchemy?
                            
                                How can I use xdotool from within a python module/script?
                            
                                How to convert (inherit) parent to child class?
                            
                                How can I pass configuration variable values into the pyodbc connect command?
                            
                                Get file size from "Content-Length" value from a file in python 3.2
                            
                                How to write to CSV and not overwrite past text
                            
                                Python printing without commas
                            
                                Python - Splitting List That Contains Strings and Integers
                            
                                Sending a Dictionary using Sockets in Python?
                            
                                Python: categorising a list by orders of magnitude
                            
                                Filtering Characters from a String [duplicate]
                            
                                Getting attribute's value using BeautifulSoup
                            
                                Want to seperate the integer part and fractional part of float number in python [duplicate]
                            
                                How to add logging to a file with timestamps to a Python TCP Server for Raspberry Pi
                            
                                Is it possible to override __new__ in an enum to parse strings to an instance?
                            
                                Read file and plot CDF in Python
                            
                                Making a Fast Port Scanner
                            
                                What's the Perl equivalent of Python's enumerate?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Remove values that appear only once in a DataFrame column

Tags:

python

pandas

dataframe

filtering

Francisco García

People also ask

2 Answers

Dimitris Fasarakis Hilliard

EdChum

Recent Activity

Donate For Us