How to drop rows from pandas data frame that contains a particular string in a particular column? [duplicate]

People also ask

How do you delete rows based on duplicates in one column in Python?

To remove duplicates of only one or a subset of columns, specify subset as the individual column or list of columns that should be unique. To do this conditional on a different column's value, you can sort_values(colname) and specify keep equals either first or last .

How do you drop rows with certain values?

One of the fastest ways to delete rows that contain a specific value or fulfill a given condition is to filter these. Once you have the filtered data, you can delete all these rows (while the remaining rows remain intact).

pandas has vectorized string operations, so you can just filter out the rows that contain the string you don't want:

In [91]: df = pd.DataFrame(dict(A=[5,3,5,6], C=["foo","bar","fooXYZbar", "bat"]))

In [92]: df
Out[92]:
   A          C
0  5        foo
1  3        bar
2  5  fooXYZbar
3  6        bat

In [93]: df[~df.C.str.contains("XYZ")]
Out[93]:
   A    C
0  5  foo
1  3  bar
3  6  bat

If your string constraint is not just one string you can drop those corresponding rows with:

df = df[~df['your column'].isin(['list of strings'])]

The above will drop all rows containing elements of your list

This will only work if you want to compare exact strings. It will not work in case you want to check if the column string contains any of the strings in the list.

The right way to compare with a list would be :

searchfor = ['john', 'doe']
df = df[~df.col.str.contains('|'.join(searchfor))]

Slight modification to the code. Having na=False will skip empty values. Otherwise you can get an error TypeError: bad operand type for unary ~: float

df[~df.C.str.contains("XYZ", na=False)]

Source: TypeError: bad operand type for unary ~: float

Related questions
                            
                                Numpy: Divide each row by a vector element
                            
                                What does preceding a string literal with "r" mean? [duplicate]
                            
                                Can I use __init__.py to define global variables?
                            
                                How do I call setattr() on the current module?
                            
                                How to duplicate virtualenv
                            
                                What is the difference between json.dumps and json.load? [closed]
                            
                                What do >> and << mean in Python?
                            
                                Add single element to array in numpy
                            
                                Getting attributes of a class
                            
                                VSCode -- how to set working directory for debug
                            
                                What's the easiest way to escape HTML in Python?
                            
                                Function for Factorial in Python
                            
                                How do lexical closures work?
                            
                                How can I connect to MySQL in Python 3 on Windows?
                            
                                Concatenate strings from several rows using Pandas groupby
                            
                                How to use Python requests to fake a browser visit a.k.a and generate User Agent?
                            
                                Sleeping in a batch file
                            
                                How can I enable CORS on Django REST Framework
                            
                                Timeout function if it takes too long to finish [duplicate]
                            
                                How do I calculate square root in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to drop rows from pandas data frame that contains a particular string in a particular column? [duplicate]

Tags:

python

pandas

People also ask

Recent Activity

Donate For Us