Remove lines in dataframe using a list in Pandas

Tags: python, pandas

This is a general question about filtering a pandas dataframe using a list. The setup is as follows:

I have a pandas dataframe df with a column field
I have a list of banned fields, for example ban_field=['field1','field2','field3']
All elements of ban_field appear in df.field

For now, to get the dataframe without the banned fields, I proceed as follows:

for f in ban_field:
    df = df[df.field!=f]

Is there a more Pythonic way to do this, ideally in one line?

342

asked Sep 10 '14 14:09

Colonel Beauvel

1 Answer

Method #1: use isin and a boolean array selector:

In [47]: df = pd.DataFrame({"a": [2]*10, "field": range(10)})

In [48]: ban_field = [3,4,6,7,8]

In [49]: df[~df.field.isin(ban_field)]
Out[49]: 
   a  field
0  2      0
1  2      1
2  2      2
5  2      5
9  2      9

[5 rows x 2 columns]
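
The same mask works for a string-valued column like the one in the question. Here is a minimal sketch that compares it against the original loop; the sample data and the helper name drop_banned are made up for illustration and are not part of the question.

```python
import pandas as pd


def drop_banned(df, column, banned):
    """Return the rows of df whose value in `column` is not in `banned`."""
    # isin builds a boolean Series; ~ inverts it so banned rows are dropped
    return df[~df[column].isin(banned)]


# Hypothetical data: three banned values mixed with two rows to keep
df = pd.DataFrame({
    "a": [1, 2, 3, 4, 5],
    "field": ["field1", "keep1", "field2", "keep2", "field3"],
})
ban_field = ["field1", "field2", "field3"]

filtered = drop_banned(df, "field", ban_field)

# The loop from the question, for comparison
looped = df
for f in ban_field:
    looped = looped[looped.field != f]

same_result = filtered.equals(looped)
print(filtered)
```

The mask is built once, so this avoids creating an intermediate dataframe for every banned value as the loop does.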

Method #2: use query:

In [51]: df.query("field not in @ban_field")
Out[51]: 
   a  field
0  2      0
1  2      1
2  2      2
5  2      5
9  2      9

[5 rows x 2 columns]
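
As a quick check that the query form selects the same rows as a boolean mask, here is a self-contained sketch reusing the data above. The names by_mask, by_query and kept are only illustrative. The @ prefix tells query to look up ban_field as a Python variable in the calling scope rather than as a column.

```python
import pandas as pd

df = pd.DataFrame({"a": [2] * 10, "field": range(10)})
ban_field = [3, 4, 6, 7, 8]

# Boolean mask: keep rows whose field is not in the banned list
by_mask = df[~df["field"].isin(ban_field)]

# query: @ban_field refers to the local variable defined above
by_query = df.query("field not in @ban_field")

kept = by_query["field"].tolist()
print(kept)
print(by_query.equals(by_mask))
```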

173

answered Sep 17 '22 23:09

DSM
