I have some values in the <code>risk</code> column that are neither, <code>Small</code>, <code>Medium</code> or <code>High</code>. I want to delete the rows with the value not being <code>Small</code>, <code>Medium</code> and <code>High</code>. I tried the following: <pre class="prettyprint"><code>df = df[(df.risk == "Small") | (df.risk == "Medium") | (df.risk == "High")] </code></pre> But this returns an empty DataFrame. How can I filter them correctly?

Another nice and readable approach is the following: <pre class="prettyprint"><code>small_risk = df["risk"] == "Small" medium_risk = df["risk"] == "Medium" high_risk = df["risk"] == "High" </code></pre> Then you can use it like this: <pre class="prettyprint"><code>df[small_risk | medium_risk | high_risk] </code></pre> or <pre class="prettyprint"><code>df[small_risk & medium_risk] </code></pre>

Pandas filter rows based on multiple conditions

Tags:

python

pandas

dataframe

I have some values in the risk column that are neither, Small, Medium or High. I want to delete the rows with the value not being Small, Medium and High. I tried the following:

df = df[(df.risk == "Small") | (df.risk == "Medium") | (df.risk == "High")]

But this returns an empty DataFrame. How can I filter them correctly?

949

asked Apr 27 '14 13:04

ArtDijk

2 Answers

I think you want:

df = df[(df.risk.isin(["Small","Medium","High"]))]

Example:

In [5]:
import pandas as pd
df = pd.DataFrame({'risk':['Small','High','Medium','Negligible', 'Very High']})
df

Out[5]:

         risk
0       Small
1        High
2      Medium
3  Negligible
4   Very High

[5 rows x 1 columns]

In [6]:

df[df.risk.isin(['Small','Medium','High'])]

Out[6]:

     risk
0   Small
1    High
2  Medium

[3 rows x 1 columns]

answered Sep 20 '22 15:09

EdChum

Another nice and readable approach is the following:

small_risk = df["risk"] == "Small"
medium_risk = df["risk"] == "Medium"
high_risk = df["risk"] == "High"

Then you can use it like this:

df[small_risk | medium_risk | high_risk]

df[small_risk & medium_risk]

answered Sep 21 '22 15:09

Rafael

Related questions
                            
                                Z3/Python getting python values from model
                            
                                from sys import argv - what is the function of "script"
                            
                                Using pandas to select rows using two different columns from dataframe?
                            
                                Testing Python C libraries - get build path
                            
                                Define a route for url ending with integer in python
                            
                                Does it make sense to install my Python unit tests in site-packages?
                            
                                How to combine multiple numpy masks
                            
                                'variable' or 'variable is not None' [duplicate]
                            
                                Gradient in noisy data, python
                            
                                Python curses Redirection is not supported
                            
                                Python Random Map Generation with Perlin Noise
                            
                                How to run specific test in Nose2
                            
                                Python Redis connection should be closed on every request? (flask)
                            
                                Get data points from a histogram in Python
                            
                                Python - Using str.replace with a wildcard
                            
                                How to set window size using phantomjs and selenium webdriver in python
                            
                                How to store django objects as session variables ( object is not JSON serializable)?
                            
                                How to get type of multidimensional Numpy array elements in Python
                            
                                Python: running multiple processes simultaneously
                            
                                pretty_print option in tostring not working in lxml

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With