I have this code to drop null values from the Killed column, but only for rows where Type is 'Dog':
cd.loc[cd['Type'] == 'Dog'].dropna(subset = ['Killed'], inplace = True)
I would like to dropna when the ['Killed'] column has a NaN value and Type = 'Dog'.
The code above generates this pandas warning:
A value is trying to be set on a copy of a slice from a DataFrame
Is there another way I can dropna on ['Killed'] when ['Type'] == 'Dog'?
(This is my first post, sorry if I haven't explained it properly.) Cheers
To drop all rows containing NaN values you can use df.dropna() — dropping any row with a null (NaN/NaT) value anywhere is the default behavior of dropna(). To drop nulls only for a specific value in another column, you have to combine dropna (or a null check) with a condition on that column, as the answers below show.
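A minimal sketch of that default behavior, on hypothetical data:

```python
import numpy as np
import pandas as pd

# Hypothetical frame: each of the last two rows has one null value
df = pd.DataFrame({'a': [1.0, np.nan, 3.0], 'b': ['x', 'y', None]})

# dropna() with no arguments drops every row containing any null;
# subset=['a'] would instead keep the row where only 'b' is null
clean = df.dropna()
```

Only the first row survives, since both other rows contain at least one null.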
Very similar to @BrenBarn's answer, but using drop and inplace:
cd.drop(cd[(cd.Type == 'Dog') & (cd.Killed.isnull())].index, inplace=True)
cd = pd.DataFrame([
['Dog', 'Yorkie'],
['Cat', 'Rag Doll'],
['Cat', None],
['Bird', 'Caique'],
['Dog', None],
], columns=['Type', 'Killed'])
cd.drop(cd[(cd.Type == 'Dog') & (cd.Killed.isnull())].index, inplace=True)
cd
Equivalently, by De Morgan's law:
cond1 = cd.Type == 'Dog'
cond2 = cd.Killed.isnull()
cd[~cond1 | ~cond2]
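As a self-contained check (reusing the toy frame from the answer above), the two masks select exactly the same rows:

```python
import pandas as pd

# Toy data mirroring the example above
cd = pd.DataFrame([
    ['Dog', 'Yorkie'],
    ['Cat', 'Rag Doll'],
    ['Cat', None],
    ['Bird', 'Caique'],
    ['Dog', None],
], columns=['Type', 'Killed'])

cond1 = cd.Type == 'Dog'
cond2 = cd.Killed.isnull()

# ~cond1 | ~cond2 is the same mask as ~(cond1 & cond2)
kept = cd[~cond1 | ~cond2]
```

Only the Dog row with a null Killed value is dropped; the Cat row with a null survives because its Type is not 'Dog'.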
A silly one, because I felt like it!
cd.groupby('Type', group_keys=False) \
.apply(lambda df: df.dropna(subset=['Killed']) if df.name == 'Dog' else df)
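Sketched end-to-end on the same toy data (inside apply, each sub-frame's .name attribute is its group key):

```python
import pandas as pd

cd = pd.DataFrame([
    ['Dog', 'Yorkie'],
    ['Cat', 'Rag Doll'],
    ['Cat', None],
    ['Bird', 'Caique'],
    ['Dog', None],
], columns=['Type', 'Killed'])

# Drop nulls only inside the 'Dog' group; pass every other group through untouched
result = (cd.groupby('Type', group_keys=False)
            .apply(lambda df: df.dropna(subset=['Killed']) if df.name == 'Dog' else df))
```

One caveat: groupby concatenates the groups back in group-key order, so the result's row order differs from the original frame.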
It sounds like you want to remove rows where Type is "Dog" and Killed is NaN. So just select the negation of that condition:
cd = cd.loc[~((cd.Type=="Dog") & cd.Killed.isnull())]
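A runnable sketch of this approach on toy data (the column values are assumptions):

```python
import pandas as pd

cd = pd.DataFrame([
    ['Dog', 'Yorkie'],
    ['Cat', 'Rag Doll'],
    ['Cat', None],
    ['Bird', 'Caique'],
    ['Dog', None],
], columns=['Type', 'Killed'])

# Keep every row that is NOT (Type == 'Dog' AND Killed is NaN);
# assigning the result back avoids the SettingWithCopy warning entirely
cd = cd.loc[~((cd.Type == "Dog") & cd.Killed.isnull())]
```

This also sidesteps the original error, because it builds a new frame from a boolean mask instead of calling an inplace method on a slice.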