I have a large dataframe. When it was created, 'None' was used as the value wherever a number could not be calculated (instead of 'nan').

How can I delete all rows that have 'None' in any of their columns? I thought I could use df.dropna and set the value of na, but I can't seem to do that.

Thanks

I think this is a good representation of the dataframe:

temp = pd.DataFrame(data=[['str1','str2',2,3,5,6,76,8],['str3','str4',2,3,'None',6,76,8]])

Python Pandas Dataframe, remove all rows where 'None' is the value in any column

asked Aug 04 '17 17:08

jlt199

2 Answers

Setup
Borrowing @MaxU's df:

df = pd.DataFrame([
    [1, 2, 3],
    [4, None, 6],
    [None, 7, 8],
    [9, 10, 11]
], dtype=object)

Solution
If the missing values are actual None objects (not strings), pd.DataFrame.dropna works as is:

df.dropna()

   0   1   2
0  1   2   3
3  9  10  11
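
As a self-contained sketch of that step (the names rows_with_missing and cleaned are mine, added for illustration), you can check which rows pandas considers missing before dropping them:

```python
import pandas as pd

# Same frame as above: real None objects, not strings.
df = pd.DataFrame([
    [1, 2, 3],
    [4, None, 6],
    [None, 7, 8],
    [9, 10, 11]
], dtype=object)

# isna() treats None as missing, so any row containing None is flagged.
rows_with_missing = df.isna().any(axis=1)

# dropna() removes every row that has at least one missing value.
cleaned = df.dropna()
print(cleaned)
```

dropna returns a new dataframe, so assign the result back (df = df.dropna()) if you want to keep it.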

Suppose instead that the missing values are the string 'None', as in this df:

df = pd.DataFrame([
    [1, 2, 3],
    [4, 'None', 6],
    ['None', 7, 8],
    [9, 10, 11]
], dtype=object)

Then combine dropna with mask: mask replaces every cell equal to 'None' with NaN, and dropna removes the rows that contain one.

df.mask(df.eq('None')).dropna()

   0   1   2
0  1   2   3
3  9  10  11

If some columns are numeric, you can cast the entire dataframe to object before comparing, so that the comparison with the string is well defined for every column:

df.mask(df.astype(object).eq('None')).dropna()

   0   1   2
0  1   2   3
3  9  10  11
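
Here is a minimal sketch of the same idea applied to the temp frame from the question, which mixes string and integer columns. The boolean-filter variant at the end is my own assumption of an equivalent approach, not something from the answer above, and the names is_none_str, cleaned and filtered are illustrative.

```python
import pandas as pd

# Sample frame from the question: one row contains the string 'None'.
temp = pd.DataFrame(data=[
    ['str1', 'str2', 2, 3, 5, 6, 76, 8],
    ['str3', 'str4', 2, 3, 'None', 6, 76, 8],
])

# Cast to object first so the string comparison also works on the integer columns.
is_none_str = temp.astype(object).eq('None')

# Approach from the answer: blank out the 'None' cells, then drop those rows.
cleaned = temp.mask(is_none_str).dropna()

# Alternative filter (assumption): keep only rows with no 'None' string at all.
filtered = temp[~is_none_str.any(axis=1)]

print(cleaned)
print(filtered)
```

The filter variant leaves the remaining cells untouched, while the mask variant also drops rows that already had real missing values.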

answered Oct 16 '22 06:10

piRSquared

Thanks for all your help. In the end I was able to get

df = df.replace(to_replace='None', value=np.nan).dropna()

to work. I'm not sure why your suggestions didn't work for me.
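
For anyone who wants to try this, here is a self-contained sketch using the sample frame from the question. The imports and the variable name cleaned are added for illustration; the replace call itself is the one quoted above.

```python
import numpy as np
import pandas as pd

# Sample frame from the question: one row contains the string 'None'.
temp = pd.DataFrame(data=[
    ['str1', 'str2', 2, 3, 5, 6, 76, 8],
    ['str3', 'str4', 2, 3, 'None', 6, 76, 8],
])

# Turn the 'None' strings into real missing values, then drop those rows.
cleaned = temp.replace(to_replace='None', value=np.nan).dropna()
print(cleaned)
```

Depending on the pandas version, the column that held 'None' may still have object dtype afterwards; pd.to_numeric can convert it if numeric operations are needed.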

answered Oct 16 '22 07:10

jlt199

Tags: python, pandas, dataframe
