Basically, I'm doing some data analysis. I read in a dataset as a numpy.ndarray and some of the values are missing (either by just not being there, being <code>NaN</code>, or by being a string written "<code>NA</code>"). I want to clean out all rows containing any entry like this. How do I do that with a numpy ndarray?

<pre class="prettyprint"><code>>>> a = np.array([[1,2,3], [4,5,np.nan], [7,8,9]]) array([[ 1., 2., 3.], [ 4., 5., nan], [ 7., 8., 9.]]) >>> a[~np.isnan(a).any(axis=1)] array([[ 1., 2., 3.], [ 7., 8., 9.]]) </code></pre> and reassign this to <code>a</code>. Explanation: <code>np.isnan(a)</code> returns a similar array with <code>True</code> where <code>NaN</code>, <code>False</code> elsewhere. <code>.any(axis=1)</code> reduces an <code>m*n</code> array to <code>n</code> with an logical <code>or</code> operation on the whole rows, <code>~</code> inverts <code>True/False</code> and <code>a[ ]</code> chooses just the rows from the original array, which have <code>True</code> within the brackets.

How to remove all rows in a numpy.ndarray that contain non-numeric values

1 Answers

Click to copy

>>> a = np.array([[1,2,3], [4,5,np.nan], [7,8,9]]) array([[  1.,   2.,   3.],        [  4.,   5.,  nan],        [  7.,   8.,   9.]])  >>> a[~np.isnan(a).any(axis=1)] array([[ 1.,  2.,  3.],        [ 7.,  8.,  9.]])

and reassign this to a.

Explanation: np.isnan(a) returns a similar array with True where NaN, False elsewhere. .any(axis=1) reduces an m*n array to n with an logical or operation on the whole rows, ~ inverts True/False and a[ ] chooses just the rows from the original array, which have True within the brackets.

193

answered Sep 19 '22 13:09

eumiro

Related questions
                            
                                How can you determine a point is between two other points on a line segment?
                            
                                Finding all possible permutations of a given string in python
                            
                                Boto3 to download all files from a S3 Bucket
                            
                                How can I get the current language in Django?
                            
                                How can I make a scatter plot colored by density in matplotlib?
                            
                                How to make the python interpreter correctly handle non-ASCII characters in string operations?
                            
                                multiprocessing: Understanding logic behind `chunksize`
                            
                                Changing file extension in Python
                            
                                ImportError: No module named psycopg2
                            
                                How to find the cumulative sum of numbers in a list?
                            
                                Quicksort with Python
                            
                                Inherit docstrings in Python class inheritance
                            
                                How to convert Python's .isoformat() string back into datetime object [duplicate]
                            
                                Resolving new pip backtracking runtime issue
                            
                                Seaborn - Why import as sns?
                            
                                Catching KeyboardInterrupt in Python during program shutdown
                            
                                Is there a way to compile a python application into static binary?
                            
                                How to handle both `with open(...)` and `sys.stdout` nicely?
                            
                                matplotlib taking time when being imported
                            
                                Get filename from file pointer [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to remove all rows in a numpy.ndarray that contain non-numeric values

Tags:

python

numpy

zebra

People also ask

1 Answers

eumiro

Recent Activity

Donate For Us