Interpolate NaN values in a numpy array

Tags: python, nan, numpy, interpolation

Is there a quick way of replacing all NaN values in a numpy array with (say) the linearly interpolated values?

For example,

[1 1 1 nan nan 2 2 nan 0]

would be converted into

[1 1 1 1.3 1.6 2 2  1  0]

266

asked Jun 29 '11 09:06

Petter

2 Answers

Let's first define a simple helper function to make it easier to handle indices and logical indices of NaNs:

import numpy as np

def nan_helper(y):
    """Helper to handle indices and logical indices of NaNs.

    Input:
        - y, 1d numpy array with possible NaNs
    Output:
        - nans, logical indices of NaNs
        - index, a function, with signature indices= index(logical_indices),
          to convert logical indices of NaNs to 'equivalent' indices
    Example:
        >>> # linear interpolation of NaNs
        >>> nans, x= nan_helper(y)
        >>> y[nans]= np.interp(x(nans), x(~nans), y[~nans])
    """

    return np.isnan(y), lambda z: z.nonzero()[0]

The nan_helper(.) function can now be used like this:

>>> y = np.array([1, 1, 1, np.nan, np.nan, 2, 2, np.nan, 0])
>>>
>>> nans, x = nan_helper(y)
>>> y[nans] = np.interp(x(nans), x(~nans), y[~nans])
>>>
>>> print(y.round(2))
[1.   1.   1.   1.33 1.67 2.   2.   1.   0.  ]

---
Although at first it may seem like overkill to define a separate function just to do something like this:

>>> nans, x= np.isnan(y), lambda z: z.nonzero()[0]

it will eventually pay dividends.

So, whenever you are working with NaN-related data, encapsulate the NaN-related functionality you need in one or more dedicated helper functions. Your code base will be more coherent and readable, because it follows easily understood idioms.

Interpolation is a good context in which to see how NaN handling is done, but similar techniques are used in many other contexts as well.
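
As one possible way to package the idiom above, here is a minimal sketch of a wrapper that returns a filled copy instead of modifying the input in place. The name interpolate_nans is hypothetical (it is not part of NumPy), and the choice to return all-NaN input unchanged is an assumption, made because np.interp has no valid samples to work from in that case.

```python
import numpy as np


def interpolate_nans(y):
    """Return a copy of 1-D array y with NaNs linearly interpolated.

    Hypothetical helper name; not part of NumPy.
    """
    y = np.array(y, dtype=float)   # np.array copies, so the caller's data is untouched
    nans = np.isnan(y)             # logical indices of NaNs
    if nans.all() or not nans.any():
        return y                   # no valid samples to use, or nothing to fill
    idx = np.arange(y.size)        # positions 0..n-1
    y[nans] = np.interp(idx[nans], idx[~nans], y[~nans])
    return y


original = np.array([1, 1, 1, np.nan, np.nan, 2, 2, np.nan, 0])
filled = interpolate_nans(original)
print(filled.round(2))
```

Working on a copy keeps the original array, NaNs included, available for later inspection.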

167

answered Sep 18 '22 19:09

eat

I came up with this code:

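import numpy as np
nan = np.nan

A = np.array([1, nan, nan, 2, 2, nan, 0])

ok = ~np.isnan(A)                     # mask of valid entries; use ~, since unary minus fails on boolean arrays in current NumPy
xp = ok.ravel().nonzero()[0]          # indices of the valid entries
fp = A[ok]                            # the valid values
x = np.isnan(A).ravel().nonzero()[0]  # indices of the NaNs

A[np.isnan(A)] = np.interp(x, xp, fp)

print(A)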

It prints

[1.         1.33333333 1.66666667 2.         2.         1.         0.        ]
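
One thing to keep in mind with this approach is that np.interp does not extrapolate. By default, query points outside the range of the valid samples get the first or last valid value, so leading and trailing NaNs are filled with their nearest valid neighbour. The following is a minimal sketch with made-up example data (the array B and the names clamped and kept are illustrative only). It shows the default behaviour, and how the left and right parameters of np.interp can be used to leave the edges as NaN instead.

```python
import numpy as np

B = np.array([np.nan, 1.0, np.nan, 3.0, np.nan])  # example data with NaNs at both ends
nans = np.isnan(B)
idx = np.arange(B.size)

# Default behaviour: edge NaNs take the nearest valid value.
clamped = B.copy()
clamped[nans] = np.interp(idx[nans], idx[~nans], B[~nans])

# With left/right set to NaN, points outside the valid range stay NaN.
kept = B.copy()
kept[nans] = np.interp(idx[nans], idx[~nans], B[~nans],
                       left=np.nan, right=np.nan)

print(clamped)
print(kept)
```

Which behaviour is preferable depends on the data; leaving the edges as NaN avoids presenting a constant fill as if it were an interpolated value.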

answered Sep 19 '22 19:09

Petter
