Numpy masked arrays - indicating missing values

Tags: python, arrays, numpy

import numpy as np
import numpy.ma as ma

# This works as expected: only the one matching value is masked
a = [0., 1., 1.e20, 9.]
error_value = 1.e20
b = ma.masked_values(a, error_value)
print(b)

# This does not: all values are masked
d = [0., 1., 'NA', 9.]
error_value = 'NA'
e = ma.masked_values(d, error_value)
print(e)

How can I use 'nan', 'NA', 'None', or some similar value to indicate missing data?

asked Jul 02 '11 04:07

Dick Eshelman

1 Answer

Are you getting your data from a text file or similar? If so, I'd suggest using the genfromtxt function directly to specify your masked value (StringIO below stands in for the file):

In [149]: f = StringIO('0.0, 1.0, NA, 9.0')

In [150]: a = np.genfromtxt(f, delimiter=',', missing_values='NA', usemask=True)

In [151]: a
Out[151]:
masked_array(data = [0.0 1.0 -- 9.0],
             mask = [False False  True False],
       fill_value = 1e+20)
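
For anyone who wants to run this outside an IPython session, here is a minimal standalone sketch of the same genfromtxt call. It assumes Python 3, where StringIO is imported from the io module; the printed repr may differ from the session above depending on your NumPy version.

```python
from io import StringIO

import numpy as np

# In-memory stand-in for a text file containing one row of data
f = StringIO("0.0, 1.0, NA, 9.0")

# usemask=True asks genfromtxt to return a masked array,
# with entries matching missing_values masked out
a = np.genfromtxt(f, delimiter=",", missing_values="NA", usemask=True)

print(a)
print(a.mask)
print(a.dtype)
```

The result keeps a float dtype, so the usual numeric operations on the masked array skip the missing entry.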

I think the problem in your example is that the Python list you're using to initialize the numpy array has heterogeneous types (floats and a string). The values are coerced to strings in the numpy array, but the masked_values function uses floating-point equality, which yields the strange results.
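
A quick way to see the coercion for yourself is to build the array and inspect its dtype. This is only a small illustration, assuming Python 3 and a reasonably recent NumPy:

```python
import numpy as np

# Mixed floats and a string: NumPy picks a string dtype for the whole array
d = np.array([0., 1., 'NA', 9.])

print(d.dtype)       # a Unicode string dtype, not float
print(d.dtype.kind)  # 'U' marks a Unicode string array
print(d)             # every element, including the numbers, is now a string
```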

Here's one way to overcome this by creating an array with object dtype:

In [152]: d = np.array([0., 1., 'NA', 9.], dtype=object)

In [153]: e = ma.masked_values(d, 'NA')

In [154]: e
Out[154]:
masked_array(data = [0.0 1.0 -- 9.0],
             mask = [False False  True False],
       fill_value = ?)

You may prefer the first solution since the result has a float dtype.
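
If the data is already in a Python list rather than a file, another option that keeps a float dtype is to turn the marker into NaN first and then mask the invalid entries. This is a sketch of that idea, not part of the approach above; it uses ma.masked_invalid, and the names raw and clean are just placeholders:

```python
import numpy as np
import numpy.ma as ma

raw = [0., 1., 'NA', 9.]

# Swap the 'NA' marker for NaN so the list converts cleanly to floats
clean = np.array([np.nan if v == 'NA' else v for v in raw], dtype=float)

# masked_invalid masks NaN (and inf) entries
e = ma.masked_invalid(clean)

print(e)
print(e.mask)
print(e.mean())  # the masked entry is ignored
```

Note that this also masks any genuine NaN or inf values in the data, which may or may not be what you want.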

answered Nov 14 '22 14:11

ars
