Creating a masked array in Python with multiple given values

Tags:

I am graphing several columns of a large array of data (through numpy.genfromtxt) against an equally sized time column. Missing data is often referred to as nan, -999, -9999, etc. However I can't figure out how to remove multiple values from the array. This is what I currently have:

for cur_col in range(start_col, total_col):
    # Generate what is to be graphed by removing nan values
    data_mask = (file_data[:, cur_col] != nan_values)
    y_data = file_data[:, cur_col][data_mask]
    x_data = file_data[:, time_col][data_mask]

After which point I use matplotlib to create the appropriate figures for each column. This works fine if the nan_values is a single integer, but I am looking to use a list.

EDIT: Here is a working example.

import numpy as np

file_data = np.arange(12.0).reshape((4,3))
file_data[1,1] = np.nan
file_data[2,2] = -999
nan_values = -999

for cur_col in range(1,3):
    # Generate what is to be graphed by removing nan values
    data_mask = (file_data[:, cur_col] != nan_values)
    y_data = file_data[:, cur_col][data_mask]
    x_data = file_data[:, 0][data_mask]
    print 'y: ' + str(y_data)
    print 'x: ' + str(x_data)
print file_data

>>> y: [  1.  nan   7.  10.]
    x: [ 0.  3.  6.  9.]
    y: [  2.   5.  11.]
    x: [ 0.  3.  9.]
    [[   0.    1.    2.]
    [   3.   nan    5.]
    [   6.    7. -999.]
    [   9.   10.   11.]]

This will not work if nan_values = ['nan', -999] which is what I am looking to accomplish.

602

asked Jun 21 '12 20:06

Josiah

2 Answers

I would suggest using masked arrays like so:

>>> a = np.arange(12.0).reshape((4,3))
>>> a[1,1] = np.nan
>>> a[2,2] = -999
>>> a
array([[   0.,    1.,    2.],
       [   3.,   nan,    5.],
       [   6.,    7., -999.],
       [   9.,   10.,   11.]])
>>> m = np.ma.array(a,mask=(~np.isfinite(a) | (a == -999)))
>>> m
masked_array(data =
 [[0.0 1.0 2.0]
 [3.0 -- 5.0]
 [6.0 7.0 --]
 [9.0 10.0 11.0]],
             mask =
 [[False False False]
 [False  True False]
 [False False  True]
 [False False False]],
       fill_value = 1e+20)

answered Sep 28 '22 10:09

user545424

I would try something like (pseudo-code):

nan_values = [...]

for cur_col in range(start_col, total_col):
    # Generate what is to be graphed by removing nan values
    y_data = [file_data[i,cur_col] for i in range(len(file_data)) if not(file_data[i,cur_col] in nan_values)]
    x_data = [file_data[i,time_col] for i in range(len(file_data)) if not(file_data[i,cur_col] in nan_values)]

answered Sep 28 '22 08:09

GL770

Related questions
                            
                                how to fit a function using PyBrain networks?
                            
                                Pythonic way to write package for easy importing
                            
                                Update/Refresh Dynamically–Created WxPython Widgets
                            
                                Converting timestamps larger than maxint into datetime objects
                            
                                Python Beautifulsoup img tag parsing
                            
                                How can I load a password-protected private key from a .pem file with M2Crypto?
                            
                                Test if file under version control in pysvn (python subversion)
                            
                                Merge two lists,one as keys, one as values, into a dict in Python [duplicate]
                            
                                Dive into python and-or fail
                            
                                Django HttpResponseRedirect with int parameter
                            
                                Python implementation of "median of medians" algorithm
                            
                                How to collect performance metrics in a Flask Application?
                            
                                Handling failures with Fabric
                            
                                Import instance of class from a different module
                            
                                Django static files, and filepaths in settings.py
                            
                                Functional programming — for and while loops
                            
                                cv2 and BGR2YCrCb not working with Python bindings
                            
                                imaplib/gmail how to download full message (all parts) while not marking read [duplicate]
                            
                                Change the color of text in python shell?
                            
                                Django guests vote only once poll

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Creating a masked array in Python with multiple given values

Tags:

python

arrays

numpy

mask

Josiah

People also ask

2 Answers

user545424

GL770

Recent Activity

Donate For Us