Imagine you have a structured numpy array, generated from a csv with the first row as field names. The array has the form: <pre class="prettyprint"><code>dtype([('A', '<f8'), ('B', '<f8'), ('C', '<f8'), ..., ('n','<f8']) </code></pre> Now, lets say you want to remove from this array the 'ith' column. Is there a convenient way to do that? I'd like a it to work like delete: <pre class="prettyprint"><code>new_array = np.delete(old_array, 'i') </code></pre> Any ideas?

It's not quite a single function call, but the following shows one way to drop the i-th field: <pre class="prettyprint"><code>In [67]: a Out[67]: array([(1.0, 2.0, 3.0), (4.0, 5.0, 6.0)], dtype=[('A', '<f8'), ('B', '<f8'), ('C', '<f8')]) In [68]: i = 1 # Drop the 'B' field In [69]: names = list(a.dtype.names) In [70]: names Out[70]: ['A', 'B', 'C'] In [71]: new_names = names[:i] + names[i+1:] In [72]: new_names Out[72]: ['A', 'C'] In [73]: b = a[new_names] In [74]: b Out[74]: array([(1.0, 3.0), (4.0, 6.0)], dtype=[('A', '<f8'), ('C', '<f8')]) </code></pre> Wrapped up as a function: <pre class="prettyprint"><code>def remove_field_num(a, i): names = list(a.dtype.names) new_names = names[:i] + names[i+1:] b = a[new_names] return b </code></pre> It might be more natural to remove a given field name: <pre class="prettyprint"><code>def remove_field_name(a, name): names = list(a.dtype.names) if name in names: names.remove(name) b = a[names] return b </code></pre> Also, check out the <code>drop_rec_fields</code> function that is part of the <code>mlab</code> module of matplotlib. <hr> Update: See my answer at How to remove a column from a structured numpy array *without copying it*? for a method to create a view of subsets of the fields of a structured array without making a copy of the array.

How do you remove a column from a structured numpy array?

Tags:

python

numpy

Imagine you have a structured numpy array, generated from a csv with the first row as field names. The array has the form:

Click to copy

dtype([('A', '<f8'), ('B', '<f8'), ('C', '<f8'), ..., ('n','<f8'])

Now, lets say you want to remove from this array the 'ith' column. Is there a convenient way to do that?

I'd like a it to work like delete:

Click to copy

new_array = np.delete(old_array, 'i')

Any ideas?

228

asked Mar 22 '13 16:03

Dobbs_Head

2 Answers

It's not quite a single function call, but the following shows one way to drop the i-th field:

Click to copy

In [67]: a
Out[67]: 
array([(1.0, 2.0, 3.0), (4.0, 5.0, 6.0)], 
      dtype=[('A', '<f8'), ('B', '<f8'), ('C', '<f8')])

In [68]: i = 1   # Drop the 'B' field

In [69]: names = list(a.dtype.names)

In [70]: names
Out[70]: ['A', 'B', 'C']

In [71]: new_names = names[:i] + names[i+1:]

In [72]: new_names
Out[72]: ['A', 'C']

In [73]: b = a[new_names]

In [74]: b
Out[74]: 
array([(1.0, 3.0), (4.0, 6.0)], 
      dtype=[('A', '<f8'), ('C', '<f8')])

Wrapped up as a function:

Click to copy

def remove_field_num(a, i):
    names = list(a.dtype.names)
    new_names = names[:i] + names[i+1:]
    b = a[new_names]
    return b

It might be more natural to remove a given field name:

Click to copy

def remove_field_name(a, name):
    names = list(a.dtype.names)
    if name in names:
        names.remove(name)
    b = a[names]
    return b

Also, check out the drop_rec_fields function that is part of the mlab module of matplotlib.

Update: See my answer at How to remove a column from a structured numpy array *without copying it*? for a method to create a view of subsets of the fields of a structured array without making a copy of the array.

181

answered Oct 12 '22 23:10

Warren Weckesser

Having googled my way here and learned what I needed to know from Warren's answer, I couldn't resist posting a more succinct version, with the added option to remove multiple fields efficiently in one go:

Click to copy

def rmfield( a, *fieldnames_to_remove ):
    return a[ [ name for name in a.dtype.names if name not in fieldnames_to_remove ] ]

Examples:

Click to copy

a = rmfield(a, 'foo')
a = rmfield(a, 'foo', 'bar')  # remove multiple fields at once

Or if we're really going to golf it, the following is equivalent:

Click to copy

rmfield=lambda a,*f:a[[n for n in a.dtype.names if n not in f]]

answered Oct 13 '22 01:10

jez

Related questions
                            
                                Check if a function is a method of some object
                            
                                Dynamically add member function to an instance of a class in Python
                            
                                python -- measuring pixel brightness
                            
                                3D vector field in matplotlib
                            
                                eval calling lambda don't see self
                            
                                Is it possible to print a string at a certain screen position inside IDLE?
                            
                                HTTPS request in Python
                            
                                How to save dictionaries and arrays in the same archive (with numpy.savez)
                            
                                python: How do I capture a variable declared in a non global ancestral outer scope?
                            
                                Why does += of a list within a Python tuple raise TypeError but modify the list anyway? [duplicate]
                            
                                Can you recommend some Python HTTP client library? [closed]
                            
                                How to debug C extensions for Python on Windows
                            
                                imshow(img, cmap=cm.gray) shows a white for 128 value
                            
                                PyCrypto install error on Windows
                            
                                reloading module which has been imported to another module
                            
                                scipy with py2exe
                            
                                How to vectorize this python code?
                            
                                How can I configure ipython to display integers in hex format?
                            
                                Installing VTK for Python
                            
                                python - why is read-only property writable?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do you remove a column from a structured numpy array?

Tags:

python

numpy

Dobbs_Head

People also ask

2 Answers

Warren Weckesser

jez

Recent Activity

Donate For Us