I have two numpy-arrays: <pre class="prettyprint"><code>p_a_colors=np.array([[0,0,0], [0,2,0], [119,103,82], [122,122,122], [122,122,122], [3,2,4]]) p_rem = np.array([[119,103,82], [122,122,122]]) </code></pre> I want to delete all the columns from p_a_colors that are in p_rem, so I get: <pre class="prettyprint"><code>p_r_colors=np.array([[0,0,0], [0,2,0], [3,2,4]]) </code></pre> I think, something should work like <pre class="prettyprint"><code>p_r_colors= np.delete(p_a_colors, np.where(np.all(p_a_colors==p_rem, axis=0)),0) </code></pre> but I just don't get the axis or [:] right. I know, that <pre class="prettyprint"><code>p_r_colors=copy.deepcopy(p_a_colors) for i in range(len(p_rem)): p_r_colors= np.delete(p_r_colors, np.where(np.all(p_r_colors==p_rem[i], axis=-1)),0) </code></pre> would work, but I am trying to avoid (python)loops, because I also want the performance right.

This is how I would do it: <pre class="prettyprint"><code>dtype = np.dtype((np.void, (p_a_colors.shape[1] * p_a_colors.dtype.itemsize))) mask = np.in1d(p_a_colors.view(dtype), p_rem.view(dtype)) p_r_colors = p_a_colors[~mask] >>> p_r_colors array([[0, 0, 0], [0, 2, 0], [3, 2, 4]]) </code></pre> You need to do the void dtype thing so that numpy compares rows as a whole. After that using the built-in set routines seems like the obvious way to go.

It's ugly, but <pre class="prettyprint"><code>tmp = reduce(lambda x, y: x | np.all(p_a_colors == y, axis=-1), p_rem, np.zeros(p_a_colors.shape[:1], dtype=np.bool)) indices = np.where(tmp)[0] np.delete(p_a_colors, indices, axis=0) </code></pre> (edit: corrected) <pre class="prettyprint"><code>>>> tmp = reduce(lambda x, y: x | np.all(p_a_colors == y, axis=-1), p_rem, np.zeros(p_a_colors.shape[:1], dtype=np.bool)) >>> >>> indices = np.where(tmp)[0] >>> >>> np.delete(p_a_colors, indices, axis=0) array([[0, 0, 0], [0, 2, 0], [3, 2, 4]]) >>> </code></pre>

find and delete from more-dimensional numpy array

Tags:

python

arrays

numpy

I have two numpy-arrays:

p_a_colors=np.array([[0,0,0],
                     [0,2,0],
                     [119,103,82],
                     [122,122,122],
                     [122,122,122],
                     [3,2,4]])

p_rem = np.array([[119,103,82],
                     [122,122,122]])

I want to delete all the columns from p_a_colors that are in p_rem, so I get:

p_r_colors=np.array([[0,0,0],
                    [0,2,0],
                    [3,2,4]])

I think, something should work like

p_r_colors= np.delete(p_a_colors, np.where(np.all(p_a_colors==p_rem, axis=0)),0)

but I just don't get the axis or [:] right.

I know, that

p_r_colors=copy.deepcopy(p_a_colors)
for i in range(len(p_rem)):
    p_r_colors= np.delete(p_r_colors, np.where(np.all(p_r_colors==p_rem[i], axis=-1)),0)

would work, but I am trying to avoid (python)loops, because I also want the performance right.

582

asked May 30 '13 14:05

a.j. tawleed

2 Answers

This is how I would do it:

dtype = np.dtype((np.void, (p_a_colors.shape[1] * 
                            p_a_colors.dtype.itemsize)))
mask = np.in1d(p_a_colors.view(dtype), p_rem.view(dtype))
p_r_colors = p_a_colors[~mask]

>>> p_r_colors
array([[0, 0, 0],
       [0, 2, 0],
       [3, 2, 4]])

You need to do the void dtype thing so that numpy compares rows as a whole. After that using the built-in set routines seems like the obvious way to go.

answered Oct 19 '22 22:10

Jaime

It's ugly, but

tmp = reduce(lambda x, y: x |  np.all(p_a_colors == y, axis=-1), p_rem, np.zeros(p_a_colors.shape[:1], dtype=np.bool))

indices = np.where(tmp)[0]

np.delete(p_a_colors, indices, axis=0)

(edit: corrected)

>>> tmp = reduce(lambda x, y: x |  np.all(p_a_colors == y, axis=-1), p_rem, np.zeros(p_a_colors.shape[:1], dtype=np.bool))
>>> 
>>> indices = np.where(tmp)[0]
>>> 
>>> np.delete(p_a_colors, indices, axis=0)
array([[0, 0, 0],
       [0, 2, 0],
       [3, 2, 4]])
>>>

answered Oct 19 '22 22:10

YXD

Related questions
                            
                                Scrapy with a nested array
                            
                                How can I make a class method return a new instance of itself?
                            
                                File download via Post form
                            
                                Numpy: fast calculations considering items' neighbors and their position inside the array
                            
                                Running Boto on Google App Engine (GAE)
                            
                                How to get a list of tags and create new tags with python and dulwich in git?
                            
                                Is there a way to know whether a Unicode string contains any Chinese/Japanese character in Python?
                            
                                How to call a shell script function/variable from python?
                            
                                python : template var without space
                            
                                Can't pickle : attribute lookup builtin.function failed
                            
                                python socket recv() and signals
                            
                                networkx:creating a subgraph induced from edges
                            
                                OpenERP fields.function() explanation [duplicate]
                            
                                Creating an XML document with BeautifulSoup
                            
                                How does one save python pandas scatter_matrix as a figure?
                            
                                incrementing defaultdict inside list comprehension (Python)
                            
                                Using relative vs absolute URL for STATIC_URL in Django
                            
                                Unit-testing a flask-principal application
                            
                                Why would "\n" become "^@" when writing Python in a .vim file?
                            
                                check whether a string is in a 2-GB list of strings in python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With