Assuming that we have a large matrix A, and the indices of two matrix elements (c1, r1), (c2, r2) that we want to swap: <pre class="prettyprint"><code>import numpy as np A = np.random.rand(1000,1000) c1, r1 = 10, 10 c2, r2 = 20, 40 </code></pre> The pythonic way to do so would be: <pre class="prettyprint"><code>A[c1, r1], A[c2, r2] = A[c2, r2], A[c1, r1] </code></pre> However, this solution can be slow if you want to do a large number of swappings. Is there a more efficient way to swap two elements in a numpy array? Thanks in advance.

<h3>Preliminary answer, which does not work</h3> You can easily vectorize the swap operation, by using arrays of indexes (c1, r1, c2, r2) instead of iterating over lists of scalar indices. <pre class="prettyprint"><code>c1 = np.array(<all the "c1" values>, dtype=int) r1 = np.array(<all the "r1" values>, dtype=int) c2 = np.array(<all the "c2" values>, dtype=int) r2 = np.array(<all the "r2" values>, dtype=int) A[c1, r1], A[c2, r2] = A[c2, r2], A[c1, r1] </code></pre> Note this performs all the swaps in one go, which can be different than iteratively, if the order of the swapping makes a difference. For this reason, this is not a valid answer to your question. E.g. swapping p1 with p2, then p2 with p3, is different from swapping p1 and p2, and p2 and p3 in one go. In the latter case, both p1 and p3 get the original value of p2, and p2 gets the last of the values between p1 and p3, i.e. p3 (according to the order they appear in the index-array). However, since it is speed you're after, vectorizing the operation (in some way) must be the way to go. <hr> <h3>Adding correctness to the above solution</h3> So how can we perform vectorized swapping, while getting the output we need? We can take a hybrid approach, by breaking the lists of indexes into chunks (as few as possible), where each chunk only contains unique points, thus guaranteeing the order makes no difference. Swapping each chunk is done vercrorized-ly, and we only iterate over the chunks. Here's a sketch of how this can work: <pre class="prettyprint"><code>c1, r1 = np.array([ np.arange(10), np.arange(10) ]) c2, r2 = np.array([ [2,4,6,8,0,1,3,5,7,9], [9,1,6,8,2,2,2,2,7,0] ]) A = np.empty((15,15)) def get_chunk_boundry(c1, r1, c2, r2): a1 = c1 + 1j * r1 a2 = c2 + 1j * r2 set1 = set() set2 = set() for i, (x1, x2) in enumerate(zip(a1, a2)): if x1 in set2 or x2 in set1: return i set1.add(x1); set2.add(x2) return len(c1) while len(c1) > 0: i = get_chunk_boundry(c1, r1, c2, r2) c1b = c1[:i]; r1b = r1[:i]; c2b = c2[:i]; r2b = r2[:i] print 'swapping %d elements' % i A[c1b,r1b], A[c2b,r2b] = A[c2b,r2b], A[c1b,r1b] c1 = c1[i:]; r1 = r1[i:]; c2 = c2[i:]; r2 = r2[i:] </code></pre> Slicing here will be faster if you store the indices as a 2dim array (N x 4) to begin with.

Efficient swapping of elements in numpy array

Tags:

performance

python

arrays

numpy

Assuming that we have a large matrix A, and the indices of two matrix elements (c1, r1), (c2, r2) that we want to swap:

Click to copy

import numpy as np
A = np.random.rand(1000,1000)
c1, r1 = 10, 10
c2, r2 = 20, 40

The pythonic way to do so would be:

Click to copy

A[c1, r1], A[c2, r2] = A[c2, r2], A[c1, r1]

However, this solution can be slow if you want to do a large number of swappings.

Is there a more efficient way to swap two elements in a numpy array?

Thanks in advance.

956

asked Feb 20 '15 11:02

lackadaisical

1 Answers

Preliminary answer, which does not work

You can easily vectorize the swap operation, by using arrays of indexes (c1, r1, c2, r2) instead of iterating over lists of scalar indices.

Click to copy

c1 = np.array(<all the "c1" values>, dtype=int)
r1 = np.array(<all the "r1" values>, dtype=int)
c2 = np.array(<all the "c2" values>, dtype=int)
r2 = np.array(<all the "r2" values>, dtype=int)
A[c1, r1], A[c2, r2] = A[c2, r2], A[c1, r1]

Note this performs all the swaps in one go, which can be different than iteratively, if the order of the swapping makes a difference. For this reason, this is not a valid answer to your question.

E.g. swapping p1 with p2, then p2 with p3, is different from swapping p1 and p2, and p2 and p3 in one go. In the latter case, both p1 and p3 get the original value of p2, and p2 gets the last of the values between p1 and p3, i.e. p3 (according to the order they appear in the index-array).

However, since it is speed you're after, vectorizing the operation (in some way) must be the way to go.

Adding correctness to the above solution

So how can we perform vectorized swapping, while getting the output we need? We can take a hybrid approach, by breaking the lists of indexes into chunks (as few as possible), where each chunk only contains unique points, thus guaranteeing the order makes no difference. Swapping each chunk is done vercrorized-ly, and we only iterate over the chunks.

Here's a sketch of how this can work:

Click to copy

c1, r1 = np.array([ np.arange(10), np.arange(10) ])
c2, r2 = np.array([ [2,4,6,8,0,1,3,5,7,9], [9,1,6,8,2,2,2,2,7,0] ])
A = np.empty((15,15))

def get_chunk_boundry(c1, r1, c2, r2):
    a1 = c1 + 1j * r1
    a2 = c2 + 1j * r2
    set1 = set()
    set2 = set()
    for i, (x1, x2) in enumerate(zip(a1, a2)):
        if x1 in set2 or x2 in set1:
            return i
        set1.add(x1); set2.add(x2)
    return len(c1)

while len(c1) > 0:
    i = get_chunk_boundry(c1, r1, c2, r2)
    c1b = c1[:i]; r1b = r1[:i]; c2b = c2[:i]; r2b = r2[:i]
    print 'swapping %d elements' % i
    A[c1b,r1b], A[c2b,r2b] = A[c2b,r2b], A[c1b,r1b]
    c1 = c1[i:]; r1 = r1[i:]; c2 = c2[i:]; r2 = r2[i:]

Slicing here will be faster if you store the indices as a 2dim array (N x 4) to begin with.

158

answered Oct 27 '22 21:10

shx2

Related questions
                            
                                Using matplotlib on headless Ubuntu 14.04 Server
                            
                                Django is adding quotes to cookies with colons
                            
                                Python gcd calulation of rsa
                            
                                Images not copied to output folder in Pelican
                            
                                Update properties of a kivy widget while running code
                            
                                Flask, Python - Increase timeout length on Heroku app [duplicate]
                            
                                How can I make my Python script faster?
                            
                                avoid sub-modules and external packages in a module's namespace
                            
                                How can you generate a POLLPRI event on a regular file?
                            
                                matplotlib FuncAnimation: when to stop?
                            
                                How to check if an IAM access key has specific permissions?
                            
                                How to tell pip to install test dependencies?
                            
                                Easiest and quickest way to increase Django's default username max length from 30 to 75
                            
                                SqlAlchemy join on tables with no foreign keys
                            
                                Bokeh logarithmic scale for Bar chart
                            
                                ImportError: No module named django.core.wsgi
                            
                                tools to convert jsonschema into Django REST serializisers?
                            
                                New index level name after DataFrame.stack()
                            
                                Running Python 3 from Light Table
                            
                                Prefer BytesIO or bytes for internal interface in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Efficient swapping of elements in numpy array

Tags:

performance

python

arrays

numpy

lackadaisical

People also ask

1 Answers

Preliminary answer, which does not work

Adding correctness to the above solution

shx2

Recent Activity

Donate For Us