I am performing a large number of these calculations: <code>A == A[np.newaxis].T</code> where A is a dense numpy array which frequently has common values. For benchmarking purposes we can use: <pre class="prettyprint"><code>n = 30000 A = np.random.randint(0, 1000, n) A == A[np.newaxis].T </code></pre> When I perform this calculation, I run into memory issues. I believe this is because the output isn't in more efficient bitarray or np.packedbits format. A secondary concern is we are performing twice as many comparisons as necessary, since the resulting Boolean array is symmetric. The questions I have are: <ol> <li>Is it possible to produce the Boolean numpy array output in a more memory efficient fashion without sacrificing speed? The options I know about are bitarray and np.packedbits, but I only know how to apply these after the large Boolean array is created.</li> <li>Can we utilise the symmetry of our calculation to halve the number of comparisons processed, again without sacrificing speed?</li> </ol> I will need to be able to perform & and | operations on Boolean arrays output. I have tried bitarray, which is super-fast for these bitwise operations. But it is slow to pack np.ndarray -> bitarray and then unpack bitarray -> np.ndarray. [Edited to provide clarification.]

Here's one with <code>numba</code> to give us a NumPy boolean array as output - <pre class="prettyprint"><code>from numba import njit @njit def numba_app1(idx, n, s, out): for i,j in zip(idx[:-1],idx[1:]): s0 = s[i:j] c = 0 for p1 in s0[c:]: for p2 in s0[c+1:]: out[p1,p2] = 1 out[p2,p1] = 1 c += 1 return out def app1(A): s = A.argsort() b = A[s] n = len(A) idx = np.flatnonzero(np.r_[True,b[1:] != b[:-1],True]) out = np.zeros((n,n),dtype=bool) numba_app1(idx, n, s, out) out.ravel()[::out.shape[1]+1] = 1 return out </code></pre> Timings - <pre class="prettyprint"><code>In [287]: np.random.seed(0) ...: n = 30000 ...: A = np.random.randint(0, 1000, n) # Original soln In [288]: %timeit A == A[np.newaxis].T 1 loop, best of 3: 317 ms per loop # @Daniel F's soln-1 that skips assigning lower diagonal in output In [289]: %timeit sparse_outer_eq(A) 1 loop, best of 3: 450 ms per loop # @Daniel F's soln-2 (complete one) In [291]: %timeit sparse_outer_eq(A) 1 loop, best of 3: 634 ms per loop # Solution from this post In [292]: %timeit app1(A) 10 loops, best of 3: 66.9 ms per loop </code></pre>

Comparing numpy array with itself by element efficiently

Tags:

python

numpy

I am performing a large number of these calculations:

A == A[np.newaxis].T

where A is a dense numpy array which frequently has common values.

For benchmarking purposes we can use:

Click to copy

n = 30000
A = np.random.randint(0, 1000, n)
A == A[np.newaxis].T

When I perform this calculation, I run into memory issues. I believe this is because the output isn't in more efficient bitarray or np.packedbits format. A secondary concern is we are performing twice as many comparisons as necessary, since the resulting Boolean array is symmetric.

The questions I have are:

Is it possible to produce the Boolean numpy array output in a more memory efficient fashion without sacrificing speed? The options I know about are bitarray and np.packedbits, but I only know how to apply these after the large Boolean array is created.
Can we utilise the symmetry of our calculation to halve the number of comparisons processed, again without sacrificing speed?

I will need to be able to perform & and | operations on Boolean arrays output. I have tried bitarray, which is super-fast for these bitwise operations. But it is slow to pack np.ndarray -> bitarray and then unpack bitarray -> np.ndarray.

[Edited to provide clarification.]

227

asked Jan 16 '18 10:01

jpp

1 Answers

Here's one with numba to give us a NumPy boolean array as output -

Click to copy

from numba import njit

@njit
def numba_app1(idx, n, s, out):
    for i,j in zip(idx[:-1],idx[1:]):
        s0 = s[i:j]
        c = 0
        for p1 in s0[c:]:
            for p2 in s0[c+1:]:
                out[p1,p2] = 1
                out[p2,p1] = 1
            c += 1
    return out

def app1(A):
    s = A.argsort()
    b = A[s]
    n = len(A)
    idx = np.flatnonzero(np.r_[True,b[1:] != b[:-1],True])
    out = np.zeros((n,n),dtype=bool)
    numba_app1(idx, n, s, out)
    out.ravel()[::out.shape[1]+1] = 1
    return out

Timings -

Click to copy

In [287]: np.random.seed(0)
     ...: n = 30000
     ...: A = np.random.randint(0, 1000, n)

# Original soln
In [288]: %timeit A == A[np.newaxis].T
1 loop, best of 3: 317 ms per loop

# @Daniel F's soln-1 that skips assigning lower diagonal in output
In [289]: %timeit sparse_outer_eq(A)
1 loop, best of 3: 450 ms per loop

# @Daniel F's soln-2 (complete one)
In [291]: %timeit sparse_outer_eq(A)
1 loop, best of 3: 634 ms per loop

# Solution from this post
In [292]: %timeit app1(A)
10 loops, best of 3: 66.9 ms per loop

192

answered Sep 19 '22 01:09

Divakar

Related questions
                            
                                ValueError: invalid literal for int() with base 10: '196.41'
                            
                                Logistic Regression Gradient Descent [closed]
                            
                                reflecting every schema from postgres DB using SQLAlchemy
                            
                                How to upgrade to the latest Anaconda 5.0.1
                            
                                Remove add another from django admin
                            
                                How to display all images in a directory with flask [duplicate]
                            
                                Filter a 2D numpy array
                            
                                Sort CSV by column name
                            
                                Is there a function similar to OpenCV findContours that detects curves and replaces points with a spline?
                            
                                Django build video website similar to YouTube
                            
                                Python: How to Remove mouseCallback in OpenCV
                            
                                Scrapy: downloader/response_count vs response_received_count
                            
                                Boxplots with multiple categories with seaborn
                            
                                python (boto3) program to delete old snapshots in aws
                            
                                Limit GPU devices in Tensorflow
                            
                                Submitting Google Cloud ML Engine Jobs from Python Directly
                            
                                How do I specify server options?
                            
                                Getting an error importing Excel file into pandas selecting the usecols parameter
                            
                                Change default backend for matplotlib in Jupyter Ipython
                            
                                pydrive get only folders from list

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Comparing numpy array with itself by element efficiently

Tags:

python

numpy

jpp

People also ask

1 Answers

Divakar

Recent Activity

Donate For Us