Faster for-loops with arrays in Python

Tags:

N, M = 1000, 4000000
a = np.random.uniform(0, 1, (N, M))
k = np.random.randint(0, N, (N, M))

out = np.zeros((N, M))
for i in range(N):
    for j in range(M):
        out[k[i, j], j] += a[i, j]

I work with very long for-loops; %%timeit on above with pass replacing the operation yields

1min 19s ± 663 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

this is unacceptable in context (C++ took 6.5 sec). There's no reason for above to be done with Python objects; arrays have well-defined types. Implementing this in C/C++ as an extension is an overkill on both developer and user ends; I'm just passing arrays to loop and do arithmetic on.

Is there a way to tell Numpy "move this logic to C", or another library that can handle nested loops involving only arrays? I seek it for the general case, not workarounds for this specific example (but if you have one I can open a separate Q&A).

504

asked Oct 26 '20 16:10

OverLordGoldDragon

1 Answers

This is basically the idea behind Numba. Not as fast as C, but it can get close... It uses a jit compiler to compile python code to machine and it's compatible with most Numpy functions. (In the docs you find all the details)

import numpy as np
from numba import njit


@njit
def f(N, M):
    a = np.random.uniform(0, 1, (N, M))
    k = np.random.randint(0, N, (N, M))

    out = np.zeros((N, M))
    for i in range(N):
        for j in range(M):
            out[k[i, j], j] += a[i, j]
    return out


def f_python(N, M):
    a = np.random.uniform(0, 1, (N, M))
    k = np.random.randint(0, N, (N, M))

    out = np.zeros((N, M))
    for i in range(N):
        for j in range(M):
            out[k[i, j], j] += a[i, j]
    return out

Pure Python:

%%timeit

N, M = 100, 4000
f_python(M, N)

338 ms ± 12.6 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

With Numba:

%%timeit

N, M = 100, 4000
f(M, N)

12 ms ± 534 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

132

answered Oct 01 '22 22:10

dzang

Related questions
                            
                                What method does Python call when I access an attribute of a class via the class name?
                            
                                How do you use pipenv in a GitHub action?
                            
                                Cascade multiple RNN models for N-dimensional output
                            
                                Can't find model 'en_core_web_md'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory
                            
                                How to save and edit server rendering data?
                            
                                Installing socketio module on python3 seems to be corrupting pip
                            
                                Identify the first and all non-zero values in every row in Pandas DataFrame
                            
                                How to convert a sklearn pipeline into a pyspark pipeline?
                            
                                Delete diagonals of zero elements
                            
                                Is there a way to use Python 3.9 type hinting in its previous versions?
                            
                                Kivy sounds do not play on android device even though they play fine on laptop
                            
                                enabling CORS Google Cloud Function (Python)
                            
                                How to read, format, sort, and save a csv file, without pandas
                            
                                what is the difference between using softmax as a sequential layer in tf.keras and softmax as an activation function for a dense layer?
                            
                                Updating dataframe value based on list
                            
                                How to update a pandas dataframe, from multiple API calls
                            
                                How to I compute matching features between high resolution images?
                            
                                How to register typing.Callable with Python @singledispatch?
                            
                                tf.newaxis operation in TensorFlow
                            
                                'tuple' object has no attribute '_committed' error while updating image objects?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Faster for-loops with arrays in Python

Tags:

performance

python

for-loop

python-3.x

numpy

OverLordGoldDragon

People also ask

1 Answers

dzang

Recent Activity

Donate For Us