From what I've read, numba can significantly speed up a python program. Could my program's time efficiency be increased using numba?
import numpy as np

def f_big(A, k, std_A, std_k, mean_A=10, mean_k=0.2, hh=100):
    return (1 / (std_A * std_k * 2 * np.pi)) * A * (hh / 50) ** k * np.exp(-(k - mean_k) ** 2 / (2 * std_k ** 2) - (A - mean_A) ** 2 / (2 * std_A ** 2))

outer_sum = 0
dk = 0.000001
for k in np.arange(dk, 0.4, dk):
    inner_sum = 0
    for A in np.arange(dk, 20, dk):
        inner_sum += dk * f_big(A, k, 1e-5, 1e-5)
    outer_sum += inner_sum * dk

print(outer_sum)
For certain types of computation, in particular array-focused code, the Numba library can significantly speed up your code. Sometimes you'll need to tweak it a bit, sometimes it'll just work with no changes. And when it works, it's a very transparent speed fix.
For the uninitiated: Numba is an open-source JIT compiler that translates a subset of Python and NumPy code into optimized machine code using the LLVM compiler library. In short, Numba makes Python/NumPy code run faster by compiling it to native machine code.
Yes, this is the sort of problem that Numba really works for. I changed your value of dk
because it wasn't sensible for a simple demonstration. Here is the code:
import numpy as np
import numba as nb

def f_big(A, k, std_A, std_k, mean_A=10, mean_k=0.2, hh=100):
    return (1 / (std_A * std_k * 2 * np.pi)) * A * (hh / 50) ** k * np.exp(-(k - mean_k) ** 2 / (2 * std_k ** 2) - (A - mean_A) ** 2 / (2 * std_A ** 2))

def func():
    outer_sum = 0
    dk = 0.01  # 0.000001
    for k in np.arange(dk, 0.4, dk):
        inner_sum = 0
        for A in np.arange(dk, 20, dk):
            inner_sum += dk * f_big(A, k, 1e-5, 1e-5)
        outer_sum += inner_sum * dk
    return outer_sum

@nb.jit(nopython=True)
def f_big_nb(A, k, std_A, std_k, mean_A=10, mean_k=0.2, hh=100):
    return (1 / (std_A * std_k * 2 * np.pi)) * A * (hh / 50) ** k * np.exp(-(k - mean_k) ** 2 / (2 * std_k ** 2) - (A - mean_A) ** 2 / (2 * std_A ** 2))

@nb.jit(nopython=True)
def func_nb():
    outer_sum = 0
    dk = 0.01  # 0.000001
    X = np.arange(dk, 0.4, dk)
    Y = np.arange(dk, 20, dk)
    for i in range(X.shape[0]):
        k = X[i]  # faster to do an indexed lookup than to iterate over the array directly
        inner_sum = 0
        for j in range(Y.shape[0]):
            A = Y[j]
            inner_sum += dk * f_big_nb(A, k, 1e-5, 1e-5)
        outer_sum += inner_sum * dk
    return outer_sum
And then timings:
In [7]: np.allclose(func(), func_nb())
Out[7]: True
In [8]: %timeit func()
1 loops, best of 3: 222 ms per loop
In [9]: %timeit func_nb()
The slowest run took 419.10 times longer than the fastest. This could mean that an intermediate result is being cached
1000 loops, best of 3: 362 µs per loop
So the Numba version is roughly 600 times faster on my laptop.
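As a cross-check (my addition, not part of the original answer): since f_big is built from NumPy ufuncs, the same double Riemann sum can also be computed without any explicit Python loops, by evaluating f_big on a broadcasted 2-D grid of (k, A) values and summing once. This is a sketch of that alternative, using the same dk = 0.01 as above:

```python
import numpy as np

def f_big(A, k, std_A, std_k, mean_A=10, mean_k=0.2, hh=100):
    return (1 / (std_A * std_k * 2 * np.pi)) * A * (hh / 50) ** k * np.exp(-(k - mean_k) ** 2 / (2 * std_k ** 2) - (A - mean_A) ** 2 / (2 * std_A ** 2))

def func_vec(dk=0.01):
    # Column vector of k values and row vector of A values: f_big then
    # broadcasts to a full 2-D grid, and the double sum collapses to one
    # .sum() multiplied by the cell area dk * dk.
    k = np.arange(dk, 0.4, dk)[:, None]
    A = np.arange(dk, 20, dk)[None, :]
    return f_big(A, k, 1e-5, 1e-5).sum() * dk * dk
```

Whether this beats the Numba version depends on grid size and memory (it materializes the whole grid at once), but it is a useful sanity check and needs no extra dependency.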