fast python numpy where functionality?

Tags: performance, python, for-loop, numpy, where

I am using NumPy's where function many times inside several for loops, and it is far too slow. Is there a faster way to get the same result? I have read that in-line for loops (list comprehensions) and binding functions to local names before the loops should help, but neither improved speed by much (< 1%). len(UNIQ_IDS) is about 800. emiss_data and obj_data are NumPy ndarrays with shape (2600, 5200). I profiled the code with import profile, and the where calls inside the for loops are a major bottleneck.

import numpy as np
max = np.max
where = np.where
MAX_EMISS = [max(emiss_data[where(obj_data == i)]) for i in UNIQ_IDS]

asked Aug 26 '13 20:08

weather guy

2 Answers

Can't you just do

emiss_data[obj_data == i]

? I'm not sure why you're using where at all.
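As a small sanity check of that suggestion, the sketch below compares the two indexing forms on made-up toy arrays (the values are illustrative and not from the question). It only shows that both forms select the same elements; it makes no claim about speed.

```python
import numpy as np

# Toy stand-ins for the real arrays (illustrative values only).
emiss_data = np.array([[0.1, 0.9, 0.4],
                       [0.7, 0.3, 0.8]])
obj_data = np.array([[1, 2, 1],
                     [2, 2, 1]])
i = 2

# np.where(mask) turns the boolean mask into a tuple of index arrays...
via_where = emiss_data[np.where(obj_data == i)]
# ...while indexing with the mask directly skips that intermediate step.
via_mask = emiss_data[obj_data == i]

same = np.array_equal(via_where, via_mask)
print(same)  # True
```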

answered Oct 21 '22 09:10

user2357112 supports Monica

It turns out that a pure Python loop can be much much faster than NumPy indexing (or calls to np.where) in this case.

Consider the following alternatives:

import numpy as np
import collections
import itertools as IT

shape = (2600,5200)
# shape = (26,52)
emiss_data = np.random.random(shape)
# randint's upper bound is exclusive; this replaces the deprecated random_integers(1, 800)
obj_data = np.random.randint(1, 801, size=shape)
UNIQ_IDS = np.unique(obj_data)

def using_where():
    max = np.max
    where = np.where
    MAX_EMISS = [max(emiss_data[where(obj_data == i)]) for i in UNIQ_IDS]
    return MAX_EMISS

def using_index():
    max = np.max
    MAX_EMISS = [max(emiss_data[obj_data == i]) for i in UNIQ_IDS]
    return MAX_EMISS

def using_max():
    MAX_EMISS = [(emiss_data[obj_data == i]).max() for i in UNIQ_IDS]
    return MAX_EMISS

def using_loop():
    result = collections.defaultdict(list)
    # built-in zip is lazy on Python 3 (IT.izip exists only on Python 2)
    for val, idx in zip(emiss_data.ravel(), obj_data.ravel()):
        result[idx].append(val)
    return [max(result[idx]) for idx in UNIQ_IDS]

def using_sort():
    uind = np.digitize(obj_data.ravel(), UNIQ_IDS) - 1
    vals = uind.argsort()
    count = np.bincount(uind)
    start = 0
    end = 0
    out = np.empty(count.shape[0])
    for ind, x in np.ndenumerate(count):
        end += x
        out[ind] = np.max(np.take(emiss_data, vals[start:end]))
        start += x
    return out

def using_split():
    uind = np.digitize(obj_data.ravel(), UNIQ_IDS) - 1
    vals = uind.argsort()
    count = np.bincount(uind)
    return [np.take(emiss_data, item).max()
            for item in np.split(vals, count.cumsum())[:-1]]

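# Compute the reference once; allclose also handles the ndarray returned by using_sort
expected = using_where()
for func in (using_index, using_max, using_loop, using_sort, using_split):
    assert np.allclose(expected, func())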

Here are the benchmarks, with shape = (2600,5200):

In [57]: %timeit using_loop()
1 loops, best of 3: 9.15 s per loop

In [90]: %timeit using_sort()
1 loops, best of 3: 9.33 s per loop

In [91]: %timeit using_split()
1 loops, best of 3: 9.33 s per loop

In [61]: %timeit using_index()
1 loops, best of 3: 63.2 s per loop

In [62]: %timeit using_max()
1 loops, best of 3: 64.4 s per loop

In [58]: %timeit using_where()
1 loops, best of 3: 112 s per loop

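Thus using_loop (pure Python) turns out to be about 12x faster than using_where (9.15 s versus 112 s) and about 7x faster than using_index. The sort-based using_sort and using_split variants run in essentially the same time as the loop.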
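The timings above come from IPython's %timeit. For anyone running outside IPython, here is a rough sketch of a comparable measurement with the standard-library timeit module. The small shape, the seed, the label range, and the best_time helper are all assumptions made for illustration, so the numbers it prints will not match the ones above.

```python
import timeit

import numpy as np

rng = np.random.default_rng(0)
shape = (26, 52)  # deliberately small so the sketch finishes quickly
emiss_data = rng.random(shape)
obj_data = rng.integers(1, 9, size=shape)  # labels 1..8 (upper bound exclusive)
UNIQ_IDS = np.unique(obj_data)

def using_where():
    return [np.max(emiss_data[np.where(obj_data == i)]) for i in UNIQ_IDS]

def using_index():
    return [emiss_data[obj_data == i].max() for i in UNIQ_IDS]

def best_time(func, repeat=3, number=10):
    """Hypothetical helper: best observed time per call, in seconds."""
    return min(timeit.repeat(func, repeat=repeat, number=number)) / number

timings = {f.__name__: best_time(f) for f in (using_where, using_index)}
for name, seconds in sorted(timings.items(), key=lambda kv: kv[1]):
    print(f"{name}: {seconds * 1e6:.1f} us per call")
```

Taking the minimum over several repeats mirrors the "best of 3" that %timeit reports.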

I'm not entirely sure why pure Python is faster than NumPy here. My guess is that the pure Python version zips (yes, pun intended) through both arrays once. It leverages the fact that despite all the fancy indexing, we really just want to visit each value once. Thus it side-steps the issue with having to determine exactly which group each value in emiss_data falls in. But this is just vague speculation. I didn't know it would be faster until I benchmarked.
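The "visit each value once" idea can also be written without a Python-level loop. The sketch below is one possible formulation, not something benchmarked in this answer: it sorts the values by label once and then takes the maximum of each run with np.maximum.reduceat. The function name grouped_max and the toy inputs are assumptions for illustration.

```python
import numpy as np

def grouped_max(values, labels):
    """Return (unique labels, maximum of values for each label)."""
    flat_values = values.ravel()
    flat_labels = labels.ravel()

    # Sort once so that equal labels sit next to each other.
    order = np.argsort(flat_labels, kind="stable")
    sorted_labels = flat_labels[order]
    sorted_values = flat_values[order]

    # Positions where a new run of identical labels begins.
    is_start = np.concatenate(([True], sorted_labels[1:] != sorted_labels[:-1]))
    starts = np.flatnonzero(is_start)

    # reduceat applies np.maximum over each slice [starts[k], starts[k + 1]).
    return sorted_labels[starts], np.maximum.reduceat(sorted_values, starts)

# Toy example (illustrative values only).
emiss = np.array([[1.0, 5.0, 2.0],
                  [7.0, 3.0, 4.0]])
objs = np.array([[1, 2, 1],
                 [3, 2, 3]])
ids, max_emiss = grouped_max(emiss, objs)
print(ids, max_emiss)
```

Whether this beats using_loop on the full (2600, 5200) arrays would need to be measured; the sort dominates the cost, much as in using_sort and using_split.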

answered Oct 21 '22 08:10

unutbu
