Set numpy array elements to zero if they are above a specific threshold

Tags: python, arrays, numpy

Say I have a NumPy array consisting of 10 elements, for example:

a = np.array([2, 23, 15, 7, 9, 11, 17, 19, 5, 3])

Now I want to efficiently set all a values higher than 10 to 0, so I'll get:

[2, 0, 0, 7, 9, 0, 0, 0, 5, 3]

Because I currently use a for loop, which is very slow:

# Zero values below "threshold value".
def flat_values(sig, tv):
    """
    :param sig: signal.
    :param tv: threshold value.
    :return:
    """
    for i in np.arange(np.size(sig)):
        if sig[i] < tv:
            sig[i] = 0
    return sig

How can I achieve that in the most efficient way, having in mind big arrays of, say, 10^6 elements?

219

asked Feb 10 '15 11:02

bluevoxel

2 Answers

In [7]: a = np.array([2, 23, 15, 7, 9, 11, 17, 19, 5, 3])

In [8]: a[a > 10] = 0

In [9]: a
Out[9]: array([2, 0, 0, 7, 9, 0, 0, 0, 5, 3])
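The assignment above modifies a in place. If the original array has to stay intact, one possible variant (a minimal sketch, not part of the original answer) is np.where, which builds a new array instead; the names thresh and b are only illustrative:

```python
import numpy as np

a = np.array([2, 23, 15, 7, 9, 11, 17, 19, 5, 3])
thresh = 10  # illustrative threshold, matching the question's example

# np.where(condition, x, y) picks x where condition is True, y elsewhere.
# It returns a new array, so `a` itself is left unchanged.
b = np.where(a > thresh, 0, a)

print(b)  # [2 0 0 7 9 0 0 0 5 3]
print(a)  # [ 2 23 15  7  9 11 17 19  5  3]
```

This costs one extra array of the same size, so the in-place form is probably the better choice when memory matters.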

147

answered Sep 23 '22 22:09

unutbu

Generally, list comprehensions are faster than explicit for loops in Python, because the interpreter can skip some of the bookkeeping that a regular for loop needs:

a = [0 if a_ > thresh else a_ for a_ in a]

but, as @unutbu correctly pointed out, NumPy supports indexing with a boolean mask, and an element-wise comparison gives you exactly such a mask, so:

super_threshold_indices = a > thresh
a[super_threshold_indices] = 0

would be even faster.
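To check the ordering on your own machine, a rough timing sketch along these lines might help. The function names, the array size and the random test data are assumptions made for illustration, and the actual numbers will vary with hardware and NumPy version:

```python
import timeit

import numpy as np

rng = np.random.default_rng(0)
data = rng.integers(0, 100, size=100_000)  # illustrative test data
thresh = 10


def with_loop(arr):
    """Element-by-element Python loop, as in the question."""
    out = arr.copy()
    for i in range(out.size):
        if out[i] > thresh:
            out[i] = 0
    return out


def with_comprehension(arr):
    """List comprehension, converted back to an array."""
    return np.array([0 if v > thresh else v for v in arr])


def with_mask(arr):
    """Boolean-mask assignment."""
    out = arr.copy()
    out[out > thresh] = 0
    return out


for fn in (with_loop, with_comprehension, with_mask):
    seconds = timeit.timeit(lambda: fn(data), number=3)
    print(f"{fn.__name__}: {seconds / 3:.4f} s per call")
```

Each function works on a copy so that repeated runs start from the same input; the copy adds a small constant cost to every variant.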

Generally, when applying operations to vectors of data, have a look at NumPy's universal functions (ufuncs), which often perform much better than Python functions that you map over the data using any native mechanism.
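As a small illustration of what a ufunc is (a sketch, with np.sqrt chosen only as an arbitrary example), the comparison a > thresh is itself carried out by the ufunc np.greater, and a ufunc such as np.sqrt does in one call what would otherwise need a Python-level mapping of math.sqrt:

```python
import math

import numpy as np

a = np.array([2, 23, 15, 7, 9, 11, 17, 19, 5, 3])
thresh = 10  # illustrative threshold

# The > operator on arrays dispatches to the ufunc np.greater.
mask = np.greater(a, thresh)
same_mask = a > thresh

# A ufunc applies the operation to every element in compiled code ...
roots_ufunc = np.sqrt(a)
# ... whereas this version calls a Python function once per element.
roots_mapped = np.array([math.sqrt(v) for v in a])

print(isinstance(np.greater, np.ufunc))  # True
```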

answered Sep 21 '22 22:09

Marcus Müller
