Finding the indices of matching elements in a list in Python

Tags: python, find, list, indexing

I have a long list of floats ranging from 1 to 5, called "average", and I want to return the list of indices of the elements that are smaller than a or larger than b:

    def find(lst,a,b):
        result = []
        for x in lst:
            if x<a or x>b:
                i = lst.index(x)
                result.append(i)
        return result

    matches = find(average,2,4)

But surprisingly, the output for "matches" has a lot of repetitions in it, e.g. [2, 2, 10, 2, 2, 2, 19, 2, 10, 2, 2, 42, 2, 2, 10, 2, 2, 2, 10, 2, 2, ...].

Why is this happening?

715

asked May 22 '13 07:05

Logan Yang

2 Answers

You are using .index(), which only finds the first occurrence of your value in the list. So if you have the value 1.0 at index 2 and again at index 9, then .index(1.0) will always return 2, no matter how many times 1.0 occurs in the list.
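A minimal sketch of that behaviour, using a made-up list (`values` is a hypothetical name for illustration, not the asker's data):

```python
# Hypothetical sample data: 1.0 appears at indices 2 and 4.
values = [3.0, 2.5, 1.0, 3.5, 1.0]

# list.index() scans from the start and stops at the first match,
# so it reports index 2 regardless of which 1.0 you had in mind.
first = values.index(1.0)
print(first)  # 2

# Looking up the index of every 1.0 this way repeats the same answer.
repeated = [values.index(x) for x in values if x == 1.0]
print(repeated)  # [2, 2]
```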

Use enumerate() to add indices to your loop instead:

    def find(lst, a, b):
        result = []
        for i, x in enumerate(lst):
            if x<a or x>b:
                result.append(i)
        return result

You can collapse this into a list comprehension:

    def find(lst, a, b):
        return [i for i, x in enumerate(lst) if x<a or x>b]
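As a quick check, here is the list-comprehension version run on a small made-up list (the sample values are assumptions for illustration, not the asker's data). Each duplicate out-of-range value now reports its own position:

```python
def find(lst, a, b):
    # Indices of elements smaller than a or larger than b.
    return [i for i, x in enumerate(lst) if x < a or x > b]

# Hypothetical sample data with repeated out-of-range values.
average = [3.0, 2.5, 1.0, 4.5, 1.0, 3.2, 4.5]

matches = find(average, 2, 4)
print(matches)  # [2, 3, 4, 6]
```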

181

answered Sep 19 '22 07:09

Martijn Pieters

If you're doing a lot of this kind of thing, you should consider using numpy.

    In [56]: import random, numpy

    In [57]: lst = numpy.array([random.uniform(0, 5) for _ in range(1000)]) # example list

    In [58]: a, b = 1, 3

    In [59]: numpy.flatnonzero((lst > a) & (lst < b))[:10]
    Out[59]: array([ 0, 12, 13, 15, 18, 19, 23, 24, 26, 29])
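Note that the mask in this session selects values strictly between a and b. To match the question's condition (smaller than a or larger than b), the comparisons can be flipped and joined with `|`. A minimal sketch, assuming numpy is installed and using made-up data:

```python
import numpy as np

# Hypothetical sample data; the question's list is called "average".
average = np.array([3.0, 2.5, 1.0, 4.5, 1.0, 3.2, 4.5])
a, b = 2, 4

# Boolean mask: True where an element is below a or above b.
# The parentheses are required because | binds tighter than < and >.
mask = (average < a) | (average > b)

# flatnonzero returns the indices where the mask is True.
matches = np.flatnonzero(mask)
print(matches.tolist())  # [2, 3, 4, 6]
```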

In response to Seanny123's question, I used this timing code:

    import numpy, timeit, random

    a, b = 1, 3

    lst = numpy.array([random.uniform(0, 5) for _ in range(1000)])

    def numpy_way():
        numpy.flatnonzero((lst > 1) & (lst < 3))[:10]

    def list_comprehension():
        [e for e in lst if 1 < e < 3][:10]

    # timeit.timeit calls each function 1,000,000 times by default
    print(timeit.timeit(numpy_way))
    print(timeit.timeit(list_comprehension))

In this test, the numpy version was over 60 times faster.
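The ratio will vary with the machine, the list length, and what exactly is compared: the list comprehension in the timing code iterates over a numpy array element by element, which is typically slower than iterating over a plain Python list, and it collects values rather than indices. A sketch of a like-for-like comparison that returns indices in both cases follows. The names `plain_list` and `arr` and the choice of `number=1000` are assumptions for illustration, and no particular ratio is implied.

```python
import random
import timeit

import numpy as np

random.seed(0)  # fixed seed so repeated runs use the same data
plain_list = [random.uniform(0, 5) for _ in range(1000)]
arr = np.array(plain_list)
a, b = 1, 3

def numpy_way():
    # Vectorised mask, then the indices of the True entries.
    return np.flatnonzero((arr > a) & (arr < b))

def list_comprehension():
    # Pure-Python loop over a plain list, collecting indices.
    return [i for i, x in enumerate(plain_list) if a < x < b]

# number=1000 keeps the run short; timeit's default is 1,000,000 calls.
t_numpy = timeit.timeit(numpy_way, number=1000)
t_list = timeit.timeit(list_comprehension, number=1000)
print(f"numpy: {t_numpy:.4f}s  list comprehension: {t_list:.4f}s")
```

Both functions return the same indices, so the timings measure the same work done two ways.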

answered Sep 18 '22 07:09

Alex Coventry
