Quick question, which is more efficient for finding the smallest number (float) in a long list (10000+ elements) is it <pre class="prettyprint"><code>min(mylist) </code></pre> or <pre class="prettyprint"><code>mylist.sort() </code></pre> and then returning <pre class="prettyprint"><code>mylist[0] </code></pre> or something else... thanks!

If the list is already populated, <code>min()</code> is the most efficient way. There are some tricks you might use in special scenarios: <ul> <li>If you build the list from scratch, simply keep the smallest item yet in an external variable, so that the answer will be given in <code>O(1)</code>.</li> <li>If there are only Floats in the list, use an Array which gives better performance.</li> <li>You can keep the list sorted using bisect.</li> <li>Use a Python Heap, which even has an efficient implementation of <code>min()</code>. Make sure you understand the effects, mainly a slower insertion. (credit: interjay)</li> </ul>

most efficent way of finding the minimum float in a python list

Tags:

python

list

Quick question, which is more efficient for finding the smallest number (float) in a long list (10000+ elements)

is it

min(mylist)

mylist.sort()

and then returning

mylist[0]

or something else...

thanks!

500

asked Feb 23 '12 12:02

user1228368

2 Answers

Frst, if you care about performance in Python (which isn't always a sensible thing to care about, but that's another conversation), you should be using the timeit module. Even in C it's hard to predict how certain functions will behave after compilation, and it's harder in Python. People are often confidently expressing opinions about which functions are faster which are data-dependent. Then -- by using timeit, I mean -- you could've found out yourself.

Second, if you really care about performance on lists of floats, you shouldn't be using lists at all, but numpy arrays. Using IPython here, under Python 2.7.2, which makes timing things easy:

In [41]: import random, numpy
In [42]: a = [0.1*i for i in range(10**5)]
In [43]: timeit min(a)
100 loops, best of 3: 4.55 ms per loop
In [44]: timeit sorted(a)[0]
100 loops, best of 3: 4.57 ms per loop
In [45]: random.shuffle(a)
In [46]: timeit min(a)
100 loops, best of 3: 6.06 ms per loop
In [47]: timeit min(a) # to make sure it wasn't a fluke
100 loops, best of 3: 6.07 ms per loop
In [48]: timeit sorted(a)[0]
10 loops, best of 3: 65.9 ms per loop
In [49]: b = numpy.array(a)
In [50]: timeit b.min()
10000 loops, best of 3: 97.5 us per loop

And we note a few things. (1) Python's sort (timsort) works very well on data which has sorted runs, so sorting an already sorted list has almost no penalty. (2) Sorting a random list, on the other hand, is very much slower, and this will only get worse as the data gets larger. (3) Numpy.min() on a float array works sixty times faster than min on a Python list, because it doesn't have to be as general.

answered Sep 28 '22 02:09

DSM

If the list is already populated, min() is the most efficient way.

There are some tricks you might use in special scenarios:

If you build the list from scratch, simply keep the smallest item yet in an external variable, so that the answer will be given in O(1).
If there are only Floats in the list, use an Array which gives better performance.
You can keep the list sorted using bisect.
Use a Python Heap, which even has an efficient implementation of min(). Make sure you understand the effects, mainly a slower insertion. (credit: interjay)

answered Sep 28 '22 00:09

Adam Matan

Related questions
                            
                                Converting a string into a list in Python
                            
                                Python Error Catching & FTP
                            
                                Python: Unicode and ElementTree.parse
                            
                                How do I create a numpy array from string?
                            
                                md5 from pil object
                            
                                Is it possible to override .objects on a django model?
                            
                                how to direct output into a txt file in python in windows
                            
                                Optimize non-abundant sums algorithm
                            
                                Django Custom File Storage system
                            
                                Rendering Plaintext as HTML maintaining whitespace – without <pre>
                            
                                python listing dirs in a different order based upon platform
                            
                                How to convert a string list into an integer in python [duplicate]
                            
                                Getting HTML with Pycurl
                            
                                Python 3.2 - readline() is skipping lines in source file
                            
                                Call super().__init__() in classes derived from `object`?
                            
                                change some lowercase letters to uppercase in string
                            
                                How to fill a list
                            
                                Python - Access object attributes as in a dictionary
                            
                                Leave arguments untouched with argparse
                            
                                3d rotation on image

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With