
Why does the efficiency of numpy not scale

I have been comparing the relative efficiency of numpy versus Python list comprehensions in multiplying together arrays of random numbers. (Python 3.4/Spyder, Windows and Ubuntu).

As one would expect, for all but the smallest arrays numpy rapidly outperforms a list comprehension, and for increasing array length you get the expected sigmoid curve for performance. But the sigmoid is far from smooth, which I am struggling to understand.

Obviously there is a certain amount of quantization noise for shorter array lengths, but I am getting unexpectedly noisy results, particularly under Windows. The figures are the mean of 100 runs at each array length, so any transient effects should have been smoothed out (or so I would have thought).
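For reference, a minimal sketch of the kind of comparison described above (the names and structure here are illustrative, not the original benchmark code): each timing is the mean of 100 runs, and the printed value is the list-comprehension time divided by the numpy time, assuming that is the ratio reported in the tables below.

    import random
    import time
    import numpy as np

    RUNS = 100

    def mean_time(func, runs=RUNS):
        # Return the mean wall-clock time of `runs` calls to func().
        start = time.perf_counter()
        for _ in range(runs):
            func()
        return (time.perf_counter() - start) / runs

    for n in (1, 10, 100, 1000, 10000, 100000):
        xs = [random.random() for _ in range(n)]
        ys = [random.random() for _ in range(n)]
        a, b = np.array(xs), np.array(ys)
        t_list = mean_time(lambda: [x * y for x, y in zip(xs, ys)])
        t_numpy = mean_time(lambda: a * b)
        print(n, t_list / t_numpy)   # ratio > 1 means numpy is faster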

[Chart: Numpy and Python list performance comparison]

The figures below show the speed ratio of numpy relative to a list comprehension when multiplying arrays of differing lengths.

Array Length    Windows     Ubuntu
           1        0.2        0.4
           2        2.0        0.6
           5        1.0        0.5
          10        3.0        1.0
          20        0.3        0.8
          50        3.5        1.9
         100        3.5        1.9
         200       10.0        3.0
         500        4.6        6.0
       1,000       13.6        6.9
       2,000        9.2        8.2
       5,000       14.6       10.4
      10,000       12.1       11.1
      20,000       12.9       11.6
      50,000       13.4       11.4
     100,000       13.4       12.0
     200,000       12.8       12.4
     500,000       13.0       12.3
   1,000,000       13.3       12.4
   2,000,000       13.6       12.0
   5,000,000       13.6       11.9

So I guess my question is: can anyone explain why the results, particularly under Windows, are so noisy? I have run the tests multiple times, but the results always seem to be exactly the same.

UPDATE: At Reblochon Masque's suggestion I have disabled garbage collection, which smooths out the Windows performance somewhat, but the curves are still lumpy.
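For reference, disabling the collector around the timing loop is straightforward; the run_benchmarks name below is only a stand-in for whatever is actually being timed:

    import gc

    def run_benchmarks():
        # stand-in for the actual timing loop being measured
        return [x * x for x in range(100000)]

    gc.disable()          # suspend automatic garbage collection during timing
    try:
        run_benchmarks()
    finally:
        gc.enable()       # restore normal collection afterwards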

[Chart: Numpy and Python list performance comparison, garbage collection disabled]

Array Length    Windows     Ubuntu
           1        0.1        0.3
           2        0.6        0.4
           5        0.3        0.4
          10        0.5        0.5
          20        0.6        0.5
          50        0.8        0.7
         100        1.6        1.1
         200        1.3        1.7
         500        3.7        3.2
       1,000        3.9        4.8
       2,000        6.5        6.6
       5,000       11.5        9.2
      10,000       10.8       10.7
      20,000       12.1       11.4
      50,000       13.3       12.4
     100,000       13.5       12.6
     200,000       12.8       12.6
     500,000       12.9       12.3
   1,000,000       13.3       12.3
   2,000,000       13.6       12.0
   5,000,000       13.6       11.8

UPDATE

At @Sid's suggestion, I've restricted it to running on a single core on each machine. The curves are slightly smoother (particularly the Linux one), but they still show the inflexions and some noise, particularly under Windows.

(It was actually the inflexions that I was originally going to post about, as they appear consistently in the same places.)

[Chart: Numpy and Python list performance comparison, garbage collection disabled and running on 1 CPU]

Array Length    Windows     Ubuntu
           1        0.3        0.3
           2        0.0        0.4
           5        0.5        0.4
          10        0.6        0.5
          20        0.3        0.5
          50        0.9        0.7
         100        1.0        1.1
         200        2.8        1.7
         500        3.7        3.3
       1,000        3.3        4.7
       2,000        6.5        6.7
       5,000       11.0        9.6
      10,000       11.0       11.1
      20,000       12.7       11.8
      50,000       12.9       12.8
     100,000       14.3       13.0
     200,000       12.6       13.1
     500,000       12.6       12.6
   1,000,000       13.0       12.6
   2,000,000       13.4       12.4
   5,000,000       13.6       12.2


asked Apr 05 '16 by TimGJ


1 Answer

The garbage collector explains the bulk of it. The rest could be fluctuation caused by other programs running on your machine. Try turning most things off, running the bare minimum, and testing again. Since you are using datetime (i.e. wall-clock time), the measurements also include any processor context switches.
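As an aside, one way to reduce the influence of context switches (a sketch, not the original code) is to measure per-process CPU time rather than wall-clock time:

    import time

    def do_work():
        # stand-in for the multiplication being benchmarked
        return sum(i * i for i in range(100000))

    start = time.process_time()              # CPU time used by this process only
    do_work()
    elapsed = time.process_time() - start    # excludes time spent in other processes
    print(elapsed)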

You could also try pinning the process to a single processor using a Unix call; that might smooth it out further. On Ubuntu it can be done as described here: https://askubuntu.com/a/483827

On Windows, processor affinity can be set as described here: http://www.addictivetips.com/windows-tips/how-to-set-processor-affinity-to-an-application-in-windows/
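From inside the script itself, a cross-platform way to do the same pinning (assuming the third-party psutil package is available) might look like:

    import psutil

    # Restrict the current process to CPU core 0 before benchmarking.
    # On Linux alone, os.sched_setaffinity(0, {0}) achieves the same thing,
    # as does launching with `taskset -c 0 python script.py`.
    psutil.Process().cpu_affinity([0])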

answered Sep 17 '22 by Sid