I have written the following code in Python, in order to estimate the value of <code>Pi</code>. It is called Monte Carlo method. Obviously by increasing the number of samples the code becomes slower and I assume that the slowest part of the code is in the sampling part. How can I make it faster? <pre class="prettyprint"><code>from __future__ import division import numpy as np a = 1 n = 1000000 s1 = np.random.uniform(0,a,n) s2 = np.random.uniform(0,a,n) ii=0 jj=0 for item in range(n): if ((s1[item])**2 + (s2[item])**2) < 1: ii = ii + 1 print float(ii*4/(n)) </code></pre> Do you suggest other (presumably faster) codes?

The bottleneck here is actually your <code>for</code> loop. Python <code>for</code> loops are relatively slow, so if you need to iterate over a million items, you can gain a lot of speed by avoiding them altogether. In this case, it's quite easy. Instead of this: <pre class="prettyprint"><code>for item in range(n): if ((s1[item])**2 + (s2[item])**2) < 1: ii = ii + 1 </code></pre> do this: <pre class="prettyprint"><code>ii = ((s1 ** 2 + s2 ** 2) < 1).sum() </code></pre> This works because <code>numpy</code> has built-in support for optimized array operations. The looping occurs in <code>c</code> instead of python, so it's much faster. I did a quick test so you can see the difference: <pre class="prettyprint"><code>>>> def estimate_pi_loop(x, y): ... total = 0 ... for i in xrange(len(x)): ... if x[i] ** 2 + y[i] ** 2 < 1: ... total += 1 ... return total * 4.0 / len(x) ... >>> def estimate_pi_numpy(x, y): ... return ((x ** 2 + y ** 2) < 1).sum() ... >>> %timeit estimate_pi_loop(x, y) 1 loops, best of 3: 3.33 s per loop >>> %timeit estimate_pi_numpy(x, y) 100 loops, best of 3: 10.4 ms per loop </code></pre> Here are a few examples of the kinds of operations that are possible, just so you have a sense of how this works. Squaring an array: <pre class="prettyprint"><code>>>> a = numpy.arange(5) >>> a ** 2 array([ 0, 1, 4, 9, 16]) </code></pre> Adding arrays: <pre class="prettyprint"><code>>>> a + a array([0, 2, 4, 6, 8]) </code></pre> Comparing arrays: <pre class="prettyprint"><code>>>> a > 2 array([False, False, False, True, True], dtype=bool) </code></pre> Summing boolean values: <pre class="prettyprint"><code>>>> (a > 2).sum() 2 </code></pre> As you probably realize, there are faster ways to estimate Pi, but I will admit that I've always admired the simplicity and effectiveness of this method.

You have assigned numpy arrays, so you should use those to your advantage. <pre class="prettyprint"><code>for item in range(n): if ((s1[item])**2 + (s2[item])**2) < 1: ii = ii + 1 </code></pre> becomes <pre class="prettyprint"><code>s1sqr = s1*s1 s2sqr = s2*s2 s_sum = s1sqr + s2sqr s_sum_bool = s_sum < 1 ii = s_sum_bool.sum() print float(ii*4/(n)) </code></pre> Where you are squaring the arrays, summing them, checking if the sum is less than 1 and then summing the boolean values (false = 0, true = 1) to get the total number that meet the criteria.

How to increase the performance for estimating `Pi`in Python

Tags:

performance

python

pi

I have written the following code in Python, in order to estimate the value of Pi. It is called Monte Carlo method. Obviously by increasing the number of samples the code becomes slower and I assume that the slowest part of the code is in the sampling part. How can I make it faster?

from __future__ import division
import numpy as np

a = 1
n = 1000000

s1 = np.random.uniform(0,a,n)
s2 = np.random.uniform(0,a,n)

ii=0
jj=0

for item in range(n):
    if ((s1[item])**2 + (s2[item])**2) < 1:
        ii = ii + 1

print float(ii*4/(n))

Do you suggest other (presumably faster) codes?

320

asked Jun 12 '15 15:06

NKN

2 Answers

The bottleneck here is actually your for loop. Python for loops are relatively slow, so if you need to iterate over a million items, you can gain a lot of speed by avoiding them altogether. In this case, it's quite easy. Instead of this:

for item in range(n):
    if ((s1[item])**2 + (s2[item])**2) < 1:
        ii = ii + 1

do this:

ii = ((s1 ** 2 + s2 ** 2) < 1).sum()

This works because numpy has built-in support for optimized array operations. The looping occurs in c instead of python, so it's much faster. I did a quick test so you can see the difference:

>>> def estimate_pi_loop(x, y):
...     total = 0
...     for i in xrange(len(x)):
...         if x[i] ** 2 + y[i] ** 2 < 1:
...             total += 1
...     return total * 4.0 / len(x)
... 
>>> def estimate_pi_numpy(x, y):
...     return ((x ** 2 + y ** 2) < 1).sum()
... 
>>> %timeit estimate_pi_loop(x, y)
1 loops, best of 3: 3.33 s per loop
>>> %timeit estimate_pi_numpy(x, y)
100 loops, best of 3: 10.4 ms per loop

Here are a few examples of the kinds of operations that are possible, just so you have a sense of how this works.

Squaring an array:

>>> a = numpy.arange(5)
>>> a ** 2
array([ 0,  1,  4,  9, 16])

Adding arrays:

>>> a + a
array([0, 2, 4, 6, 8])

Comparing arrays:

>>> a > 2
array([False, False, False,  True,  True], dtype=bool)

Summing boolean values:

>>> (a > 2).sum()
2

As you probably realize, there are faster ways to estimate Pi, but I will admit that I've always admired the simplicity and effectiveness of this method.

175

answered Oct 04 '22 22:10

senderle

You have assigned numpy arrays, so you should use those to your advantage.

for item in range(n):
    if ((s1[item])**2 + (s2[item])**2) < 1:
        ii = ii + 1

becomes

s1sqr = s1*s1
s2sqr = s2*s2
s_sum = s1sqr + s2sqr
s_sum_bool = s_sum < 1
ii = s_sum_bool.sum()
print float(ii*4/(n))

Where you are squaring the arrays, summing them, checking if the sum is less than 1 and then summing the boolean values (false = 0, true = 1) to get the total number that meet the criteria.

answered Oct 04 '22 20:10

James

Related questions
                            
                                Bottle web app not serving static css files
                            
                                Geopy: retrieving country names in English
                            
                                Solving for quartile and decile using Python
                            
                                setup.py: run build_ext before anything else
                            
                                Debug a python script with arguments from terminal
                            
                                Python regex explanation needed - $ character usage
                            
                                django-registration-redux add extra field
                            
                                pandas correlation matrix between each pair groupby item
                            
                                Eigenvector does not work in Sympy
                            
                                Modelformset in django generic CreateView and UpdateView
                            
                                Error importing h5py
                            
                                Regular expression to replace all script src attributes
                            
                                Python 2.7: Ints as objects
                            
                                Python peewee save() doesn't work as expected
                            
                                Convert number to time in Python
                            
                                Format seconds as float
                            
                                Having trouble with sending an email through SMTP Python
                            
                                How to sort row index case-insensitive way in Pandas DataFrame
                            
                                Converting to ASCII with numbers above 128
                            
                                Function name of wrapped function? [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With