Computing a binomial probability for huge numbers

Tags:

python

algorithm

numpy

probability

binomial-coefficients

I want to compute binomial probabilities in Python. I tried to apply the formula:

probability = scipy.misc.comb(n,k)*(p**k)*((1-p)**(n-k))

Some of the probabilities I get are infinite. I checked some of the inputs for which the result is inf. For one of them, n=450,000 and k=17. An intermediate value must exceed the largest value a float can hold (about 1.8e308).

I then tried to use sum(np.random.binomial(n,p,numberOfTrials)==valueOfInterest)/numberOfTrials

This draws numberOfTrials samples and computes the fraction of them that equal valueOfInterest.

This doesn't produce any infinite values. However, is this a valid way to proceed? And why doesn't this approach produce infinite values, whereas computing the probabilities directly does?

213

asked Mar 05 '14 15:03

bigTree

1 Answer

Because you're using scipy I thought I would mention that scipy already has statistical distributions implemented. Also note that when n is this large the binomial distribution is well approximated by the normal distribution (or Poisson if p is very small).

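import numpy as np
import scipy.stats

n = 450000
p = .5
k = np.array([17., 225000, 226000])

# Exact binomial distribution
b = scipy.stats.binom(n, p)
print(b.pmf(k))
# array([  0.00000000e+00,   1.18941527e-03,   1.39679862e-05])

# Normal approximation with the same mean and standard deviation
# (named approx so that n is not overwritten)
approx = scipy.stats.norm(n*p, np.sqrt(n*p*(1-p)))
print(approx.pdf(k))
# array([  0.00000000e+00,   1.18941608e-03,   1.39680605e-05])

# Difference between the exact pmf and the approximation
print(b.pmf(k) - approx.pdf(k))
# array([  0.00000000e+00,  -8.10313274e-10,  -7.43085142e-11])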
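If you would rather keep the formula from the question, one common way to avoid the inf is to evaluate it in log space, so the huge binomial coefficient and the tiny powers are never formed separately. The following is only a minimal sketch under that assumption and is not part of the code above: the names binom_log_pmf and binom_pmf are hypothetical, and it relies only on math.lgamma and math.log1p from the standard library.

```python
import math

def binom_log_pmf(k, n, p):
    """Natural log of the binomial pmf P(X = k) for X ~ Binomial(n, p)."""
    if k < 0 or k > n:
        return -math.inf
    # Degenerate cases where log(p) or log(1 - p) would be undefined
    if p == 0.0:
        return 0.0 if k == 0 else -math.inf
    if p == 1.0:
        return 0.0 if k == n else -math.inf
    # log C(n, k) via log-gamma, using lgamma(m + 1) == log(m!)
    log_comb = math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
    # log1p(-p) is log(1 - p), and stays accurate for very small p
    return log_comb + k * math.log(p) + (n - k) * math.log1p(-p)

def binom_pmf(k, n, p):
    """Binomial pmf; very small results underflow to 0.0 rather than giving inf."""
    return math.exp(binom_log_pmf(k, n, p))

n, p = 450000, 0.5
print(binom_pmf(225000, n, p))    # comparable to b.pmf(225000) above
print(binom_pmf(17, n, p))        # too small for a float, so it underflows to 0.0
print(binom_log_pmf(17, n, p))    # the log of that probability is still finite
```

Working with the log probability is also useful when you need to compare or sum many such terms, since the log remains finite even where the probability itself is too small to represent as a float.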

125

answered Sep 20 '22 15:09

Bi Rico
