How to calculate cumulative normal distribution?

People also ask

What is the cumulative distribution of a normal distribution?

Calculates the normal distribution of the mean and standard deviation of a set of values. Returns either the cumulative distribution or the probability density. This function is widely applied in statistics, including in the area of hypothesis testing.

Here's an example:

>>> from scipy.stats import norm
>>> norm.cdf(1.96)
0.9750021048517795
>>> norm.cdf(-1.96)
0.024997895148220435

In other words, approximately 95% of the standard normal interval lies within two standard deviations, centered on a standard mean of zero.

If you need the inverse CDF:

>>> norm.ppf(norm.cdf(1.96))
array(1.9599999999999991)

It may be too late to answer the question but since Google still leads people here, I decide to write my solution here.

That is, since Python 2.7, the math library has integrated the error function math.erf(x)

The erf() function can be used to compute traditional statistical functions such as the cumulative standard normal distribution:

from math import *
def phi(x):
    #'Cumulative distribution function for the standard normal distribution'
    return (1.0 + erf(x / sqrt(2.0))) / 2.0

Ref:

https://docs.python.org/2/library/math.html

https://docs.python.org/3/library/math.html

How are the Error Function and Standard Normal distribution function related?

Starting Python 3.8, the standard library provides the NormalDist object as part of the statistics module.

It can be used to get the cumulative distribution function (cdf - probability that a random sample X will be less than or equal to x) for a given mean (mu) and standard deviation (sigma):

from statistics import NormalDist

NormalDist(mu=0, sigma=1).cdf(1.96)
# 0.9750021048517796

Which can be simplified for the standard normal distribution (mu = 0 and sigma = 1):

NormalDist().cdf(1.96)
# 0.9750021048517796

NormalDist().cdf(-1.96)
# 0.024997895148220428

Adapted from here http://mail.python.org/pipermail/python-list/2000-June/039873.html

from math import *
def erfcc(x):
    """Complementary error function."""
    z = abs(x)
    t = 1. / (1. + 0.5*z)
    r = t * exp(-z*z-1.26551223+t*(1.00002368+t*(.37409196+
        t*(.09678418+t*(-.18628806+t*(.27886807+
        t*(-1.13520398+t*(1.48851587+t*(-.82215223+
        t*.17087277)))))))))
    if (x >= 0.):
        return r
    else:
        return 2. - r

def ncdf(x):
    return 1. - 0.5*erfcc(x/(2**0.5))

To build upon Unknown's example, the Python equivalent of the function normdist() implemented in a lot of libraries would be:

def normcdf(x, mu, sigma):
    t = x-mu;
    y = 0.5*erfcc(-t/(sigma*sqrt(2.0)));
    if y>1.0:
        y = 1.0;
    return y

def normpdf(x, mu, sigma):
    u = (x-mu)/abs(sigma)
    y = (1/(sqrt(2*pi)*abs(sigma)))*exp(-u*u/2)
    return y

def normdist(x, mu, sigma, f):
    if f:
        y = normcdf(x,mu,sigma)
    else:
        y = normpdf(x,mu,sigma)
    return y

Alex's answer shows you a solution for standard normal distribution (mean = 0, standard deviation = 1). If you have normal distribution with mean and std (which is sqr(var)) and you want to calculate:

from scipy.stats import norm

# cdf(x < val)
print norm.cdf(val, m, s)

# cdf(x > val)
print 1 - norm.cdf(val, m, s)

# cdf(v1 < x < v2)
print norm.cdf(v2, m, s) - norm.cdf(v1, m, s)

Read more about cdf here and scipy implementation of normal distribution with many formulas here.

Related questions
                            
                                Python coding standards/best practices [closed]
                            
                                Inline labels in Matplotlib
                            
                                Convert RGBA PNG to RGB with PIL
                            
                                What is "thread local storage" in Python, and why do I need it?
                            
                                How do I install Python packages in Google's Colab?
                            
                                Python/Django: log to console under runserver, log to file under Apache
                            
                                Which is more preferable to use: lambda functions or nested functions ('def')?
                            
                                How to suppress or capture the output of subprocess.run()?
                            
                                Python Selenium accessing HTML source
                            
                                How to override the copy/deepcopy operations for a Python object?
                            
                                Case Insensitive Flask-SQLAlchemy Query
                            
                                What's the meaning of "(1,) == 1," in Python?
                            
                                Numpy `logical_or` for more than two arguments
                            
                                sqlalchemy: how to join several tables by one query?
                            
                                urllib2.HTTPError: HTTP Error 403: Forbidden
                            
                                Python [Errno 98] Address already in use
                            
                                Get folder name of the file in Python
                            
                                In Python list comprehension is it possible to access the item index?
                            
                                deciding among subprocess, multiprocessing, and thread in Python?
                            
                                Loading and parsing a JSON file with multiple JSON objects

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to calculate cumulative normal distribution?

Tags:

python

numpy

statistics

scipy

People also ask

Recent Activity

Donate For Us