Which algorithm does the rnorm function use by default to generate standard normally distributed random numbers?


Which algorithm is used by the rnorm function?

2 Answers

See ?RNGkind. The default is an inversion algorithm:

normal.kind can be "Kinderman-Ramage", "Buggy Kinderman-Ramage" (not for set.seed), "Ahrens-Dieter", "Box-Muller", "Inversion" (the default), or "user-supplied". (For inversion, see the reference in qnorm.) The Kinderman-Ramage generator used in versions prior to 1.7.1 (now called "Buggy") had several approximation errors and should only be used for reproduction of old results. The "Box-Muller" generator is stateful as pairs of normals are generated and returned sequentially. The state is reset whenever it is selected (even if it is the current normal generator) and when kind is changed.

You can change the algorithm by

    RNGkind(normal.kind = "Box-Muller")

You can find what is currently set by looking at RNGkind()[2].


answered Nov 07 '22 23:11

Jouni Helske

The other answer is sufficient, but left me with some more questions; in particular, I didn't see anywhere in the documentation* what on earth the "Inversion" algorithm is, so I dived into the source code, which also gives academic references to the papers originating the other possible algorithms, to figure out what exactly is being done.

    case INVERSION:
#define BIG 134217728 /* 2^27 */
    /* unif_rand() alone is not of high enough precision */
    u1 = unif_rand();
    u1 = (int)(BIG*u1) + unif_rand();
    return qnorm5(u1/BIG, 0.0, 1.0, 1, 0);

So at its base, the default "Inversion" algorithm generates a high-precision uniform floating-point number (it looks like 53 bits, the mantissa size of a 64-bit double), then passes it to the qnorm5 function, which is the quantile function (inverse CDF) of the normal distribution. Feeding a uniform draw on (0, 1) through the inverse CDF yields a normally distributed value, hence the name.

As to how the qnorm5 function works (given that there is no closed form for the normal CDF or its inverse), I haven't had much luck cracking the source code, but it does give further academic references, namely Beasley, J. D. and S. G. Springer (1977) and Wichura, M.J. (1988). As far as I can tell, the former is typically used for small quantiles of the CDF and the latter for large ones (z>7 or so).

It may also be interesting to note that (as of this writing) this algorithm appears to be shared by the Julia language, which also shares the qnorm5 code used by R.

*To be fair, in retrospect, Wichura is mentioned in ?qnorm which is referenced above. Still it's worthwhile to spell things out in this thread, I think.

answered Nov 08 '22 01:11

MichaelChirico


Tags:

random

r

Klaus
