Calculating p-values of an F-statistic with R

Tags:

r

distribution

p-value

I'm trying to calculate p-values of an F-statistic with R. The formula R uses in the lm() function is equivalent to (e.g. assume x=100, df1=2, df2=40):

pf(100, 2, 40, lower.tail=F)
[1] 2.735111e-16

which should be equal to

1-pf(100, 2, 40)
[1] 2.220446e-16

It is not the same! There's no BIG difference, but where does it come from? If I calculate (x=5, df1=2, df2=40):

pf(5, 2, 40, lower.tail=F)
[1] 0.01152922

1-pf(5, 2, 40)
[1] 0.01152922

it is exactly the same. The question is: what is happening here? Have I missed something?


asked Jan 29 '14 14:01

cjena

2 Answers

> all.equal(pf(100, 2, 40, lower.tail=F),1-pf(100, 2, 40))
[1] TRUE
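
A note on why this returns TRUE: all.equal() compares within a tolerance (by default about 1.5e-8) rather than testing exact equality, so a difference on the order of 1e-17 passes easily. A minimal sketch of the contrast with an exact comparison; the names a, b, near_equal and exact_equal are illustrative and not from the original post:

```r
# The two ways of getting the upper-tail probability from the question
a <- pf(100, 2, 40, lower.tail = FALSE)
b <- 1 - pf(100, 2, 40)

# all.equal() tests "equal up to a tolerance"; wrap it in isTRUE()
# because it returns a description string when the values differ
near_equal <- isTRUE(all.equal(a, b))

# identical() tests exact equality of the two doubles
exact_equal <- identical(a, b)

print(c(near_equal = near_equal, exact_equal = exact_equal))
```

So all.equal() only says the two results agree to within its tolerance; it does not say they are bit-for-bit the same number.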


answered Oct 08 '22 08:10

George Dontas

As the comments note, this is a floating point precision issue. In fact both of the examples you show are not precisely equal as evaluated:

> pf(5, 2, 40, lower.tail=F) - (1-pf(5, 2, 40))
[1] 6.245005e-17

> pf(100, 2, 40, lower.tail=F) - (1-pf(500, 2, 40))
[1] 2.735111e-16

It's just that the difference is only visible in the printed output for the much smaller number, where it is large relative to the value itself.
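
To see where the lost precision comes from, it may help to put both calculations side by side for a few values of x. The sketch below is only an illustration: the variable names are made up, x = 500 is borrowed from the line above, and the closed form (1 + 2x/df2)^(-df2/2) is the upper tail of the F distribution for the special case df1 = 2, used here purely as an independent check.

```r
x   <- c(5, 100, 500)
df2 <- 40

upper   <- pf(x, 2, df2, lower.tail = FALSE)  # upper tail computed directly
via_one <- 1 - pf(x, 2, df2)                  # lower tail subtracted from 1

# Closed form of the upper tail, valid only for df1 = 2 (independent check)
closed <- (1 + 2 * x / df2)^(-df2 / 2)

print(data.frame(x, upper, via_one, closed))

# Spacing of doubles just above 1. A lower-tail probability this close
# to 1 has to be rounded to a nearby double, so 1 - pf(...) cannot
# resolve tail probabilities much smaller than this.
print(.Machine$double.eps)
```

On a typical double-precision setup, the lower.tail = FALSE column should track the closed form all the way down, while the 1 - pf(...) column gets stuck at the level of machine epsilon for x = 100 and collapses to 0 for x = 500. That is why lower.tail = FALSE is the safer choice for very small p-values.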

answered Oct 08 '22 08:10

Aaron Schumacher
