Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Confidence Interval/Margin of Recall or Precision

If I have a confusion matrix that is based upon a sample set, how do I determine the statistical power (confidence margin/interval) of my recall/precision/etc metrics? I know how to do statistical power analysis for the probability of conversion itself but how do I do it for the recall/precision?

like image 292
SriK Avatar asked Dec 09 '16 21:12

SriK


People also ask

Do confidence intervals measure accuracy or precision?

We define accuracy in terms of whether or not the confidence interval contains the true population parameter. And precision refers to the width of a confidence interval.

Does confidence interval tell you precision?

The width of a confidence interval indicates the precision of our knowledge about the parameter. Narrow confidence intervals show precise knowledge, while wide confidence errors show imprecise knowledge. There is no necessary connection between the precision of an estimate and the size of a confidence interval.

What is the difference between precision interval and confidence interval?

Precision is usually referred to as the reciprocal of variance. There is another definition which treats it as the standard error of an estimate. Confidence intervals are different. They provide a statistical interval that in repeated sampling the true parameter will fall in the interval !-

How do you calculate confidence intervals for recalls?

(1-alpha) * 100% = 95% confidence interval. If you would like a 99% confidence interval use alpha=0.01 ; For a 90% confidence interval use alpha=0.1 and so on.


1 Answers

Found the answer to this. It is a slightly modified equation of standard confidence interval calculation of p +/- Z_score_at_alpha * std_error. The only difference is that p (basically your recall probability) is computed with an offset -> adjusted_recall=(TP+2)/(TP+FN+4).

The general idea is that the standard confidence interval equation doesnt work when p is at 0 or 1. This equation provides an adjustment that allows it to work. its just a fudge factor.

Also, the std error is now sqrt(adjusted_recall(1-adjusted_recall)/(N+4)). This is known as the wilson score interval - https://en.wikipedia.org/wiki/Binomial_proportion_confidence_interval#Wilson_score_interval

http://www.stat.ucdavis.edu/~kwwong/STA13-SS1-12/Statistics_13_files/lecture05.pdf

https://stats.stackexchange.com/questions/109429/wilsons-adjustment-for-sample-proportion

like image 55
SriK Avatar answered Sep 23 '22 05:09

SriK