Using Apache Commons Math to determine confidence intervals

Q: How do you calculate confidence intervals?

Compute the standard error as σ/√n = 0.5/√100 = 0.05 . Multiply this value by the z-score to obtain the margin of error: 0.05 × 1.959 = 0.098 . Add and subtract the margin of error from the mean value to obtain the confidence interval. In our case, the confidence interval is between 2.902 and 3.098.

Q: How do you find the confidence interval in machine learning?

Step 1: Identify the sample problem. Choose the statistic (like sample mean, etc) that you will use to estimate population parameter. Step 2: Select a confidence level. (Usually, it is 90%, 95% or 99%) Step 3: Find the margin of error.

Tags:

java

statistics

apache-commons-math

I have a set of benchmark data for which I compute summary statistics using Apache Math Commons. Now I want to use the package to compute confidence intervals for the arithmetic means of e.g. running time measurements.

Is this possible at all? I am convinced that the package supports this, however I am at a loss about where to start.

This is the solution I ended up using with the help of Brent Worden's suggestion:

private double getConfidenceIntervalWidth(StatisticalSummary statistics, double significance) {
    TDistribution tDist = new TDistribution(statistics.getN() - 1);
    double a = tDist.inverseCumulativeProbability(1.0 - significance / 2);
    return a * statistics.getStandardDeviation() / Math.sqrt(statistics.getN());
}

406

asked Apr 06 '11 10:04

Jannik Jochem

1 Answers

Apache Commons Math does not have direct support for constructing confidence intervals. However, it does have everything needed to compute them.

First, use SummaryStatistics, or some other StatisticalSummary implementation to summarize your data into sample statistics.

Next, use TDistribution to compute critical values for your desired confidence level. The degrees of freedom can be inferred from the summary statistics' n property.

Last, use the mean, variance, and n property values from the summary statistics and the t critical value from the distribution to compute your lower and upper confidence limits.

171

answered Oct 04 '22 13:10

Brent Worden

Related questions
                            
                                Get the Raw Request String from HttpServletRequest
                            
                                How to prevent tomcat session hijacking?
                            
                                Invalid access of stack red zone from Java VM
                            
                                What do -XX:-PrintGC and XX:-PrintGCDetails flags do?
                            
                                How do I get maven managed dependencies copied into war\web-inf\lib so I can run my GWT 2.0 app in debug mode within Eclipse?
                            
                                How to unload an already loaded class in Java? [duplicate]
                            
                                Are there any examples/tutorials of using Spring 3.0 with Cassandra as a backend? [closed]
                            
                                Meta Search Engine Architecture
                            
                                how do I create my own training corpus for stanford tagger?
                            
                                How to estimate zip file size in java before creating it
                            
                                Retrofitting void methods to return its argument to facilitate fluency: breaking change?
                            
                                Decompress GZip string in Java
                            
                                Do I need to flush events when shutting down using logback?
                            
                                What's wrong with this example of Java property inheritance?
                            
                                Maximum number of digits after the decimal point using BigDecimal
                            
                                Producing LaTeX output in Java [closed]
                            
                                How can I build multiple projects in Ant with one build file?
                            
                                How to load multiple configuration files using apache common configuration(java)
                            
                                Is Memcache (Java) for Google App Engine a global cache?
                            
                                Measuring the number of queued requests for tomcat

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With