How do I determine a best-fit distribution in Java?

Tags: java, math, statistics

I have a number of data sets (between 50 and 500 points each, where every point takes a positive integer value) and need to determine which distribution best describes each of them. I have done this manually for several of them, but need to automate it going forward.

Some of the sets are completely modal (every datum has the value 15), some are strongly modal or bimodal, some are bell curves (often skewed and with differing degrees of kurtosis/pointiness), some are roughly flat, and there are any number of other possible distributions (Poisson, power-law, etc.). I need a way to determine which distribution best describes the data and that (ideally) also provides a fitness metric, so that I know how confident I can be in the analysis.

Existing open-source libraries would be ideal, followed by well-documented algorithms that I can implement myself.

asked Jun 02 '10 21:06

Eadwacer

1 Answer

Searching for whichever distribution happens to fit is unlikely to give you good results in the absence of some a priori knowledge. You may find a distribution that coincidentally fits well but is unlikely to be the underlying distribution.

Do you have any metadata available that would hint at what the data means? E.g., "this is open-ended data sampled from a natural population, so it's some sort of normal distribution", vs. "this data is inherently bounded at 0 and discrete, so check for the best-fitting Poisson".
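If the metadata does point toward a Poisson, the fit itself is simple: the maximum-likelihood estimate of the rate is the sample mean, and a Pearson chi-square statistic gives a rough fitness score. Here is a minimal sketch in plain Java. The class and method names (`PoissonFit`, `estimateLambda`, `pmf`, `chiSquare`) are made up for illustration, it assumes non-negative integer data, and it does not pool sparse bins, which you would normally do before trusting the statistic.

```java
/** Hypothetical helper: fits a Poisson by maximum likelihood and scores the fit. */
final class PoissonFit {

    /** The maximum-likelihood estimate of the Poisson rate is the sample mean. */
    static double estimateLambda(int[] data) {
        if (data.length == 0) throw new IllegalArgumentException("empty data");
        double sum = 0.0;
        for (int x : data) sum += x;
        return sum / data.length;
    }

    /** P(X = k) for rate lambda > 0, computed in log space to avoid overflow. */
    static double pmf(int k, double lambda) {
        if (k < 0 || lambda <= 0) throw new IllegalArgumentException("need k >= 0, lambda > 0");
        double logP = -lambda + k * Math.log(lambda);
        for (int i = 2; i <= k; i++) logP -= Math.log(i); // subtract log(k!)
        return Math.exp(logP);
    }

    /**
     * Pearson chi-square statistic of the data against the fitted Poisson.
     * Bins are 0, 1, ..., max-1 plus one tail bin for "max or more".
     * Simplification: bins with tiny expected counts are not pooled.
     */
    static double chiSquare(int[] data) {
        double lambda = estimateLambda(data);
        int max = 0;
        for (int x : data) {
            if (x < 0) throw new IllegalArgumentException("negative value: " + x);
            max = Math.max(max, x);
        }
        long[] observed = new long[max + 1];
        for (int x : data) observed[x]++;

        double n = data.length, cumulative = 0.0, stat = 0.0;
        for (int k = 0; k <= max; k++) {
            // The last bin takes all remaining probability mass.
            double p = (k < max) ? pmf(k, lambda) : 1.0 - cumulative;
            cumulative += p;
            double expected = Math.max(n * p, 1e-12); // floor avoids division by zero
            double diff = observed[k] - expected;
            stat += diff * diff / expected;
        }
        return stat;
    }

    public static void main(String[] args) {
        int[] sample = {2, 3, 1, 4, 2, 2, 5, 3, 1, 2, 3, 0, 4, 2, 3};
        System.out.println("lambda     = " + estimateLambda(sample));
        System.out.println("chi-square = " + chiSquare(sample));
    }
}
```

Lower is better. To turn the statistic into a p-value you need a chi-square CDF with (number of bins - 2) degrees of freedom, since one parameter was estimated from the data; the `ChiSquaredDistribution` class in Apache Commons Math could serve for that. Keep in mind that a good score only says the data is compatible with a Poisson, not that a Poisson process generated it.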

I don't know of any distribution solvers for Java off the top of my head, and I don't know of any that will guess which distribution to use. You could examine some statistical properties (skewness, etc.) and make some guesses from them, but you're more likely to end up with an accidentally good fit that does not adequately represent the underlying distribution. Real data is noisy, and there are just too many degrees of freedom if you don't even know what distribution it is.
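For what it's worth, the statistical properties mentioned above are cheap to compute yourself. The following is a rough sketch, not a classifier: `ShapeStats` is a hypothetical name, it uses the plain moment-based (population) definitions of skewness and excess kurtosis, and the variance-to-mean ratio is included only because it is near 1 for Poisson data. Any thresholds you apply to these numbers would be your own guesses.

```java
/** Hypothetical helper: summary statistics that hint at a distribution's shape. */
final class ShapeStats {
    final double mean;
    final double variance;        // population variance (divides by n)
    final double skewness;        // 0 for symmetric data; NaN if all values are equal
    final double excessKurtosis;  // 0 for a normal distribution; NaN if all values are equal

    ShapeStats(int[] data) {
        int n = data.length;
        if (n < 2) throw new IllegalArgumentException("need at least 2 points");

        double sum = 0.0;
        for (int x : data) sum += x;
        double m = sum / n;

        // Central moments of order 2, 3 and 4.
        double m2 = 0.0, m3 = 0.0, m4 = 0.0;
        for (int x : data) {
            double d = x - m;
            m2 += d * d;
            m3 += d * d * d;
            m4 += d * d * d * d;
        }
        m2 /= n;
        m3 /= n;
        m4 /= n;

        mean = m;
        variance = m2;
        // Completely modal data has zero variance, so shape measures are undefined.
        skewness = (m2 == 0.0) ? Double.NaN : m3 / Math.pow(m2, 1.5);
        excessKurtosis = (m2 == 0.0) ? Double.NaN : m4 / (m2 * m2) - 3.0;
    }

    /** Variance-to-mean ratio: near 1 for Poisson data, 0 for constant data. */
    double dispersionIndex() {
        return variance / mean;
    }

    public static void main(String[] args) {
        ShapeStats s = new ShapeStats(new int[]{2, 3, 1, 4, 2, 2, 5, 3, 1, 2});
        System.out.println("mean=" + s.mean + " variance=" + s.variance
                + " skewness=" + s.skewness + " excessKurtosis=" + s.excessKurtosis
                + " dispersion=" + s.dispersionIndex());
    }
}
```

With only 50 to 500 points these estimates are themselves noisy, kurtosis especially, so treat them as hints for narrowing the candidates rather than as a decision rule.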

answered Sep 28 '22 09:09

Alex Feinman
