Random sampling from a dataset, while preserving original probability distribution

I have a set of more than 2000 numbers gathered from measurement. I want to sample from this data set, about 10 values per test, while preserving the probability distribution both overall and, as far as possible, within each test. For example, each test should contain some small values, some mid-range values, and some large values, with a mean and variance approximately equal to those of the original distribution. Combining all the tests, the mean and variance of all the samples should also be approximately equal to those of the original distribution.

As my dataset has a long-tailed probability distribution, the amount of data at each quantile is not the same:

Fig. 1. Probability density plot of the ~2k data elements.

I am using Java. Right now I draw a random int from a uniform distribution and return the data element at that position:

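import java.util.Random;

// Returns one element chosen uniformly at random (with replacement) from the measured data.
public int getRandomData() {
    int[] data = {1231, 414, 222, 4211, 41, 203, 123, 432, ...};
    int length = data.length;
    Random r = new Random();
    int randomInt = r.nextInt(length); // uniform index in [0, length)
    return data[randomInt];
}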

I don't know whether this works as I want, because I use the data in the order it was measured, which has a large amount of serial correlation.

asked Sep 12 '15 14:09

Ho1

1 Answer

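It works as you want. The order of the data is irrelevant: the index is drawn uniformly and independently on every call, so each element is equally likely to be returned no matter where it sits in the array, and the draws follow the empirical distribution of the data.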
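To illustrate that the order does not matter, here is a minimal sketch rather than the asker's actual code. The class name `EmpiricalSampler`, the seed parameter, and the helper methods are assumptions made for illustration, and the array holds only the eight values visible in the question. It draws by uniform index from the same data in measured order and in sorted order, then compares both sample means with the mean of the data.

```java
import java.util.Arrays;
import java.util.Random;

// Hypothetical helper class, not part of the original question.
final class EmpiricalSampler {
    private final int[] data;
    private final Random random;

    EmpiricalSampler(int[] data, long seed) {
        this.data = data.clone();
        this.random = new Random(seed); // one generator, reused across calls
    }

    // Draws one element with replacement; every position is equally likely.
    int getRandomData() {
        return data[random.nextInt(data.length)];
    }

    // Mean of n independent draws.
    double sampleMean(int n) {
        long sum = 0;
        for (int i = 0; i < n; i++) {
            sum += getRandomData();
        }
        return (double) sum / n;
    }

    // Mean of the data set itself.
    static double mean(int[] values) {
        return Arrays.stream(values).average().orElse(Double.NaN);
    }

    public static void main(String[] args) {
        // Only the values visible in the question; the rest are elided there.
        int[] measured = {1231, 414, 222, 4211, 41, 203, 123, 432};
        int[] sorted = measured.clone();
        Arrays.sort(sorted);

        int draws = 200_000;
        System.out.println("data mean:               " + mean(measured));
        System.out.println("sample mean (as stored): "
                + new EmpiricalSampler(measured, 1L).sampleMean(draws));
        System.out.println("sample mean (sorted):    "
                + new EmpiricalSampler(sorted, 2L).sampleMean(draws));
    }
}
```

With many draws, both sample means should land close to the data mean. With only about 10 draws per test, the mean of an individual test will vary much more, especially for long-tailed data, while the pooled samples over many tests still approach the original distribution.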

answered Oct 04 '22 04:10

Rex D

Tags:

java

sampling

probability-density
