Choose m elements randomly from a vector containing n elements

Tags:

I have a vector containing n elements. I need to choose a subset of m elements randomly from the vector without repetition. What is the most efficient way of doing this? I need to do this several thousands of times in my code.

The solution on top of my mind is to use rand() to generate a random number k between 0 and n. Then pick the kth element in the vector and insert it into a std::set. Keep doing this till the set's size becomes equal to m. I am now assured that the set contains m unique elements randomly chosen from the set of n elements.

What are the other possible solutions?

Thanks.

541

asked Feb 18 '12 23:02

Vinay

1 Answers

You want a Fisher-Yates shuffle (stop after M iterations):

template<class BidiIter > BidiIter random_unique(BidiIter begin, BidiIter end, size_t num_random) {     size_t left = std::distance(begin, end);     while (num_random--) {         BidiIter r = begin;         std::advance(r, rand()%left);         std::swap(*begin, *r);         ++begin;         --left;     }     return begin; }

Demo at http://ideone.com/3A3cv. This is significantly faster than std::random_shuffle when you only need a few random numbers out of the set, and should be just about the same speed even if N==M.

189

answered Oct 03 '22 11:10

Michael Burr

Related questions
                            
                                What is the idiomatic way to install a Debian package using Chef?
                            
                                GLSL gl_FragCoord.z Calculation and Setting gl_FragDepth
                            
                                IoC (Ninject) and Factories
                            
                                a Redirect_to from Destroy action always gets DELETE verb whatever :method I declare
                            
                                PDOstatement (MySQL): inserting value 0 into a bit(1) field results in 1 written in table
                            
                                Group models from different app/object into one Admin block
                            
                                Wanted: Matlab example of an anonymous function returning more than 1 output
                            
                                $(document).ready(function(){ Uncaught ReferenceError: $ is not defined
                            
                                HTML print with absolute postitions
                            
                                How is x86 instruction cache synchronized?
                            
                                Difference between @Named and @ManagedBean annotations in JSF2.0 Tomcat7 [duplicate]
                            
                                Automapper with base class and different configuration options for implementations

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With