Data structure for choosing random elements?

Tags:

Does anyone know of a data structure that supports the two operations efficiently?

Insert a value into the data structure.
Dequeue and return an entry from the data structure with uniformly random probability.

This is sort of like the canonical "bag of marbles" that always comes up in introductory probability classes. You can put arbitrary marbles into the bag, and can then efficiently remove the marbles at random.

The best solution I have is as follows - use a self-balancing binary search tree (AVL, AA, red/black, etc.) to store the elements. This gives O(lg n) insertion time. To remove a random element, pick a random index i, then locate and remove the ith element from the tree. If you've augmented the structure appropriately, this can be made to run in O(lg n) time as well.

This runtime certainly isn't bad, but I'm curious if it's possible to do better - perhaps O(1) for insertion and O(lg n) for dequeues? Or perhaps something that runs in expected O(1) insert and delete using hash functions? Or perhaps a stronger amortized bound?

Does anyone have any ideas on how to make this asymptotically faster?

588

asked Dec 30 '10 18:12

templatetypedef

1 Answers

Yes. Use a vector.

To insert, simply place at the end, and increment the size. To remove, pick an element at random, swap its contents with the end value, then pop off the end value (i.e., return the end value and decrement the vector's size).

Both operations are amortised O(1).

153

answered Oct 02 '22 02:10

Chris Jester-Young

Related questions
                            
                                Efficiently eliminate common sub-expressions in .NET Expression Tree
                            
                                any_of Versus find_if
                            
                                Calculating number of messages per second in a rolling window?
                            
                                How would you pick a uniform random element in linked list with unknown length?
                            
                                Implementing undo and redo for an array
                            
                                Efficient Out-Of-Core Sorting
                            
                                Given two arrays a and b .Find all pairs of elements (a1,b1) such that a1 belongs to Array A and b1 belongs to Array B whose sum a1+b1 = k
                            
                                Precise sum of floating point numbers
                            
                                Can a lambda be used to change a List's values in-place ( without creating a new list)?
                            
                                Why does backtracking make an algorithm non-deterministic?
                            
                                Nth largest element in a binary search tree
                            
                                Best way to find differences between two large arrays in PHP
                            
                                Python - Compress Ascii String
                            
                                Getting the number of trailing 1 bits
                            
                                Algorithm to locate local maxima
                            
                                How to perform binary search on NSArray?
                            
                                How to find if a graph is bipartite?
                            
                                Best articles to start learning about edge detection/image recognition
                            
                                Determining whether or not a directed or undirected graph is a tree
                            
                                Artificial Neural Network Question

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Data structure for choosing random elements?

Tags:

language-agnostic

algorithm

random

data-structures

templatetypedef

People also ask

1 Answers

Chris Jester-Young

Recent Activity

Donate For Us