Randomly choosing from a list with weighted probabilities

I have an array of N elements (representing the N letters of a given alphabet). Each cell of the array holds an integer: the number of occurrences of that letter in a given text. I want to randomly choose a letter from the alphabet based on its number of occurrences, subject to the following constraints:

If a letter has a positive (nonzero) value, it can always be chosen by the algorithm (with a higher or lower probability, of course).
If letter A has a higher value than letter B, then A has to be more likely to be chosen by the algorithm.

Taking that into account, I've come up with a simple algorithm that might do the job, but I was wondering whether there is a better approach. This seems quite fundamental, and I think there might be a cleverer way to accomplish it more efficiently. This is the algorithm I came up with:

Add up all the frequencies in the array. Store the result in SUM.
Choose a random value from 0 to SUM. Store it in RAN.
While RAN > 0, starting from the first cell, visit each cell of the array in order and subtract the value of that cell from RAN.
The last visited cell is the chosen one.
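
For concreteness, the steps above can be sketched in Python roughly as follows. This is only an illustration under some assumptions that are not part of the steps as written: RAN is drawn as an integer from 1 to SUM inclusive (so that a value of 0 can never leave the loop without visiting a cell), the function name `pick_linear` and its `rng` parameter are made up for this sketch, and the function returns the index of the chosen cell.

```python
import random

def pick_linear(freqs, rng=random):
    """Return the index of a cell chosen with probability freqs[i] / SUM."""
    total = sum(freqs)                      # SUM
    if total <= 0:
        raise ValueError("at least one frequency must be positive")
    # Assumption: RAN is an integer in 1..SUM inclusive.
    ran = rng.randint(1, total)             # RAN
    for index, freq in enumerate(freqs):
        ran -= freq                         # subtract the cell's value from RAN
        if ran <= 0:                        # the loop condition RAN > 0 fails here
            return index                    # the last visited cell is the chosen one
    raise AssertionError("unreachable when total > 0")

# Example: frequencies for a four-letter alphabet.
chosen = pick_linear([1, 4, 3, 2])
```

Each draw walks the array from the start, so it takes time proportional to N in the worst case. A cell with value 0 never brings RAN down to 0 or below by itself, so it is never returned.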

So, is there a better thing to do than this? Am I missing something?

I'm aware that most modern computers can compute this so fast that I won't even notice if my algorithm is inefficient, so this is more of a theoretical question than a practical one.

I would prefer an explained algorithm to just code as an answer, but if you're more comfortable answering in code, I have no problem with that.

asked Jun 22 '13 12:06

Setzer22

1 Answer

The idea:

Iterate through all the elements and set the value of each element to the cumulative frequency so far.
Generate a random integer between 1 and the sum of all frequencies (inclusive).
Do a binary search on the cumulative values for this number, finding the first value greater than or equal to the number.
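
A minimal Python sketch of these three steps, assuming the frequencies are non-negative integers with at least one positive value. The names `build_cumulative` and `pick_bisect` are hypothetical; `itertools.accumulate`, `bisect.bisect_left`, and `random.randint` are standard-library functions.

```python
import bisect
import itertools
import random

def build_cumulative(freqs):
    """Step 1: running totals, e.g. [1, 4, 3, 2] -> [1, 5, 8, 10]."""
    return list(itertools.accumulate(freqs))

def pick_bisect(cumulative, rng=random):
    """Steps 2 and 3: draw a number, then binary-search the running totals."""
    total = cumulative[-1]                 # sum of all frequencies
    number = rng.randint(1, total)         # integer in 1..total inclusive
    # bisect_left returns the index of the first value >= number.
    return bisect.bisect_left(cumulative, number)

cumulative = build_cumulative([1, 4, 3, 2])
index = pick_bisect(cumulative)            # index into the original array
```

Building the cumulative list is a one-time pass over the array; after that, each draw costs one binary search, so this pays off when many letters are drawn from the same frequencies.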

Example:

Element    A B C D
Frequency  1 4 3 2
Cumulative 1 5 8 10

Generate a random number in the range 1-10 (1+4+3+2 = 10, the same as the last value in the cumulative list) and do a binary search for it in the cumulative list. The search returns elements as follows:

Number   Element returned
1        A
2        B
3        B
4        B
5        B
6        C
7        C
8        C
9        D
10       D
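
As a quick check of the table, the mapping can be reproduced deterministically by running the search for every number from 1 to 10 instead of a random one. This is just a sketch; the variable names are chosen for illustration.

```python
from bisect import bisect_left
from collections import Counter
from itertools import accumulate

elements = ["A", "B", "C", "D"]
frequencies = [1, 4, 3, 2]
cumulative = list(accumulate(frequencies))     # [1, 5, 8, 10]

# Element returned for each possible number 1..10.
returned = [elements[bisect_left(cumulative, n)]
            for n in range(1, cumulative[-1] + 1)]

# How many of the 10 equally likely numbers map to each element.
counts = Counter(returned)

for number, element in enumerate(returned, start=1):
    print(number, element)
```

Each element is returned for exactly as many numbers as its frequency (A once, B four times, C three times, D twice), which is why a uniformly random number yields probabilities proportional to the frequencies.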

answered Sep 28 '22 08:09

Bernhard Barker

Tags: language-agnostic, algorithm, letter, random, alphabet
