How can I get a weighted random pick from Python's Counter class?

I have a program where I'm keeping track of the success of various things using collections.Counter — each success of a thing increments the corresponding counter:

import collections
scoreboard = collections.Counter()

if test(thing):
    scoreboard[thing] += 1

Then, for future tests, I want to skew towards things which have generated the most success. Counter.elements() seemed ideal for this, since it returns the elements (in arbitrary order) repeated a number of times equal to the count. So I figured I could just do:

import random
nextthing = random.choice(scoreboard.elements())

But no, that raises TypeError: object of type 'itertools.chain' has no len(). Okay, so random.choice can't work with iterators. But, in this case, the length is known (or knowable) — it's sum(scoreboard.values()).

I know the basic algorithm for iterating through a list of unknown length and fairly picking an element at random, but I suspect that there's something more elegant. What should I be doing here?
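
For reference, the "basic algorithm" mentioned here is presumably single-item reservoir sampling. The following is only a sketch under that assumption; the helper name pick_from_iterator is made up for illustration and is not part of the standard library:

```python
import collections
import random

def pick_from_iterator(iterable):
    """Return one item chosen uniformly at random from an iterable of unknown length."""
    chosen = None
    seen = 0
    for seen, item in enumerate(iterable, start=1):
        # Keep the new item with probability 1/seen; every item ends up equally likely.
        if random.randrange(seen) == 0:
            chosen = item
    if seen == 0:
        raise IndexError("cannot choose from an empty iterable")
    return chosen

scoreboard = collections.Counter({'a': 2, 'b': 1})
# elements() repeats each key by its count, so a uniform pick is weighted by count.
nextthing = pick_from_iterator(scoreboard.elements())
```

This works, but it walks every element and draws one random number per element, which is why something more direct seems preferable.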


asked Jan 31 '12 18:01

mattdm

2 Answers

You can do this rather easily by using itertools.islice to get the Nth item of an iterable:

>>> import random
>>> import itertools
>>> import collections
>>> c = collections.Counter({'a': 2, 'b': 1})
>>> i = random.randrange(sum(c.values()))
>>> next(itertools.islice(c.elements(), i, None))
'a'
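
If you want this as a reusable function, a minimal sketch might look like the following. The name pick_by_count is hypothetical, and the code assumes integer counts. Since elements() skips entries whose count is below 1, the total here only sums positive counts so the random index always lands inside the iterator:

```python
import collections
import itertools
import random

def pick_by_count(counter):
    """Pick a key from a Counter with probability proportional to its count."""
    # elements() ignores counts below 1, so only positive counts contribute.
    total = sum(n for n in counter.values() if n > 0)
    if total == 0:
        raise IndexError("no elements with a positive count")
    i = random.randrange(total)
    # Skip the first i elements and take the next one.
    return next(itertools.islice(counter.elements(), i, None))

scoreboard = collections.Counter({'a': 2, 'b': 1})
nextthing = pick_by_count(scoreboard)
```

Note that islice still has to step through up to sum-of-counts elements, so each pick takes time proportional to the total count rather than the number of distinct keys.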

answered Oct 09 '22 17:10

Felix Loether

Given a dictionary of choices with corresponding relative weights (the counts, in your case), you can use random.choices, which was added in Python 3.6, like so:

import random

my_dict = {
    "choice a" : 1, # will in this case be chosen 1/3 of the time
    "choice b" : 2, # will in this case be chosen 2/3 of the time
}

choice = random.choices(*zip(*my_dict.items()))[0]

For your code that uses Counter, you can do the same thing, because Counter is a dict subclass and therefore also has the items() method.

import collections
import random

my_dict = collections.Counter(a=1, b=2, c=3)
choice = random.choices(*zip(*my_dict.items()))[0]

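Explanation: my_dict.items() yields the pairs ('a', 1), ('b', 2), ('c', 3).
So zip(*my_dict.items()) yields the two tuples ('a', 'b', 'c') and (1, 2, 3).
Unpacking those with * gives random.choices(('a', 'b', 'c'), (1, 2, 3)), where the second argument is the weights, which is exactly what you want.
random.choices returns a list (of length 1 by default), hence the [0].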
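
If the zip(*...) unpacking feels too terse, the same call can be spelled out with the keys and counts passed separately. This is just an equivalent sketch; the variable names are illustrative, and k=5 is an arbitrary choice to show drawing several picks at once:

```python
import collections
import random

scoreboard = collections.Counter(a=1, b=2, c=3)

# keys() and values() iterate in the same order, so the two lists line up.
things = list(scoreboard.keys())
counts = list(scoreboard.values())

# Draw five picks (with replacement), each weighted by its count.
picks = random.choices(things, weights=counts, k=5)
nextthing = picks[0]
```

Unlike the elements()-based approach, this does not expand the counts into repeated elements, so the work depends on the number of distinct keys rather than on the total count.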

answered Oct 09 '22 17:10

pbsds

Tags: python, iterator, random, counter, weighted
