The problem I have is as follows I have a 1-D list of integers (or np.array) with 3 values <pre class="prettyprint"><code>l = [0,1,2] </code></pre> I have a 2-D list of probabilities (for simplicity, we'll use two rows) <pre class="prettyprint"><code>P = [[0.8, 0.1, 0.1], [0.3, 0.3, 0.4]] </code></pre> What I want is <code>numpy.random.choice(a=l, p=P)</code>, where each row in P (probability distribution) is applied to l. So, I want a random sample to be drawn from [0,1,2] with prob. dist. [0.8, 0.1, 0.1] first, then with prob. dist. [0.3, 0.3, 0.4] next, to give me two outputs. ===== Update ====== I can use for loops or list comprehension, but I am looking for a fast/vectorized solution.

Here's one way. Here's the array of probabilities: <pre class="prettyprint"><code>In [161]: p Out[161]: array([[ 0.8 , 0.1 , 0.1 ], [ 0.3 , 0.3 , 0.4 ], [ 0.25, 0.5 , 0.25]]) </code></pre> <code>c</code> holds the cumulative distributions: <pre class="prettyprint"><code>In [162]: c = p.cumsum(axis=1) </code></pre> Generate a set of uniformly distributed samples... <pre class="prettyprint"><code>In [163]: u = np.random.rand(len(c), 1) </code></pre> ...and then see where they "fit" in <code>c</code>: <pre class="prettyprint"><code>In [164]: choices = (u < c).argmax(axis=1) In [165]: choices Out[165]: array([1, 2, 2]) </code></pre>

How to apply numpy random.choice to a matrix of probability values (Vectorized solution)

Tags:

python

numpy

The problem I have is as follows

I have a 1-D list of integers (or np.array) with 3 values

l = [0,1,2]

I have a 2-D list of probabilities (for simplicity, we'll use two rows)

P = 
[[0.8, 0.1, 0.1],
 [0.3, 0.3, 0.4]]

What I want is numpy.random.choice(a=l, p=P), where each row in P (probability distribution) is applied to l. So, I want a random sample to be drawn from [0,1,2] with prob. dist. [0.8, 0.1, 0.1] first, then with prob. dist. [0.3, 0.3, 0.4] next, to give me two outputs.

===== Update ======

I can use for loops or list comprehension, but I am looking for a fast/vectorized solution.

615

asked Nov 07 '16 20:11

max_max_mir

1 Answers

Here's one way.

Here's the array of probabilities:

In [161]: p
Out[161]: 
array([[ 0.8 ,  0.1 ,  0.1 ],
       [ 0.3 ,  0.3 ,  0.4 ],
       [ 0.25,  0.5 ,  0.25]])

c holds the cumulative distributions:

In [162]: c = p.cumsum(axis=1)

Generate a set of uniformly distributed samples...

In [163]: u = np.random.rand(len(c), 1)

...and then see where they "fit" in c:

In [164]: choices = (u < c).argmax(axis=1)

In [165]: choices
Out[165]: array([1, 2, 2])

125

answered Oct 17 '22 22:10

Warren Weckesser

Related questions
                            
                                Python 2 __missing__ method
                            
                                How convert output tensor to one-hot tensor?
                            
                                A DRY approach to Python try-except blocks?
                            
                                Python open html file, take screenshot, crop and save as image
                            
                                Reading in file block by block using specified delimiter in python
                            
                                python map function with min argument and two lists
                            
                                Django Error: Your URL pattern is invalid. Ensure that urlpatterns is a list of url() instances
                            
                                Function annotation for subclasses of abstract class
                            
                                Convert complex NumPy array into (n, 2)-array of real and imaginary parts
                            
                                pd.Timedelta conversion on a dataframe column
                            
                                Django form. How hide colon from initial_text?
                            
                                lxml xsi:schemaLocation namespace URI validation issue
                            
                                Install Matlab engine in Anaconda Python (Linux)
                            
                                how to trigger function in another object when variable changed. Python
                            
                                "Stratify" parameter from sklearn's train_test_split not working correctly?
                            
                                How to get a list of matchable characters from a regex class
                            
                                Pandas Plot with Index causes 'KeyError [] not in index'
                            
                                Regex to remove periods in acronyms?
                            
                                Python shallow copy and deep copy in using append method
                            
                                Pandas sort row values

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With