When I use this random generator: <code>numpy.random.multinomial</code>, I keep getting: <pre class="prettyprint"><code>ValueError: sum(pvals[:-1]) > 1.0 </code></pre> I am always passing the output of this softmax function: <pre class="prettyprint"><code>def softmax(w, t = 1.0): e = numpy.exp(numpy.array(w) / t) dist = e / np.sum(e) return dist </code></pre> except now that I am getting this error, I also added this for the parameter (<code>pvals</code>): <pre class="prettyprint"><code>while numpy.sum(pvals) > 1: pvals /= (1+1e-5) </code></pre> but that didn't solve it. What is the right way to make sure I avoid this error? EDIT: here is function that includes this code <pre class="prettyprint"><code>def get_MDN_prediction(vec): coeffs = vec[::3] means = vec[1::3] stds = np.log(1+np.exp(vec[2::3])) stds = np.maximum(stds, min_std) coe = softmax(coeffs) while np.sum(coe) > 1-1e-9: coe /= (1+1e-5) coeff = unhot(np.random.multinomial(1, coe)) return np.random.normal(means[coeff], stds[coeff]) </code></pre>

I also encountered this problem during my language modelling work. The root of this problem rises from numpy's implicit data casting: the output of my sorfmax() is in <code>float32</code> type, however, <code>numpy.random.multinomial()</code> will cast the <code>pval</code> into <code>float64</code> type IMPLICITLY. This data type casting would cause <code>pval.sum()</code> exceed 1.0 sometimes due to numerical rounding. This issue is recognized and posted here

I know the question is old but since I faced the same problem just now, it seems to me it's still valid. Here's the solution I've found for it: <pre class="prettyprint"> a = np.asarray(a).astype('float64') a = a / np.sum(a) b = np.random.multinomial(1, a, 1) </pre> I've made the important part bold. If you omit that part the problem you've mentioned will happen from time to time. But if you change the type of array into float64, it will never happen.

The <code>softmax</code> implementation I was using is not stable enough for the values I was using it with. As a result, sometimes the output has a sum greater than <code>1</code> (e.g. <code>1.0000024...</code>). This case should be handled by the while loop. But sometimes the output contains NaNs, in which case the loop is never triggered, and the error persists. Also, <code>numpy.random.multinomial</code> doesn't raise an error if it sees a NaN. Here is what I'm using right now, instead: <pre class="prettyprint"><code>def softmax(vec): vec -= min(A(vec)) if max(vec) > 700: a = np.argsort(vec) aa = np.argsort(a) vec = vec[a] i = 0 while max(vec) > 700: i += 1 vec -= vec[i] vec = vec[aa] e = np.exp(vec) return e/np.sum(e) def sample_multinomial(w): """ Sample multinomial distribution with parameters given by softmax of w Returns an int """ p = softmax(w) x = np.random.uniform(0,1) for i,v in enumerate(np.cumsum(p)): if x < v: return i return len(p)-1 # shouldn't happen... </code></pre>

How can I avoid value errors when using numpy.random.multinomial?

Tags:

python

random

numpy

numerical-stability

When I use this random generator: numpy.random.multinomial, I keep getting:

ValueError: sum(pvals[:-1]) > 1.0

I am always passing the output of this softmax function:

def softmax(w, t = 1.0):
    e = numpy.exp(numpy.array(w) / t)
    dist = e / np.sum(e)
    return dist

except now that I am getting this error, I also added this for the parameter (pvals):

while numpy.sum(pvals) > 1:
    pvals /= (1+1e-5)

but that didn't solve it. What is the right way to make sure I avoid this error?

EDIT: here is function that includes this code

def get_MDN_prediction(vec):
    coeffs = vec[::3]
    means = vec[1::3]
    stds = np.log(1+np.exp(vec[2::3]))
    stds = np.maximum(stds, min_std)
    coe = softmax(coeffs)
    while np.sum(coe) > 1-1e-9:
        coe /= (1+1e-5)
    coeff = unhot(np.random.multinomial(1, coe))
    return np.random.normal(means[coeff], stds[coeff])

487

asked Apr 24 '14 00:04

capybaralet

4 Answers

I also encountered this problem during my language modelling work.

The root of this problem rises from numpy's implicit data casting: the output of my sorfmax() is in float32 type, however, numpy.random.multinomial() will cast the pval into float64 type IMPLICITLY. This data type casting would cause pval.sum() exceed 1.0 sometimes due to numerical rounding.

This issue is recognized and posted here

200

answered Oct 19 '22 15:10

Jedi

I know the question is old but since I faced the same problem just now, it seems to me it's still valid. Here's the solution I've found for it:

a = np.asarray(a).astype('float64')
a = a / np.sum(a)
b = np.random.multinomial(1, a, 1)

I've made the important part bold. If you omit that part the problem you've mentioned will happen from time to time. But if you change the type of array into float64, it will never happen.

answered Oct 19 '22 15:10

Mehran

Something that few people noticed: a robust version of the softmax can be easily obtained by removing the logsumexp from the values:

from scipy.misc import logsumexp

def log_softmax(vec):
    return vec - logsumexp(vec)

def softmax(vec):
    return np.exp(log_softmax(vec))

Just check it:

print(softmax(np.array([1.0, 0.0, -1.0, 1.1])))

Simple, isn't it?

answered Oct 19 '22 15:10

Guillaume

The softmax implementation I was using is not stable enough for the values I was using it with. As a result, sometimes the output has a sum greater than 1 (e.g. 1.0000024...).

This case should be handled by the while loop. But sometimes the output contains NaNs, in which case the loop is never triggered, and the error persists.

Also, numpy.random.multinomial doesn't raise an error if it sees a NaN.

Here is what I'm using right now, instead:

def softmax(vec):
    vec -= min(A(vec))
    if max(vec) > 700:
        a = np.argsort(vec)
        aa = np.argsort(a)
        vec = vec[a]
        i = 0
        while max(vec) > 700:
            i += 1
            vec -= vec[i]
        vec = vec[aa]
    e = np.exp(vec)
    return e/np.sum(e)

def sample_multinomial(w):
    """
       Sample multinomial distribution with parameters given by softmax of w
       Returns an int    
    """
    p = softmax(w)
    x = np.random.uniform(0,1)
    for i,v in enumerate(np.cumsum(p)):
        if x < v: return i
    return len(p)-1 # shouldn't happen...

answered Oct 19 '22 15:10

capybaralet

Related questions
                            
                                How to calculate mean in python?
                            
                                How to generate a random graph given the number of nodes and edges?
                            
                                How do you pull WEEKLY historical data from yahoo finance?
                            
                                Compute daily climatology using pandas python
                            
                                What is the best way to alias method names in python?
                            
                                How to embed Python code into YAML?
                            
                                Using continue in python ternary? [duplicate]
                            
                                Crop out partial image using NumPy (or SciPy)
                            
                                Negate a Q object in Django
                            
                                Python floating point determinism
                            
                                Overwrite the previous print value in python?
                            
                                How can I convert python class with slots to dictionary?
                            
                                Python: Covariance matrix by hand
                            
                                Pygraphviz / networkx set node level or layer
                            
                                Python Multiprocessing Worker/Queue
                            
                                Can you explain the following function?
                            
                                How to calculate 3D object points from 2D image points using stereo triangulation?
                            
                                Using variable as keyword passed to **kwargs in Python
                            
                                ElementTree findall 'or' operator
                            
                                Does enumerate create a copy of its argument?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I avoid value errors when using numpy.random.multinomial?

Tags:

python

random

numpy

numerical-stability

capybaralet

People also ask

4 Answers

Jedi

Mehran

Guillaume

capybaralet

Recent Activity

Donate For Us