I am trying to implement softmax at the end of a CNN. The output I get is NaNs and zeros. I am giving very large input values to softmax, around 10-20k; for example, the array X = [2345, 3456, 6543, -6789, -9234].
My function is

    import numpy as np

    def softmax(X):
        B = np.exp(X)
        C = np.sum(np.exp(X))
        return B / C
I am getting a true-divide RuntimeWarning:

    C:\Anaconda\envs\deep_learning\lib\site-packages\ipykernel_launcher.py:4: RuntimeWarning: invalid value encountered in true_divide
      after removing the cwd from sys.path.
A common problem when applying softmax is numeric stability: the sum ∑j e^(z_j) can become very large because of the exponential, and overflow errors may occur. This overflow can be avoided by subtracting the maximum value of the array from each element before exponentiating.
np.exp(z) / np.sum(np.exp(z), axis=1, keepdims=True) reaches the same result as your softmax function when z is a 2-D batch of row vectors (for a single 1-D array, drop the axis argument).
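Putting both ideas together, a numerically stable, batched version might look like the sketch below (the name stable_softmax and the example scores are just illustrative assumptions, not code from the question):

    import numpy as np

    def stable_softmax(z):
        # Subtract the row-wise max before exponentiating; softmax is
        # shift-invariant, so this changes nothing mathematically but
        # keeps np.exp from overflowing to inf.
        shifted = z - np.max(z, axis=1, keepdims=True)
        exps = np.exp(shifted)
        return exps / np.sum(exps, axis=1, keepdims=True)

    # a hypothetical batch of two score vectors, one with huge magnitudes
    scores = np.array([[2345.0, 3456.0, 6543.0],
                       [10.0, 20.0, 30.0]])
    print(stable_softmax(scores))  # each row sums to 1, no nan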
In short, Softmax Loss is actually just a Softmax Activation plus a Cross-Entropy Loss. Softmax is an activation function that outputs a probability for each class, and these probabilities sum to one. Cross-entropy loss is then the sum, over examples, of the negative logarithm of the probability predicted for the true class.
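As a rough sketch of that relationship (the score values and class index below are made up for illustration):

    import numpy as np

    def softmax(z):
        # stable softmax over a 1-D score vector
        exps = np.exp(z - np.max(z))
        return exps / np.sum(exps)

    def cross_entropy(z, true_class):
        # negative log of the probability assigned to the true class
        probs = softmax(z)
        return -np.log(probs[true_class])

    scores = np.array([2.0, 1.0, 0.1])
    print(cross_entropy(scores, true_class=0))  # ~0.417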
The softmax function turns a vector of K real values into a vector of K real values that sum to 1. The input values can be positive, negative, zero, or greater than one, but the softmax transforms them into values between 0 and 1, so that they can be interpreted as probabilities.
According to the softmax function, you need to iterate over all elements in the array, compute the exponential of each individual element, then divide it by the sum of the exponentials of all elements:
    import numpy as np

    a = [1, 3, 5]
    for i in a:
        print(np.exp(i) / np.sum(np.exp(a)))

    0.015876239976466765
    0.11731042782619837
    0.8668133321973349
However, if the numbers are too big, the exponentials blow up: np.exp overflows float64 for arguments above roughly 709, producing inf, and inf/inf evaluates to nan:
    a = [2345, 3456, 6543]
    for i in a:
        print(np.exp(i) / np.sum(np.exp(a)))

    __main__:2: RuntimeWarning: invalid value encountered in double_scalars
    nan
    nan
    nan
To avoid this, first shift the values so that the highest one becomes zero, then compute the softmax. For example, to compute the softmax of [1, 3, 5], use [1-5, 3-5, 5-5], which is [-4, -2, 0]. You may also choose to implement it in a vectorized way (as you intended to do in the question):
    def softmax(x):
        f = np.exp(x - np.max(x))  # shift values so the max becomes 0
        return f / f.sum(axis=0)

    softmax([1, 3, 5])
    # prints: array([0.01587624, 0.11731043, 0.86681333])

    softmax([2345, 3456, 6543, -6789, -9234])
    # prints: array([0., 0., 1., 0., 0.])
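Note that [0., 0., 1., 0., 0.] is the mathematically expected answer here, not a failure: the gaps between your scores are in the thousands, so after shifting, every non-maximal entry is the exponential of a number like -3087 or smaller, which underflows to exactly 0 in float64.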
For detailed information, check out the cs231n course page; the "Practical issues: Numeric stability" heading is exactly what I'm trying to explain.