what is the difference between using softmax as a sequential layer in tf.keras and softmax as an activation function for a dense layer?
tf.keras.layers.Dense(10, activation=tf.nn.softmax)
and
tf.keras.layers.Softmax(10)
Softmax is often used as the activation for the last layer of a classification network because the result can be interpreted as a probability distribution. The softmax of a vector x is computed as exp(x) / tf.reduce_sum(exp(x)). The input values are the log-odds of the resulting probabilities.
Activation functions are a critical part of the design of a neural network. The choice of activation function in the hidden layer will control how well the network model learns the training dataset. The choice of activation function in the output layer will define the type of predictions the model can make.
Softmax is a mathematical function that converts a vector of numbers into a vector of probabilities, where the probabilities of each value are proportional to the relative scale of each value in the vector.
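As a quick sanity check, here is a minimal NumPy sketch of that formula (the input vector is made up for illustration):

```python
import numpy as np

# a hypothetical vector of logits
x = np.array([1.0, 2.0, 3.0])

# softmax: exp(x) / sum(exp(x))
probs = np.exp(x) / np.sum(np.exp(x))

print(probs)        # each entry proportional to exp of the input
print(probs.sum())  # sums to 1, so it reads as a probability distribution
```

Larger inputs get exponentially more of the probability mass, which is why the output preserves the relative ordering of the logits.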
A Keras Dense layer computes the dot product of the input tensor and the weight kernel matrix, adds a bias vector, and then applies the activation element-wise to the output values.
They are the same; you can test it yourself:
import numpy as np
import tensorflow as tf
from tensorflow.keras.layers import Dense, Softmax

# generate data
x = np.random.uniform(0, 1, (5, 20)).astype('float32')

# 1st option: softmax as the Dense layer's activation
X = Dense(10, activation=tf.nn.softmax)
A = X(x)

# 2nd option: the dense computation by hand, then a Softmax layer
w, b = X.get_weights()
B = Softmax()(tf.matmul(x, w) + b)

tf.reduce_all(A == B)
# <tf.Tensor: shape=(), dtype=bool, numpy=True>
Also note that tf.keras.layers.Softmax does not take a number of units; it is a plain activation layer. Its first argument is axis, so Softmax(10) in the question would set the axis to 10, not the output size.
By default, the softmax is computed over the last axis (axis=-1). If your outputs have more than 2 dimensions and you want to apply softmax along another axis, you can change this easily in the second option.
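To illustrate the axis argument, a small sketch on a made-up 3-D tensor (shape and values are arbitrary):

```python
import numpy as np
import tensorflow as tf

# hypothetical 3-D tensor: (batch, rows, classes)
x = np.random.uniform(0, 1, (2, 3, 4)).astype('float32')

# default: softmax over the last axis
last = tf.keras.layers.Softmax()(x)

# softmax over axis 1 instead
ax1 = tf.keras.layers.Softmax(axis=1)(x)

# each slice along the chosen axis now sums to 1
print(tf.reduce_sum(last, axis=-1))  # all ones
print(tf.reduce_sum(ax1, axis=1))    # all ones
```

Whichever axis you pick is the one whose entries are normalized into a probability distribution.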