
What is the definition of a non-trainable parameter?

What is the definition of a non-trainable parameter in a model?

For example, while you are building your own model, its non-trainable parameter count is 0 by default, but when you use a pretrained model such as Inception, it becomes something other than 0. What would be the reason for that?

TheWho asked Nov 15 '17 16:11

People also ask

What is non trainable parameters?

Non-trainable parameters are those whose values are not optimized during training according to their gradients. In a batch-normalization layer, for example, the moving mean and the moving standard deviation are non-trainable parameters.

Which of the following layers does not involve any trainable parameters?

Input layer: the input layer has nothing to learn; at its core, all it does is declare the input image's shape. So there are no learnable parameters here, and its number of parameters = 0.
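As an illustrative sketch (the helper name is made up, but the formula is the standard one): a fully-connected layer has `inputs × units` weights plus one bias per unit, while the input layer contributes nothing:

```python
def dense_param_count(n_inputs: int, n_units: int) -> int:
    """Weights (n_inputs * n_units) plus one bias per unit."""
    return n_inputs * n_units + n_units

# An input layer only declares a shape, so it has no parameters.
input_layer_params = 0

# e.g. a Dense(64) layer fed 784 inputs:
print(dense_param_count(784, 64))  # 784*64 + 64 = 50240
```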

How do you make layers non trainable?

You can pass a trainable argument (boolean) to a layer constructor to set a layer to be non-trainable. Additionally, you can set the trainable property of a layer to True or False after instantiation. For this to take effect, you will need to call compile() on your model after modifying the trainable property.


1 Answer

In Keras, the non-trainable parameters (as shown in model.summary()) are the weights that are not updated during training with backpropagation.

There are mainly two types of non-trainable weights:

  • The ones that you have chosen to keep constant during training. Keras won't update these weights during training at all.
  • The ones that work like statistics in BatchNormalization layers. They're updated with the batch mean and variance, but they're not "trained with backpropagation".
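The second case can be sketched with a simple count (the helper name is invented; the per-channel breakdown is standard for BatchNormalization): each channel has two trainable weights, gamma and beta, and two non-trainable statistics, the moving mean and the moving variance. This is why model.summary() reports non-trainable parameters even when every layer is trainable:

```python
def batchnorm_param_counts(n_channels: int) -> tuple:
    """Return (trainable, non_trainable) parameter counts for a
    BatchNormalization layer normalizing n_channels features."""
    trainable = 2 * n_channels       # gamma (scale) and beta (offset)
    non_trainable = 2 * n_channels   # moving mean and moving variance
    return trainable, non_trainable

print(batchnorm_param_counts(64))  # (128, 128)
```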

Weights are the values inside the network that perform the operations and can be adjusted so the network produces the result we want. The backpropagation algorithm changes the weights towards a lower error at the end.

By default, all weights in a keras model are trainable.

When you create layers, each layer internally creates its own weights, and they're trainable (the backpropagation algorithm will update these weights).

When you make them untrainable, the algorithm will not update these weights anymore. This is useful, for instance, when you want a convolutional layer to apply a specific fixed filter, such as a Sobel filter. You don't want training to change this operation, so these weights/filters should be kept constant.
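A minimal NumPy sketch of that idea (the convolution helper is hand-rolled for illustration, not a Keras API): the Sobel kernel is a fixed array, and freezing the layer simply means this array is never updated:

```python
import numpy as np

# Horizontal-gradient Sobel kernel: fixed weights training should never touch.
SOBEL_X = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], dtype=float)

def conv2d_valid(image: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """Plain 'valid' 2D cross-correlation, as a conv layer would apply it."""
    kh, kw = kernel.shape
    h = image.shape[0] - kh + 1
    w = image.shape[1] - kw + 1
    out = np.empty((h, w))
    for i in range(h):
        for j in range(w):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# A vertical edge: zeros on the left, ones on the right.
img = np.zeros((5, 5))
img[:, 3:] = 1.0
edges = conv2d_valid(img, SOBEL_X)  # strong response at the edge, zero elsewhere
```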

There are many other reasons why you might want to make weights untrainable.


Changing parameters:

To decide whether a layer's weights are trainable or not, take the layer from the model and set its trainable property:

model.get_layer(layerName).trainable = False  # or True

This must be done before compilation.
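A short end-to-end sketch of this (the layer names and sizes here are arbitrary, assuming tensorflow.keras): freeze one layer, recompile, and the parameter counts split into trainable and non-trainable accordingly:

```python
import numpy as np
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.Input(shape=(3,)),
    tf.keras.layers.Dense(4, name="frozen_dense"),  # 3*4 + 4 = 16 params
    tf.keras.layers.Dense(2, name="head"),          # 4*2 + 2 = 10 params
])

# Freeze the first Dense layer, then (re)compile so the change takes effect.
model.get_layer("frozen_dense").trainable = False
model.compile(optimizer="adam", loss="mse")

trainable = int(sum(np.prod(w.shape) for w in model.trainable_weights))
non_trainable = int(sum(np.prod(w.shape) for w in model.non_trainable_weights))
print(trainable, non_trainable)  # 10 16
```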

Daniel Möller answered Nov 11 '22 04:11