In TensorFlow, what is the difference between trainable and stop_gradient?

Tags:

python

machine-learning

tensorflow

deep-learning

I would like to know the difference between the option trainable=False and tf.stop_gradient(). If I set trainable to False, will my optimizer ignore the variable during training? Does this option keep the variable at a constant value throughout training?

658

asked Aug 10 '17 11:08

pratsbhatt

1 Answer

trainable=False

Here the variable is left out of the set of trainable variables, so by default the optimizer does not create a gradient update op for it. Its value stays constant throughout training unless you assign to it explicitly.
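
To make that concrete, here is a minimal plain-Python sketch (not TensorFlow) of what the flag amounts to: the gradient with respect to a frozen variable can still be computed, but the update step skips it. The `Var` class and `sgd_step` helper are hypothetical names made up for this illustration, and the gradients are derived by hand for a toy scalar model. In TensorFlow 1.x the corresponding mechanism is that `minimize()` only updates the variables in the trainable-variables collection unless you pass `var_list` yourself.

```python
class Var:
    """Toy stand-in for a variable with a `trainable` flag (not a TensorFlow class)."""
    def __init__(self, value, trainable=True):
        self.value = value
        self.trainable = trainable


def sgd_step(variables, grads, lr):
    """Apply one gradient-descent step, skipping non-trainable variables."""
    for v, g in zip(variables, grads):
        if v.trainable:
            v.value -= lr * g


# Toy model: y_pred = w2 * (w1 * x), cost = (y_pred - y) ** 2
w1 = Var(2.0, trainable=False)  # frozen
w2 = Var(3.0)                   # trained
x, y = 1.0, 0.0

y1 = w1.value * x
err = w2.value * y1 - y

# Hand-derived gradients of the cost (chain rule).
grad_w1 = 2 * err * w2.value * x  # non-zero: the gradient still exists
grad_w2 = 2 * err * y1

sgd_step([w1, w2], [grad_w1, grad_w2], lr=0.01)
print(w1.value, w2.value)
```

After the step `w1` is unchanged even though its gradient is non-zero, while `w2` has moved. The freeze applies to the variable everywhere it is used, whichever cost is being minimized.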

stop_gradient

In some situations you want to compute the gradient of an op with respect to some variables while holding a few others constant, yet for other ops you still need gradients with respect to those same variables. You can't use trainable=False here, because those variables still have to be trained through the other ops.

stop_gradient works at the level of ops: you can optimize an op with respect to a select few variables while treating the others as constants.

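import tensorflow as tf  # TensorFlow 1.x API

# x, y, W1, b1, W2, b2 and cost_function are assumed to be defined elsewhere
y1 = tf.stop_gradient(tf.matmul(W1, x) + b1)
y2 = tf.matmul(W2, y1) + b2
cost = cost_function(y2, y)
# this op won't optimize the cost with respect to W1 and b1
train_op_w2_b2 = tf.train.MomentumOptimizer(0.001, 0.9).minimize(cost)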

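# w1_shape is a placeholder for the variable's shape (not given in the original)
W1 = tf.get_variable('w1', shape=w1_shape, trainable=False)
y1 = tf.matmul(W1, x) + b1
y2 = tf.matmul(W2, y1) + b2
cost = cost_function(y2, y)
# this op won't optimize the cost with respect to W1
train_op = tf.train.MomentumOptimizer(0.001, 0.9).minimize(cost)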
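
If you want to see the shared-variable case in isolation, here is a rough sketch using a tiny hand-rolled scalar autodiff in plain Python (not TensorFlow). The `Value` class, this `stop_gradient` function, the second cost `cost_b` and all the numbers are assumptions made up for the illustration. Cost A reads the hidden value only through `stop_gradient`, while cost B uses it directly, so `w1` and `b1` stay trainable through B.

```python
class Value:
    """Minimal scalar reverse-mode autodiff node (toy, for illustration only)."""
    def __init__(self, data, parents=(), local_grads=()):
        self.data = data
        self.grad = 0.0
        self.parents = parents          # upstream nodes
        self.local_grads = local_grads  # d(self)/d(parent), one per parent

    def __add__(self, other):
        return Value(self.data + other.data, (self, other), (1.0, 1.0))

    def __sub__(self, other):
        return Value(self.data - other.data, (self, other), (1.0, -1.0))

    def __mul__(self, other):
        return Value(self.data * other.data, (self, other), (other.data, self.data))

    def backward(self):
        # Reverse topological order: a node's grad is complete before it is
        # pushed on to its parents.
        order, seen = [], set()

        def visit(node):
            if id(node) not in seen:
                seen.add(id(node))
                for p in node.parents:
                    visit(p)
                order.append(node)

        visit(self)
        self.grad = 1.0
        for node in reversed(order):
            for p, g in zip(node.parents, node.local_grads):
                p.grad += g * node.grad


def stop_gradient(v):
    """Same forward value, but disconnected from everything upstream."""
    return Value(v.data)


w1, b1, w2, b2 = Value(2.0), Value(0.5), Value(3.0), Value(-1.0)
x, y, target = Value(1.5), Value(4.0), Value(1.0)

y1 = w1 * x + b1  # hidden value shared by both costs

# Cost A sees y1 only through stop_gradient, so it can train w2 and b2 only.
diff_a = w2 * stop_gradient(y1) + b2 - y
cost_a = diff_a * diff_a
cost_a.backward()
grads_a = {"w1": w1.grad, "b1": b1.grad, "w2": w2.grad, "b2": b2.grad}

# Reset the accumulated gradients before differentiating the second cost.
for v in (w1, b1, w2, b2):
    v.grad = 0.0

# Cost B uses y1 directly, so w1 and b1 are still trained through it.
diff_b = y1 - target
cost_b = diff_b * diff_b
cost_b.backward()
grads_b = {"w1": w1.grad, "b1": b1.grad, "w2": w2.grad, "b2": b2.grad}

print(grads_a)
print(grads_b)
```

Cost A leaves `w1` and `b1` with no gradient, yet cost B still produces one for them. That is the situation where trainable=False would be too blunt, since it would freeze `w1` for both costs. As far as I know, TensorFlow reports a blocked gradient like this as `None` rather than `0.0`.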

108

answered Sep 30 '22 18:09

Ishant Mrinal
