I have a TensorFlow computational graph for a loss tensor L that depends on two tf.Variables, A and B.
I'd like to run gradient ascent on variable A (A += gradient of L w.r.t. A) while holding B fixed, and vice versa: run gradient ascent on B (B += gradient of L w.r.t. B) while holding A fixed. How do I do this?
Calling minimize() takes care of both computing the gradients and applying them to the variables. If you want to process the gradients before applying them, you can instead use the optimizer in three steps: compute the gradients with tf.GradientTape, process them as you wish, and apply the processed gradients with apply_gradients().
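As a rough sketch of that three-step pattern in TF 2.x (the variables, the toy loss, and the SGD optimizer below are illustrative assumptions, not from the question):

import tensorflow as tf

A = tf.Variable(1.0)
B = tf.Variable(2.0)
opt = tf.keras.optimizers.SGD(learning_rate=0.1)

with tf.GradientTape() as tape:
    loss = -(A * B)                                   # minimizing -L is gradient ascent on L

grads = tape.gradient(loss, [A])                      # 1. compute gradients (only w.r.t. A)
grads = [tf.clip_by_norm(g, 1.0) for g in grads]      # 2. process them as you wish
opt.apply_gradients(zip(grads, [A]))                  # 3. apply them; B is never touched

Because only A is passed to tape.gradient and apply_gradients, B is automatically held fixed during this update.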
tf.stop_gradient(tensor)
might be what you are looking for. The tensor will be treated as a constant for gradient-computation purposes, so you can create two losses with different parts treated as constants.
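For example, a minimal sketch of that idea (the product A * B stands in for your actual L, and the leading minus sign turns minimization into ascent on L):

# Gradients of loss_a flow only into A; gradients of loss_b only into B.
loss_a = -(A * tf.stop_gradient(B))   # ascent on A with B treated as a constant
loss_b = -(tf.stop_gradient(A) * B)   # ascent on B with A treated as a constant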
The other option (and often the better one) is to create two optimizers but explicitly optimize only subsets of the variables, e.g.
train_a = tf.train.GradientDescentOptimizer(0.1).minimize(loss_a, var_list=[A])  # only A is in var_list, so only A is updated
train_b = tf.train.GradientDescentOptimizer(0.1).minimize(loss_b, var_list=[B])  # only B is in var_list, so only B is updated
and you can alternate between them when running the updates.
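With the TF 1.x graph API used above, alternating the two ops might look like this (assuming loss_a and loss_b are negated versions of L, so that minimizing them performs gradient ascent on L):

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for step in range(100):
        sess.run(train_a)   # updates only A; B stays fixed
        sess.run(train_b)   # updates only B; A stays fixed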