Considering the example code, I would like to know how to apply gradient clipping to this RNN, where there is a possibility of exploding gradients.
tf.clip_by_value(t, clip_value_min, clip_value_max, name=None)
This is an example that could be used, but where do I introduce it? In the definition of the RNN?
lstm_cell = rnn_cell.BasicLSTMCell(n_hidden, forget_bias=1.0)
# Split data because rnn cell needs a list of inputs for the RNN inner loop
_X = tf.split(0, n_steps, _X)  # n_steps
tf.clip_by_value(_X, -1, 1, name=None)
But this doesn't make sense, as the tensor _X is the input, not the gradient that is to be clipped.
Do I have to define my own optimizer for this, or is there a simpler option?
Gradient clipping helps mainly in the case of exploding gradients. If your loss becomes very large, exponentially growing gradients can flow back through the network and may result in NaN values. To overcome this, we clip the gradients to a specific range (-1 to 1, or whatever range fits your situation).
With norm-based gradient clipping, a pre-determined threshold is introduced, and gradients whose norm exceeds this threshold are scaled down to match it. This prevents any gradient from having a norm greater than the threshold.
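As a minimal sketch of the two flavours (the tensor g and the thresholds here are made up for illustration), tf.clip_by_value clips each element independently, while tf.clip_by_norm rescales the whole tensor when its L2 norm exceeds the threshold:

import tensorflow as tf

g = tf.constant([[2.0, -3.0], [0.5, 4.0]])  # stand-in for a gradient tensor

# Element-wise clipping: every entry is forced into [-1, 1]
clipped_by_value = tf.clip_by_value(g, -1.0, 1.0)

# Norm-based clipping: the whole tensor is rescaled so its L2 norm is at most 1.0
clipped_by_norm = tf.clip_by_norm(g, clip_norm=1.0)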
Gradient tapes: in TensorFlow 2, relevant operations executed inside the context of a tf.GradientTape are "recorded" onto a tape. TensorFlow then uses that tape to compute the gradients of the recorded computation using reverse-mode differentiation.
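For completeness, if you are on TensorFlow 2, a sketch of clipping inside a custom training step could look like the following; the model, loss function, and clip norm of 1.0 are arbitrary placeholders, not part of the original example:

import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.LSTM(32, input_shape=(28, 28)),
    tf.keras.layers.Dense(10),
])
loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
optimizer = tf.keras.optimizers.Adam(learning_rate=1e-3)

def train_step(x, y):
    # Record the forward pass on the tape
    with tf.GradientTape() as tape:
        logits = model(x, training=True)
        loss = loss_fn(y, logits)
    # Compute, clip, then apply the gradients
    grads = tape.gradient(loss, model.trainable_variables)
    grads, _ = tf.clip_by_global_norm(grads, clip_norm=1.0)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return loss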
In PyTorch, nn.utils.clip_grad_norm_ performs the equivalent gradient clipping. It is used to mitigate the problem of exploding gradients, which is of particular concern for recurrent networks (of which LSTMs are a type).
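For reference, the PyTorch pattern is to clip between backward() and step(); a minimal sketch with a dummy model and placeholder loss, just to show where the call goes:

import torch
from torch import nn

model = nn.LSTM(input_size=8, hidden_size=16, batch_first=True)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

x = torch.randn(4, 10, 8)   # (batch, time, features) dummy input
output, _ = model(x)
loss = output.mean()        # placeholder loss, just to produce gradients

optimizer.zero_grad()
loss.backward()
# Rescale all parameter gradients so their combined norm is at most max_norm
nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
optimizer.step()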
Gradient clipping needs to happen after computing the gradients, but before applying them to update the model's parameters. In your example, both of those steps are handled by the AdamOptimizer.minimize() method.
In order to clip your gradients, you'll need to explicitly compute, clip, and apply them, as described in TensorFlow's API documentation. Specifically, you'll need to substitute the call to the minimize() method with something like the following:
optimizer = tf.train.AdamOptimizer(learning_rate=learning_rate)
gvs = optimizer.compute_gradients(cost)
capped_gvs = [(tf.clip_by_value(grad, -1., 1.), var) for grad, var in gvs]
train_op = optimizer.apply_gradients(capped_gvs)
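If you would rather clip by the global norm across all variables (a common choice for RNNs), a sketch along the same lines; the clip_norm of 5.0 is an arbitrary choice, not something from the original question:

optimizer = tf.train.AdamOptimizer(learning_rate=learning_rate)
gvs = optimizer.compute_gradients(cost)
grads, variables = zip(*gvs)
# Rescale all gradients together so their combined norm is at most clip_norm
clipped_grads, _ = tf.clip_by_global_norm(grads, clip_norm=5.0)
train_op = optimizer.apply_gradients(zip(clipped_grads, variables))

Either way, the clipping is applied to the gradients themselves, after compute_gradients() and before apply_gradients(), which is where tf.clip_by_value belongs rather than on the input tensor _X.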