 

How to monitor gradient vanish and explosion in keras with tensorboard?

I would like to monitor the gradient changes in TensorBoard with Keras to decide whether the gradients are vanishing or exploding. What should I do?

Joey Chia asked Apr 26 '18



People also ask

How do you monitor gradients?

To check for vanishing or exploding gradients, pay attention to the gradient distribution and absolute values in the layer of interest (the "Distributions" tab): if the distribution is highly peaked and concentrated around 0, the gradients are probably vanishing. Here's a concrete example of how this looks in practice.

How do we counter exploding gradient problems in recurrent neural networks?

Another popular technique to mitigate the exploding gradients problem is to clip the gradients during backpropagation so that they never exceed some threshold. This is called gradient clipping. In Keras, an optimizer created with clipvalue=1.0 will clip every component of the gradient vector to a value between –1.0 and 1.0.
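For instance, a minimal sketch (assuming the Keras 2.x optimizer API, where clipvalue and clipnorm are standard arguments; the toy model is just a placeholder):

```python
from keras.models import Sequential
from keras.layers import Dense
from keras.optimizers import SGD

# Tiny placeholder model, only to show where the clipping is configured.
model = Sequential([Dense(1, input_shape=(10,))])

# clipvalue=1.0 clips every component of the gradient to [-1.0, 1.0];
# clipnorm=1.0 would instead rescale the whole gradient vector when its norm exceeds 1.
model.compile(optimizer=SGD(lr=0.01, clipvalue=1.0), loss='mse')
```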

How do you deal with vanishing gradients?

The vanishing gradient problem is caused by the derivative of the activation function used in the network. The simplest solution is to replace the activation function: instead of sigmoid, use an activation function such as ReLU.
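For example, a minimal sketch (Keras 2.x assumed; layer sizes are placeholders) that uses ReLU in the hidden layers instead of sigmoid:

```python
from keras.models import Sequential
from keras.layers import Dense

model = Sequential([
    Dense(64, activation='relu', input_shape=(20,)),  # 'relu' instead of 'sigmoid'
    Dense(64, activation='relu'),
    Dense(1, activation='sigmoid'),                   # sigmoid only at the output
])
model.compile(optimizer='adam', loss='binary_crossentropy')
```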


1 Answer

To visualize the training in TensorBoard, add a keras.callbacks.TensorBoard callback to the model.fit call. Don't forget to set write_grads=True to see the gradients there. Right after training starts, you can run...

tensorboard --logdir=/full_path_to_your_logs

... from the command line and point your browser to http://localhost:6006. See the example code in this question.
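Here's a minimal sketch of the setup (assuming standalone Keras 2.x, where the TensorBoard callback still accepts write_grads; the model and data are random placeholders):

```python
import numpy as np
import keras
from keras.models import Sequential
from keras.layers import Dense

model = Sequential([
    Dense(64, activation='relu', input_shape=(20,)),
    Dense(1, activation='sigmoid'),
])
model.compile(optimizer='sgd', loss='binary_crossentropy')

# histogram_freq must be > 0 for histograms (and hence gradients) to be logged;
# write_grads=True adds gradient histograms to the "Histograms"/"Distributions" tabs.
tb = keras.callbacks.TensorBoard(
    log_dir='./logs',
    histogram_freq=1,
    write_grads=True,
)

x = np.random.rand(256, 20)
y = np.random.randint(0, 2, size=(256, 1))

# Histogram logging needs validation data, hence the validation_split.
model.fit(x, y, epochs=5, validation_split=0.2, callbacks=[tb])
```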

To check for vanishing or exploding gradients, pay attention to the gradient distribution and absolute values in the layer of interest (the "Distributions" tab):

  • If the distribution is highly peaked and concentrated around 0, the gradients are probably vanishing. Here's a concrete example of how this looks in practice.
  • If the distribution is rapidly growing in absolute value over time, the gradients are exploding. Often the output values of that layer become NaNs very quickly as well.
Maxim answered Sep 17 '22
