I think I might have a problem with dead ReLUs, but I don't really know how to check for it with TensorBoard or any other way. Your help would be really appreciated.
The dying ReLU problem refers to ReLU neurons becoming inactive and outputting 0 for every input. There are many empirical and heuristic explanations of why ReLU neurons die.

It can be mitigated by using smaller learning rates, so that a large gradient update doesn't leave a ReLU neuron with weights and a bias so negative that it never activates again. Another fix is the Leaky ReLU, which lets neurons outside the active interval leak some gradient backward.

Leaky ReLU is a common and effective way to address the dying ReLU problem: it adds a slight slope in the negative range, so the function produces small negative outputs (rather than a hard zero) when the input is less than 0.
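As a quick illustration, here is a minimal sketch in TensorFlow 1.x of swapping the two activations; the input shape, layer size, and alpha=0.01 are just placeholder values, not anything from the answers below:

import tensorflow as tf

x = tf.placeholder(tf.float32, shape=[None, 128])      # hypothetical input batch
layer = tf.layers.dense(x, 64, activation=None)        # pre-activation output of a dense layer
relu_out = tf.nn.relu(layer)                           # zero output and zero gradient for inputs < 0, so a neuron can get stuck
leaky_out = tf.nn.leaky_relu(layer, alpha=0.01)        # small negative slope keeps a little gradient flowing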
I had this same question myself and couldn't find an answer, so here's how I'm doing it with TensorBoard (this assumes some familiarity with TensorBoard).
activation = tf.nn.relu(layer)  # post-ReLU activations, shape [batch_size, num_neurons]
# Inner count: per-neuron number of nonzero activations across the batch.
# Outer count: number of neurons that fired at least once this step.
active = tf.count_nonzero(tf.count_nonzero(activation, axis=0))
# Fraction of neurons in the layer that were active for this batch.
tf.summary.scalar('pct-active-neurons', tf.cast(active, tf.float32) / tf.cast(tf.shape(layer)[1], tf.float32))
In this snippet, activation is my post-ReLU activation for this particular layer. The first call, tf.count_nonzero(activation, axis=0), counts how many nonzero activations each neuron produced across all training examples in the current batch. The second call, tf.count_nonzero( ... ), which wraps the first, counts how many neurons in the layer had at least one activation for the batch of training examples in this step. Finally, I convert that to a percentage by dividing the number of neurons that had at least one activation in the training step by the total number of neurons in the layer.
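To make the counting concrete, here's a tiny toy example (TensorFlow 1.x; the 3x4 activation matrix and layer size are made up):

import tensorflow as tf

# Hypothetical post-ReLU activations: a batch of 3 examples, a layer of 4 neurons.
activation = tf.constant([[0., 1., 0., 2.],
                          [0., 3., 0., 0.],
                          [0., 0., 0., 1.]])
per_neuron = tf.count_nonzero(activation, axis=0)   # -> [0, 2, 0, 2]: nonzero activations per neuron
active = tf.count_nonzero(per_neuron)               # -> 2: neurons that fired at least once
fraction = tf.cast(active, tf.float32) / 4.0        # 4 neurons in this toy layer -> 0.5
with tf.Session() as sess:
    print(sess.run([per_neuron, active, fraction])) # [array([0, 2, 0, 2]), 2, 0.5]

Here neurons 0 and 2 never fired for this batch, so only half the layer counts as active for the step.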
More information on setting up TensorBoard can be found here.
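For reference, a rough sketch of the usual TF 1.x plumbing that actually writes the scalar so it appears in TensorBoard's Scalars tab; sess, train_op, feed, step, and the ./logs directory are stand-ins for whatever your own training loop uses:

merged = tf.summary.merge_all()                           # picks up the 'pct-active-neurons' scalar above
writer = tf.summary.FileWriter('./logs', sess.graph)      # any log directory works
summary_str, _ = sess.run([merged, train_op], feed_dict=feed)
writer.add_summary(summary_str, global_step=step)         # then run: tensorboard --logdir ./logs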