How can I compute KL divergence in Keras while using TensorFlow as the backend? I compute the L1 loss as follows:
def l1_loss(y_true, y_pred):
    return K.sum(K.abs(y_pred - y_true), axis=-1)
KL divergence can be calculated as the negative sum, over each event, of the probability of the event in P multiplied by the log of the probability of the event in Q over the probability of the event in P. The value inside the sum is the divergence contributed by that event.
KL divergence is the relative entropy, i.e. the difference between cross-entropy and entropy; it measures how far the predicted probability distribution is from the actual one. It equals 0 when the predicted probability distribution is identical to the actual probability distribution.
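The formula above can be checked by hand on a small example. The distributions below are hypothetical values chosen only for illustration:

```python
import math

# Two discrete distributions over three events (hypothetical values).
p = [0.10, 0.40, 0.50]  # "actual" distribution P
q = [0.80, 0.15, 0.05]  # "predicted" distribution Q

# KL(P || Q) = sum_i p_i * log(p_i / q_i)
kl_pq = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))
print(kl_pq)  # positive, since P and Q differ

# KL(P || P) is exactly 0: identical distributions diverge by nothing.
kl_pp = sum(pi * math.log(pi / pi) for pi in p)
print(kl_pp)
```

Note that KL divergence is asymmetric: KL(P || Q) is generally not equal to KL(Q || P), which is why the argument order (true first, predicted second) matters in the loss functions below.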
Creating custom loss functions in Keras: a custom loss function is defined as a function that takes the true values and predicted values as parameters and returns an array of per-sample losses. That function can then be passed at the compile stage.
So, KL divergence in simple term is a measure of how two probability distributions (say 'p' and 'q') are different from each other. So this is exactly what we care about while calculating the loss function.
Keras already has KL divergence implemented; as can be seen here, the code is just:
def kullback_leibler_divergence(y_true, y_pred):
    y_true = K.clip(y_true, K.epsilon(), 1)
    y_pred = K.clip(y_pred, K.epsilon(), 1)
    return K.sum(y_true * K.log(y_true / y_pred), axis=-1)
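The same computation can be sanity-checked with a NumPy re-implementation (using a small constant in place of K.epsilon(), which defaults to 1e-7):

```python
import numpy as np

EPSILON = 1e-7  # stand-in for K.epsilon()

def kl_div(y_true, y_pred):
    # Mirrors the Keras implementation above, with np in place of K.
    # Clipping keeps probabilities away from 0 so the log stays finite.
    y_true = np.clip(y_true, EPSILON, 1)
    y_pred = np.clip(y_pred, EPSILON, 1)
    return np.sum(y_true * np.log(y_true / y_pred), axis=-1)

p = np.array([[0.1, 0.4, 0.5]])
q = np.array([[0.8, 0.15, 0.05]])
print(kl_div(p, q))  # positive, since p and q differ
print(kl_div(p, p))  # ~0 for identical distributions
```

The clipping also means a predicted probability of exactly 0 contributes a large but finite penalty instead of an infinite one.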
So just use kld, KLD, or kullback_leibler_divergence as the loss. In other words, you can simply use the tf.keras.losses.kullback_leibler_divergence function.
If you want to use it as an activity regularizer, you can create a simple regularization function:
import keras  # if using keras
# from tensorflow import keras  # if using tf.keras

kullback_leibler_divergence = keras.losses.kullback_leibler_divergence
K = keras.backend

def kl_divergence_regularizer(inputs):
    means = K.mean(inputs, axis=0)
    return 0.01 * (kullback_leibler_divergence(0.05, means)
                   + kullback_leibler_divergence(1 - 0.05, 1 - means))
In this example, 0.01 is the regularization weight, and 0.05 is the sparsity target. Then use it like this:
keras.layers.Dense(32, activation="sigmoid",
                   activity_regularizer=kl_divergence_regularizer)
For example, this would be the encoding layer of a sparse autoencoder.
Note that kullback_leibler_divergence expects all the class probabilities, even in the case of binary classification (giving just the positive-class probability is not enough). This is why we compute the KLD for both 0.05 and 1 - 0.05 in the function above.
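A quick numeric check makes this concrete. Below is a NumPy version of the Keras KLD (the mean activity 0.2 is a hypothetical value for illustration); the positive-class term alone can even be negative, and only adding the complementary term yields the full binary KL divergence:

```python
import numpy as np

EPSILON = 1e-7  # stand-in for K.epsilon()

def kld(y_true, y_pred):
    # NumPy mirror of Keras' kullback_leibler_divergence.
    y_true = np.clip(y_true, EPSILON, 1)
    y_pred = np.clip(y_pred, EPSILON, 1)
    return np.sum(y_true * np.log(y_true / y_pred), axis=-1)

target, mean_activity = 0.05, 0.2  # sparsity target vs. observed mean

# Positive-class term alone: not a valid divergence on its own.
partial = kld(np.array([target]), np.array([mean_activity]))
# Adding the complementary (negative-class) term gives the full binary KLD.
full = partial + kld(np.array([1 - target]), np.array([1 - mean_activity]))
print(partial, full)
```

Only the full quantity is guaranteed non-negative, which is what makes it usable as a sparsity penalty.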