I have two tensors, prob_a and prob_b, with shape [None, 1000], and I want to compute the KL divergence from prob_a to prob_b. Is there a built-in function for this in TensorFlow? I tried using tf.contrib.distributions.kl(prob_a, prob_b), but it gives:

NotImplementedError: No KL(dist_a || dist_b) registered for dist_a type Tensor and dist_b type Tensor

If there is no built-in function, what would be a good workaround?
The KL divergence from P to Q is defined as

KL(P || Q) = sum_x p(x) * log(p(x) / q(x))

where P is the "true" distribution. Note that -sum_x p(x) * log q(x) is the cross entropy between P and Q, and KL(P || Q) is exactly that cross entropy minus the entropy of P, so a KL divergence loss can also be built from a cross-entropy loss. In TensorFlow, tf.distributions.kl_divergence() computes it directly; however, it may produce nan or inf values. To avoid this, clamp the tensor values away from zero before computing.
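To make the clamping concrete, here is a minimal sketch of computing the KL divergence by hand with tf.clip_by_value (TensorFlow 1.x, matching the answer below); the epsilon of 1e-10 is an arbitrary illustrative choice, not a value prescribed by TensorFlow:

import numpy as np
import tensorflow as tf

def kl_manual(p, q, eps=1e-10):
    # Clamp both distributions away from zero so tf.log stays finite.
    p = tf.clip_by_value(p, eps, 1.0)
    q = tf.clip_by_value(q, eps, 1.0)
    # KL(P || Q) = sum_x p(x) * (log p(x) - log q(x))
    #            = cross_entropy(P, Q) - entropy(P)
    return tf.reduce_sum(p * (tf.log(p) - tf.log(q)), axis=-1)

a = np.array([[0.25, 0.1, 0.65], [0.8, 0.15, 0.05]])
b = np.array([[0.7, 0.2, 0.1], [0.15, 0.8, 0.05]])

sess = tf.Session()
print(sess.run(kl_manual(a, b)))  # approximately [0.88995, 1.08808]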
Assuming that your input tensors prob_a and prob_b are probability tensors that sum to 1 along the last axis, you could do it like this:
def kl(x, y):
    # Wrap the probability tensors in Categorical distributions so the
    # registered closed-form KL implementation can be applied.
    X = tf.distributions.Categorical(probs=x)
    Y = tf.distributions.Categorical(probs=y)
    return tf.distributions.kl_divergence(X, Y)

result = kl(prob_a, prob_b)
A simple example:
import numpy as np
import tensorflow as tf
a = np.array([[0.25, 0.1, 0.65], [0.8, 0.15, 0.05]])
b = np.array([[0.7, 0.2, 0.1], [0.15, 0.8, 0.05]])
sess = tf.Session()
print(kl(a, b).eval(session=sess)) # [0.88995184 1.08808468]
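As a side note, tf.distributions was removed in TensorFlow 2.x; its distributions now live in the separate TensorFlow Probability package. Assuming tensorflow_probability is installed, a minimal TF 2.x sketch of the same computation would be:

import numpy as np
import tensorflow_probability as tfp

a = np.array([[0.25, 0.1, 0.65], [0.8, 0.15, 0.05]])
b = np.array([[0.7, 0.2, 0.1], [0.15, 0.8, 0.05]])

# TF 2.x runs eagerly, so no Session is needed.
X = tfp.distributions.Categorical(probs=a)
Y = tfp.distributions.Categorical(probs=b)
print(tfp.distributions.kl_divergence(X, Y).numpy())  # should match the values above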
You would get the same result with
np.sum(a * np.log(a / b), axis=1)
However, this implementation is a bit buggy (checked in TensorFlow 1.8.0). If you have zero probabilities in a, e.g. if you try [0.8, 0.2, 0.0] instead of [0.8, 0.15, 0.05], you will get nan, even though by the Kullback–Leibler definition the term 0 * log(0 / b) should contribute zero.
To mitigate this, one should add some small numerical constant. It is also prudent to use tf.distributions.kl_divergence(X, Y, allow_nan_stats=False) to cause a runtime error in such situations. Also, if there are zeros in b, you will get inf values, which won't be caught by the allow_nan_stats=False option, so those have to be handled as well.
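One possible way to handle both cases is to add a small constant to both tensors and renormalize before building the distributions. This is only a sketch under those assumptions; kl_safe and the epsilon of 1e-8 are illustrative choices, not part of the TensorFlow API:

def kl_safe(x, y, eps=1e-8):
    # Shift both distributions away from zero, then renormalize so each
    # row again sums to 1 along the last axis.
    x = tf.convert_to_tensor(x)
    y = tf.convert_to_tensor(y)
    x = (x + eps) / tf.reduce_sum(x + eps, axis=-1, keepdims=True)
    y = (y + eps) / tf.reduce_sum(y + eps, axis=-1, keepdims=True)
    X = tf.distributions.Categorical(probs=x)
    Y = tf.distributions.Categorical(probs=y)
    return tf.distributions.kl_divergence(X, Y, allow_nan_stats=False)

a0 = np.array([[0.8, 0.2, 0.0]])  # contains a zero: source of nan in plain kl()
b0 = np.array([[0.0, 0.2, 0.8]])  # zero where a0 is nonzero: source of inf in plain kl()
# Reusing sess from the example above.
print(kl_safe(a0, b0).eval(session=sess))  # finite value, no nan or inf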