
KL Divergence for two probability distributions in PyTorch

I have two probability distributions. How should I find the KL-divergence between them in PyTorch? The regular cross entropy only accepts integer labels.

asked Apr 17 '18 by Mojtaba Komeili


People also ask

What is the KL divergence between two equal distributions?

The Kullback-Leibler Divergence score, or KL divergence score, quantifies how much one probability distribution differs from another probability distribution. The KL divergence between two distributions Q and P is often stated using the following notation: KL(P || Q)
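To answer the question directly: when the two distributions are identical, every term p_i * log(p_i / q_i) is zero, so the divergence is exactly 0. A minimal plain-Python sketch (the kl helper below is illustrative, not from the post):

```python
import math

def kl(p, q):
    # KL(P || Q) = sum_i p_i * log(p_i / q_i); terms with p_i = 0 contribute 0
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

p = [0.36, 0.48, 0.16]
print(kl(p, p))  # 0.0: identical distributions have zero divergence
```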

What is Log_prob in Pytorch?

log_prob(value) returns the log of the probability density/mass function evaluated at value, where value is a Tensor. Separately, the mean property returns the mean of the distribution.
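For illustration (a hedged sketch using torch.distributions with a Categorical distribution; the specific numbers are not from the original excerpt), log_prob returns the log of the probability assigned to a given outcome:

```python
import math
import torch
from torch.distributions import Categorical

# A categorical distribution over three outcomes
dist = Categorical(probs=torch.tensor([0.36, 0.48, 0.16]))

# log_prob(value) returns log P(X = value)
lp = dist.log_prob(torch.tensor(1))
print(lp.item())  # close to math.log(0.48), about -0.734
```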

Is the Kullback-Leibler divergence a distance metric?

Although the KL divergence measures the "distance" between two distributions, it is not a distance metric, because it is not symmetric: the KL divergence from p(x) to q(x) is generally not the same as the KL divergence from q(x) to p(x).
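The asymmetry is easy to check numerically; here is a plain-Python sketch (the kl helper and the uniform Q are illustrative choices, not from the post):

```python
import math

def kl(p, q):
    # KL(P || Q) = sum_i p_i * log(p_i / q_i)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

p = [0.36, 0.48, 0.16]
q = [1/3, 1/3, 1/3]

print(kl(p, q))  # about 0.0853
print(kl(q, p))  # about 0.0975: a different value, so KL is not symmetric
```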

Is KL divergence same as cross-entropy?

KL divergence is the relative entropy: the difference between the cross-entropy of the actual and predicted probability distributions and the entropy of the actual distribution. It is equal to 0 when the predicted probability distribution is the same as the actual one.
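That relationship, KL(P || Q) = H(P, Q) - H(P), can be verified directly in plain Python (the two distributions below are illustrative):

```python
import math

p = [0.36, 0.48, 0.16]   # actual distribution
q = [0.30, 0.50, 0.20]   # predicted distribution

entropy = -sum(pi * math.log(pi) for pi in p)                     # H(P)
cross_entropy = -sum(pi * math.log(qi) for pi, qi in zip(p, q))   # H(P, Q)
kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))          # KL(P || Q)

print(abs(kl - (cross_entropy - entropy)))  # ~0: the identity holds
```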


2 Answers

Yes, PyTorch has a function named kl_div under torch.nn.functional to directly compute KL-divergence between tensors. Note that it expects the first argument to contain log-probabilities and the second to contain probabilities. Suppose you have tensors a and b of the same shape, where a holds log-probabilities and b holds probabilities. You can use the following code:

import torch.nn.functional as F
out = F.kl_div(a, b)  # computes KL(b || exp(a)) with the default 'mean' reduction

For more details, see the above method documentation.
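In practice the first argument must be log-probabilities; here is a hedged sketch of typical usage (the shapes and reduction='batchmean' are illustrative assumptions, not from the answer):

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# F.kl_div expects log-probabilities first and (by default) probabilities second
log_q = F.log_softmax(torch.randn(4, 3), dim=1)   # predicted log-probabilities
p = F.softmax(torch.randn(4, 3), dim=1)           # target probabilities

# 'batchmean' sums over elements and divides by batch size, matching
# the mathematical definition of KL divergence per sample
loss = F.kl_div(log_q, p, reduction='batchmean')
print(loss)  # a non-negative scalar tensor
```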

answered Sep 27 '22 by jdhao

Note that F.kl_div does not match the formula in the Wikipedia article directly: its first argument takes log-probabilities, and the argument order is reversed relative to the usual KL(P || Q) notation.

I use the following:

import torch
import torch.nn.functional as F

# this is the same example as in the wiki article
P = torch.Tensor([0.36, 0.48, 0.16])
Q = torch.Tensor([0.333, 0.333, 0.333])

# direct computation of KL(P || Q)
(P * (P / Q).log()).sum()
# tensor(0.0863), 10.2 µs ± 508 ns

# kl_div expects log-probabilities first, target probabilities second
F.kl_div(Q.log(), P, None, None, 'sum')
# tensor(0.0863), 14.1 µs ± 408 ns

Compared to kl_div, the direct computation is even faster here.
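For completeness (not part of the original answer), torch.distributions provides a higher-level kl_divergence. Note that Categorical normalizes its probs, so the 0.333 entries become exact thirds and the result is slightly below the 0.0863 computed above:

```python
import torch
from torch.distributions import Categorical, kl_divergence

P = Categorical(probs=torch.tensor([0.36, 0.48, 0.16]))
Q = Categorical(probs=torch.tensor([0.333, 0.333, 0.333]))  # normalized to exact thirds

kl_pq = kl_divergence(P, Q)
print(kl_pq)  # roughly tensor(0.0853)
```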

answered Sep 27 '22 by hantian_pang