I've just trained an LSTM language model using PyTorch. The main body of the class is this:
import torch
import torch.nn as nn

class LM(nn.Module):
    def __init__(self, n_vocab,
                 seq_size,
                 embedding_size,
                 lstm_size,
                 pretrained_embed):
        super(LM, self).__init__()
        self.seq_size = seq_size
        self.lstm_size = lstm_size
        self.embedding = nn.Embedding.from_pretrained(pretrained_embed, freeze=True)
        self.lstm = nn.LSTM(embedding_size,
                            lstm_size,
                            batch_first=True)
        self.fc = nn.Linear(lstm_size, n_vocab)

    def forward(self, x, prev_state):
        embed = self.embedding(x)
        output, state = self.lstm(embed, prev_state)
        logits = self.fc(output)
        return logits, state
Now I want to write a function which calculates how good a sentence is, based on the trained language model (some score like perplexity, etc.).
I'm a bit confused and I don't know how I should calculate this.
A similar sample would be of great use.
Perplexity is usually calculated with log base e, as in perplexity = e**(sum(losses) / num_tokenized_tokens), following the convention of recent deep learning frameworks.
To understand the relationship between cross-entropy and perplexity: in general, for a model M, Perplexity(M) = 2^entropy(M), where the entropy is the average negative log2-probability per token. For example, if the model assigns probabilities 1, 0.5 and 1 to three tokens, then
l = (np.log2(1) + np.log2(0.5) + np.log2(1)) / 3 = -0.3333
and the perplexity is np.power(2, -l) = 1.26.
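As a quick sanity check of that arithmetic, here is a small standalone NumPy sketch (the probabilities 1, 0.5 and 1 are just the ones from the example above, not from any real model); it also shows that the e-based formula above gives the same perplexity:
import numpy as np

# per-token probabilities assigned by some model to a 3-token sentence
probs = np.array([1.0, 0.5, 1.0])

# average log-likelihood in bits (base 2)
l = np.mean(np.log2(probs))                      # -0.3333
pp_base2 = np.power(2.0, -l)                     # 2**(1/3) ~= 1.26

# same perplexity via natural log, matching perplexity = e**(sum(losses) / num_tokens)
losses = -np.log(probs)                          # per-token negative log-likelihoods (nats)
pp_base_e = np.exp(losses.sum() / len(losses))   # ~= 1.26 as well

print(pp_base2, pp_base_e)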
When using cross-entropy loss, you can simply apply the exponential function torch.exp() to your loss to calculate perplexity (PyTorch's cross-entropy is computed with the natural log, so exp is the matching base).
So here is just a dummy example:
import torch
import torch.nn.functional as F
num_classes = 10
batch_size = 1
# your model outputs / logits
output = torch.rand(batch_size, num_classes)
# your targets
target = torch.randint(num_classes, (batch_size,))
# getting loss using cross entropy
loss = F.cross_entropy(output, target)
# calculating perplexity
perplexity = torch.exp(loss)
print('Loss:', loss, 'PP:', perplexity)
In my case the output is:
Loss: tensor(2.7935) PP: tensor(16.3376)
Just be aware that if you want the per-word perplexity, you need the per-word losses as well (see the sketch below).
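Putting this together with the LM class from the question, a sentence-scoring helper could look roughly like the sketch below. It is only a sketch under a few assumptions: word_to_ix is whatever word-to-index mapping was used during training (hypothetical here), the initial hidden and cell states are zeros, and reduction='none' in F.cross_entropy gives the per-token losses mentioned above:
import torch
import torch.nn.functional as F

def sentence_perplexity(model, sentence, word_to_ix, device='cpu'):
    # sentence: a list of tokens; word_to_ix: hypothetical vocab mapping used at training time
    ids = [word_to_ix[w] for w in sentence]
    x = torch.tensor(ids, device=device).unsqueeze(0)    # shape (1, seq_len)

    model.eval()
    with torch.no_grad():
        # zero initial state for the single-layer LSTM: (num_layers, batch, lstm_size)
        h0 = torch.zeros(1, 1, model.lstm_size, device=device)
        c0 = torch.zeros(1, 1, model.lstm_size, device=device)

        # predict token t+1 from tokens up to t
        logits, _ = model(x[:, :-1], (h0, c0))            # (1, seq_len-1, n_vocab)
        targets = x[:, 1:]                                # (1, seq_len-1)

        # per-token cross-entropy (natural log), one loss value per predicted token
        losses = F.cross_entropy(logits.reshape(-1, logits.size(-1)),
                                 targets.reshape(-1),
                                 reduction='none')

        # sentence perplexity = exp(average per-token loss); lower is better
        return torch.exp(losses.mean()).item()
Lower perplexity means the model considers the sentence more likely; if you prefer a log-likelihood score instead, return -losses.sum().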
Here is a neat language-model example that also computes the perplexity from the output, which might be interesting to look at:
https://github.com/yunjey/pytorch-tutorial/blob/master/tutorials/02-intermediate/language_model/main.py#L30-L50