I'm trying to calculate the probability, or some kind of score, for words in a sentence using NLP. I've tried this approach with the GPT-2 model using the Hugging Face Transformers library, but I couldn't get satisfactory results: because the model is unidirectional, it didn't seem to predict within context. So I was wondering whether there is a way to calculate this using BERT, since it's bidirectional.
I found this related post the other day, but it didn't have any answer that was useful for me either.
I hope to get some ideas or a solution for this. Any help is appreciated. Thank you.
The probability of each word depends on the n-1 words before it. For a trigram model (n = 3), for example, each word's probability depends on the 2 words immediately before it. This probability is estimated as the fraction of times the n-gram appears among all occurrences of its preceding (n-1)-gram in the training set.
The better our n-gram model is, the higher the probability it assigns, on average, to each word in the evaluation text. To get these counts, we can build an NgramCounter class that takes in a tokenized text file and stores the counts of all n-grams in that text, as sketched below.
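Here is a minimal sketch of such a counter, assuming whitespace-tokenized input; the class and method names (NgramCounter, count_file, probability) are illustrative, not from any particular library:

from collections import Counter

class NgramCounter:
    """Counts n-grams and their (n-1)-gram prefixes in a whitespace-tokenized text file."""

    def __init__(self, n):
        self.n = n
        self.ngram_counts = Counter()   # counts of full n-grams
        self.prefix_counts = Counter()  # counts of (n-1)-gram prefixes

    def count_file(self, path):
        with open(path, encoding="utf-8") as f:
            for line in f:
                # Pad each line with boundary markers so boundary words get contexts too.
                tokens = ["[EOS]"] * (self.n - 1) + line.split() + ["[EOS]"]
                for i in range(len(tokens) - self.n + 1):
                    ngram = tuple(tokens[i:i + self.n])
                    self.ngram_counts[ngram] += 1
                    self.prefix_counts[ngram[:-1]] += 1

    def probability(self, ngram):
        # Relative-frequency estimate: count(n-gram) / count(its (n-1)-gram prefix).
        prefix = tuple(ngram[:-1])
        if self.prefix_counts[prefix] == 0:
            return 0.0
        return self.ngram_counts[tuple(ngram)] / self.prefix_counts[prefix]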
The probability of a complete word sequence is calculated using the chain rule of probability. For the sentence "I do not like green eggs and ham" under a bigram model:
P(I do not like green eggs and ham) = P(I | [EOS]) * P(do | I) * P(not | do) * P(like | not) * P(green | like) * P(eggs | green) * P(and | eggs) * P(ham | and) * P([EOS] | ham)
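In code, that chain-rule score can be accumulated one conditional probability at a time. This sketch assumes the hypothetical counter above and works in log space to avoid numerical underflow:

import math

def sentence_log_probability(tokens, counter, n=2):
    """Chain-rule score of a sentence under an n-gram model (in log space)."""
    padded = ["[EOS]"] * (n - 1) + tokens + ["[EOS]"]
    log_p = 0.0
    for i in range(n - 1, len(padded)):
        p = counter.probability(padded[i - n + 1:i + 1])
        log_p += math.log(p) if p > 0 else float("-inf")  # unseen n-gram -> zero probability
    return log_p

# e.g. sentence_log_probability("I do not like green eggs and ham".split(), counter)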
BERT is trained as a masked language model, i.e., it is trained to predict tokens that were replaced by a [MASK] token.
import torch
from transformers import AutoTokenizer, BertForMaskedLM

tok = AutoTokenizer.from_pretrained("bert-base-cased")
bert = BertForMaskedLM.from_pretrained("bert-base-cased")

input_idx = tok.encode(f"The {tok.mask_token} were the best rock band ever.")
logits = bert(torch.tensor([input_idx]))[0]  # shape: (1, sequence_length, vocab_size)
prediction = logits[0].argmax(dim=1)         # most likely token id at every position
# Position 2 is the [MASK] (position 0 is [CLS], position 1 is "The").
print(tok.convert_ids_to_tokens(prediction[2].numpy().tolist()))
It prints token no. 11581 which is:
Beatles
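Continuing from the snippet above, if you want an actual probability for a candidate word at the masked position rather than just the argmax, one option (my own addition, not part of the original answer) is to softmax the logits at that position and look up the candidate's token id:

probs = torch.softmax(logits[0, 2], dim=-1)          # distribution over the vocabulary at the [MASK] position
candidate_id = tok.convert_tokens_to_ids("Beatles")  # works only if the word is a single token in BERT's vocab
print(probs[candidate_id].item())                    # probability BERT assigns to "Beatles" in this context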
The tricky thing is that words might be split into multiple subwords. You can simulate that by adding multiple [MASK] tokens, but then you have the problem of how to reliably compare the scores of predictions of different lengths. I would probably average the probabilities, but maybe there is a better way.
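As a rough sketch of that averaging idea (my own construction, not an established recipe), reusing tok and bert from above: mask every subword of the candidate word, then average the probabilities the model assigns to each gold subword at its own [MASK] position:

import torch

def word_score(prefix, word, suffix, tok, bert):
    """Average per-subword probability of `word` when all of its subwords are masked."""
    subword_ids = tok.encode(word, add_special_tokens=False)
    masked_text = prefix + " ".join([tok.mask_token] * len(subword_ids)) + suffix
    input_ids = tok.encode(masked_text, return_tensors="pt")
    with torch.no_grad():
        logits = bert(input_ids)[0]
    mask_positions = (input_ids[0] == tok.mask_token_id).nonzero(as_tuple=True)[0]
    probs = torch.softmax(logits[0, mask_positions], dim=-1)  # one distribution per [MASK]
    gold = probs[torch.arange(len(subword_ids)), torch.tensor(subword_ids)]
    return gold.mean().item()  # simple average; a geometric mean (mean of logs) is another option

# e.g. word_score("The ", "Beatles", " were the best rock band ever.", tok, bert)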