I am trying to run this example from the Hugging Face website: https://huggingface.co/transformers/task_summary.html. It seems that the model returns two strings instead of logits, and that leads to an error thrown by torch.argmax():
from transformers import AutoTokenizer, AutoModelForQuestionAnswering
import torch
tokenizer = AutoTokenizer.from_pretrained("bert-large-uncased-whole-word-masking-finetuned-squad")
model = AutoModelForQuestionAnswering.from_pretrained("bert-large-uncased-whole-word-masking-finetuned-squad", return_dict=True)
text = r"""🤗 Transformers (formerly known as pytorch-transformers and pytorch-pretrained-bert) provides general-purpose
architectures (BERT, GPT-2, RoBERTa, XLM, DistilBert, XLNet…) for Natural Language Understanding (NLU) and Natural
Language Generation (NLG) with over 32+ pretrained models in 100+ languages and deep interoperability between
TensorFlow 2.0 and PyTorch.
"""
questions = [
    "How many pretrained models are available in 🤗 Transformers?",
    "What does 🤗 Transformers provide?",
    "🤗 Transformers provides interoperability between which frameworks?",
]
for question in questions:
    inputs = tokenizer(question, text, add_special_tokens=True, return_tensors="pt")
    input_ids = inputs["input_ids"].tolist()[0]  # the list of all indices of words in question + context
    text_tokens = tokenizer.convert_ids_to_tokens(input_ids)  # get the tokens for the question + context
    answer_start_scores, answer_end_scores = model(**inputs)
    answer_start = torch.argmax(answer_start_scores)  # get the most likely beginning of answer with the argmax of the score
    answer_end = torch.argmax(answer_end_scores) + 1  # get the most likely end of answer with the argmax of the score
    answer = tokenizer.convert_tokens_to_string(tokenizer.convert_ids_to_tokens(input_ids[answer_start:answer_end]))
    print(f"Question: {question}")
    print(f"Answer: {answer}")
You can use the same tokenizer for all of the various BERT models that Hugging Face provides. Since BERT can only take 512 tokens as input at a time, you should set the truncation parameter to True for longer inputs. The add_special_tokens parameter tells the tokenizer to add BERT's special tokens, such as the classification [CLS] and separator [SEP] tokens.
The base BERT model outputs two things: last_hidden_state contains the hidden representations for each token in each sequence of the batch, so its size is (batch_size, seq_len, hidden_size); pooler_output contains a "representation" of each sequence in the batch and is of size (batch_size, hidden_size). For the BERT family of models, the pooler output is the classification ([CLS]) token's hidden state after processing through a linear layer and a tanh activation; that linear layer's weights are trained on the next sentence prediction (classification) objective during pretraining.
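To make those shapes concrete, here is a minimal sketch (assuming the plain bert-base-uncased checkpoint loaded through AutoModel, so the base model's two outputs are exposed):

from transformers import AutoTokenizer, AutoModel
import torch

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

# truncation=True guards against inputs longer than BERT's 512-token limit
inputs = tokenizer("🤗 Transformers is great!", truncation=True, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

print(outputs.last_hidden_state.shape)  # (batch_size, seq_len, hidden_size), e.g. torch.Size([1, 9, 768])
print(outputs.pooler_output.shape)      # (batch_size, hidden_size), e.g. torch.Size([1, 768])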
Since one of the recent updates, the models now return task-specific output objects (which behave like dictionaries) instead of plain tuples. The site you used has not been updated to reflect that change. You can either force the model to return a tuple by specifying return_dict=False:
answer_start_scores, answer_end_scores = model(**inputs, return_dict=False)
or you can extract the values from the QuestionAnsweringModelOutput object by calling its values() method (note that this relies on the field order, so it only works as long as start_logits and end_logits are the first two values in the output):
answer_start_scores, answer_end_scores = model(**inputs).values()
or you can use the QuestionAnsweringModelOutput object directly and access its fields by name:
outputs = model(**inputs)
answer_start_scores = outputs.start_logits
answer_end_scores = outputs.end_logits
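Putting it together, a minimal corrected version of the loop from the question, using the attribute access shown above:

for question in questions:
    inputs = tokenizer(question, text, add_special_tokens=True, return_tensors="pt")
    input_ids = inputs["input_ids"].tolist()[0]
    outputs = model(**inputs)
    # read the named logits instead of tuple-unpacking the output object
    answer_start = torch.argmax(outputs.start_logits)
    answer_end = torch.argmax(outputs.end_logits) + 1
    answer = tokenizer.convert_tokens_to_string(
        tokenizer.convert_ids_to_tokens(input_ids[answer_start:answer_end]))
    print(f"Question: {question}")
    print(f"Answer: {answer}")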