Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in huggingface-transformers

AttributeError: 'TrainingArguments' object has no attribute 'model_init_kwargs'

What is the loss function used in Trainer from the Transformers library of Hugging Face?

Difference between AutoModelForSeq2SeqLM and AutoModelForCausalLM

Continual pre-training vs. Fine-tuning a language model with MLM

How to use fine-tuned model in huggingface for actual prediction after re-loading?

HuggingFace AutoModelForCasualLM "decoder-only architecture" warning, even after setting padding_side='left'

How to understand hidden_states of the returns in BertModel?

BERT - Is that needed to add new tokens to be trained in a domain specific environment?

Sentence embeddings from LLAMA 2 Huggingface opensource

Longformer get last_hidden_state

Strange results with huggingface transformer[marianmt] translation of larger text