New posts in huggingface-transformers

Can we calculate feature importance in Huggingface Bert?

How to properly save the finetuned transformer model in safetensors without losing frozen parameters?

Multiple training with huggingface transformers will give exactly the same result except for the first time

ImportError: cannot import name 'LLaMATokenizer' from 'transformers'

Llama-2 7B-hf repeats context of question directly from input prompt, cuts off with newlines

How to load a WordLevel Tokenizer trained with tokenizers in transformers

With the HuggingFace transformer, how can I return multiple samples when generating text?

How to read a BERT attention weight matrix?

HuggingFace - model.generate() is extremely slow when I load from checkpoint

OSError when loading tokenizer for huggingface model

What's the meaning of "Using bos_token, but it is not set yet."?

Implementation (and working) differences between AutoModelForCausalLMWithValueHead vs AutoModelForCausalLM?

ERROR: file:///content does not appear to be a Python project: neither 'setup.py' nor 'pyproject.toml' found

How to use a batch size bigger than zero in Bert Sequence Classification

RemoteEntryNotFoundError with downloading models from Hugging Face in Kaggle

How to remove input from generated text in GPTNeo?