Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in huggingface

Which HuggingFace summarization models support more than 1024 tokens? Which model is more suitable for programming related articles?

How to know if HuggingFace's pipeline text input exceeds 512 tokens

NameError: name 'init_empty_weights' is not defined while using hugging face models

How to determine the value of early_stopping_patience in HuggingFace's Seq2SeqTrainer EarlyStoppingCallback?

Transformers tokenizer attention mask for pytorch

Does Huggingface's "resume_from_checkpoint" work?

How does one set the pad token correctly (not to eos) during fine-tuning to avoid model not predicting EOS?

Issues when using HuggingFace `accelerate` with `fp16`

How to use Huggingface Trainer with multiple GPUs?

How to fix nsfw error for stable diffusion?

Tokenizer.from_file() HUGGINFACE : Exception: data did not match any variant of untagged enum ModelWrapper

How to Load a 4-bit Quantized VLM Model from Hugging Face with Transformers?

Loading checkpoint shards takes too long

Loading Google Flan models through Langchain HuggingFacePipeline throws error of unrecognized configuration class for AutoModelForCausalLM

huggingface py-langchain

How do I slim down SBERT's sentencer-transformer library?