Saving and reloading a Hugging Face fine-tuned transformer

I am trying to reload a fine-tuned DistilBertForTokenClassification model. I am using transformers 3.4.0 and PyTorch 1.6.0+cu101. After training the downloaded model with the Trainer, I save it with trainer.save_model(); while troubleshooting, I also save it to a different directory with model.save_pretrained(). I am using Google Colab and saving the model to my Google Drive. After training, I evaluated the model on my test set and got great results. However, when I return to the notebook (or factory-restart the Colab notebook) and try to reload the model, the predictions are terrible. When I check the directories, config.json is there, as is pytorch_model.bin. Below is the full code.

from transformers import DistilBertForTokenClassification

# load the pretrained model from huggingface
#model = DistilBertForTokenClassification.from_pretrained('distilbert-base-cased', num_labels=len(uniq_labels))
model = DistilBertForTokenClassification.from_pretrained('distilbert-base-uncased', num_labels=len(uniq_labels)) 

model.to('cuda');

from transformers import Trainer, TrainingArguments

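# model_dir must be defined before it is used in the paths below
model_dir = '/content/drive/My Drive/Colab Notebooks/models/'

training_args = TrainingArguments(
    output_dir = model_dir +  'mitmovie_pt_distilbert_uncased/results',          # output directory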
    #overwrite_output_dir = True,
    evaluation_strategy='epoch',
    num_train_epochs=3,              # total number of training epochs
    per_device_train_batch_size=16,  # batch size per device during training
    per_device_eval_batch_size=64,   # batch size for evaluation
    warmup_steps=500,                # number of warmup steps for learning rate scheduler
    weight_decay=0.01,               # strength of weight decay
    logging_dir = model_dir +  'mitmovie_pt_distilbert_uncased/logs',            # directory for storing logs
    logging_steps=10,
    load_best_model_at_end = True
)

trainer = Trainer(
    model = model,                         # the instantiated 🤗 Transformers model to be trained
    args = training_args,                  # training arguments, defined above
    train_dataset = train_dataset,         # training dataset
    eval_dataset = test_dataset             # evaluation dataset
)

trainer.train()

trainer.evaluate()

model_dir = '/content/drive/My Drive/Colab Notebooks/models/'
trainer.save_model(model_dir + 'mitmovie_pt_distilbert_uncased/model')

# alternative saving method and folder
model.save_pretrained(model_dir + 'distilbert_testing')

Coming back to the notebook after restarting...

from transformers import DistilBertForTokenClassification, DistilBertConfig, AutoModelForTokenClassification

# retrieve the saved model
model = DistilBertForTokenClassification.from_pretrained(model_dir + 'mitmovie_pt_distilbert_uncased/model', 
                                                        local_files_only=True)

model.to('cuda')

The predictions are now terrible when loading from either directory. The model does run and outputs the number of classes I would expect, so it appears that the trained weights either were not saved or are not being loaded.
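One way to narrow this down, before suspecting the loading code, is to confirm that the files written to Drive are actually present and complete. The sketch below uses only the standard library. The helper names check_saved_model and print_report are hypothetical, and the expected file names are simply the two mentioned in the question; other transformers versions may write different weight files.

```python
import os

# File names taken from the question; adjust this tuple if your
# transformers version writes different files.
EXPECTED_FILES = ("config.json", "pytorch_model.bin")


def check_saved_model(save_dir, expected=EXPECTED_FILES):
    """Return {file name: size in bytes, or None if the file is missing}."""
    report = {}
    for name in expected:
        path = os.path.join(save_dir, name)
        report[name] = os.path.getsize(path) if os.path.isfile(path) else None
    return report


def print_report(save_dir):
    """Print one line per expected file so missing or empty files stand out."""
    for name, size in check_saved_model(save_dir).items():
        if size is None:
            status = "MISSING"
        elif size == 0:
            status = "EMPTY"
        else:
            status = f"{size / 1e6:.1f} MB"
        print(f"{name}: {status}")
```

Calling print_report(model_dir + 'mitmovie_pt_distilbert_uncased/model') and print_report(model_dir + 'distilbert_testing') after a restart would show whether both weight files survived. A DistilBERT-base weight file is typically a few hundred megabytes, so a much smaller size would suggest an incomplete write rather than a loading problem.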

asked Nov 03 '20 13:11

Nate

1 Answer

Have you tried loading the model that the Trainer saved in this folder:

mitmovie_pt_distilbert_uncased/results

The Hugging Face Trainer saves the model directly to the output_dir defined in TrainingArguments.
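During training, the Trainer writes its intermediate checkpoints into subfolders of output_dir named checkpoint-<step>, so the folder to load from may be one level below results. A minimal sketch for locating the most recent one, assuming that naming scheme; the helper name latest_checkpoint is hypothetical and not part of transformers:

```python
import os
import re

# Matches folder names such as "checkpoint-500" and captures the step number.
_CHECKPOINT_RE = re.compile(r"^checkpoint-(\d+)$")


def latest_checkpoint(output_dir):
    """Return the path of the checkpoint-<step> subfolder with the highest
    step number, or None if output_dir contains no such subfolder."""
    best_step, best_path = -1, None
    for name in os.listdir(output_dir):
        match = _CHECKPOINT_RE.match(name)
        path = os.path.join(output_dir, name)
        if match and os.path.isdir(path) and int(match.group(1)) > best_step:
            best_step, best_path = int(match.group(1)), path
    return best_path
```

The returned path could then be passed to DistilBertForTokenClassification.from_pretrained() in place of the model folder used in the question. Note that the latest checkpoint is not necessarily the best one; this only picks the highest step number.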

answered Oct 08 '22 14:10

André Soblechero

Tags: python, pytorch, huggingface-transformers
