I am fine tuning a BERT model for a multiclass classification task. My problem is that I don't know how to add "early stopping" to those Trainer instances. Any ideas?

There are a couple of modifications you need to perform, prior to correctly using the <code>EarlyStoppingCallback()</code>. <pre class="prettyprint"><code>from transformers import EarlyStoppingCallback, IntervalStrategy ... ... # Defining the TrainingArguments() arguments args = TrainingArguments( f"training_with_callbacks", evaluation_strategy = IntervalStrategy.STEPS, # "steps" eval_steps = 50, # Evaluation and Save happens every 50 steps save_total_limit = 5, # Only last 5 models are saved. Older ones are deleted. learning_rate=2e-5, per_device_train_batch_size=batch_size, per_device_eval_batch_size=batch_size, num_train_epochs=5, weight_decay=0.01, push_to_hub=False, metric_for_best_model = 'f1', load_best_model_at_end=True) </code></pre> You need to: <ol> <li>Use <code>load_best_model_at_end = True</code> (<code>EarlyStoppingCallback()</code> requires this to be <code>True</code>).</li> <li> <code>evaluation_strategy</code> = <code>'steps'</code> or <code>IntervalStrategy.STEPS</code> instead of <code>'epoch'</code>.</li> <li> <code>eval_steps = 50</code> (evaluate the metrics after <code>N steps</code>).</li> <li> <code>metric_for_best_model = 'f1'</code>,</li> </ol> In your <code>Trainer()</code>: <pre class="prettyprint"><code>trainer = Trainer( model, args, ... compute_metrics=compute_metrics, callbacks = [EarlyStoppingCallback(early_stopping_patience=3)] ) </code></pre> Of course, when you use <code>compute_metrics()</code>, for example it can be a function like: <pre class="prettyprint"><code>def compute_metrics(p): pred, labels = p pred = np.argmax(pred, axis=1) accuracy = accuracy_score(y_true=labels, y_pred=pred) recall = recall_score(y_true=labels, y_pred=pred) precision = precision_score(y_true=labels, y_pred=pred) f1 = f1_score(y_true=labels, y_pred=pred) return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1} </code></pre> The return of the <code>compute_metrics()</code> should be a dictionary and you can access whatever metric you want/compute inside the function and return. Note: In newer <code>transformers</code> version, the usage of <code>Enum</code> <code>IntervalStrategy.steps</code> is recommended (see <code>TrainingArguments()</code>) instead of plain <code>steps</code> string, the latter being soon subject to deprecation.

Early stopping in Bert Trainer instances

Video Answer

1 Answers

There are a couple of modifications you need to perform, prior to correctly using the EarlyStoppingCallback().

Click to copy

from transformers import EarlyStoppingCallback, IntervalStrategy
...
...
# Defining the TrainingArguments() arguments
args = TrainingArguments(
   f"training_with_callbacks",
   evaluation_strategy = IntervalStrategy.STEPS, # "steps"
   eval_steps = 50, # Evaluation and Save happens every 50 steps
   save_total_limit = 5, # Only last 5 models are saved. Older ones are deleted.
   learning_rate=2e-5,
   per_device_train_batch_size=batch_size,
   per_device_eval_batch_size=batch_size,
   num_train_epochs=5,
   weight_decay=0.01,
   push_to_hub=False,
   metric_for_best_model = 'f1',
   load_best_model_at_end=True)

You need to:

Use load_best_model_at_end = True (EarlyStoppingCallback() requires this to be True).
evaluation_strategy = 'steps' or IntervalStrategy.STEPS instead of 'epoch'.
eval_steps = 50 (evaluate the metrics after N steps).
metric_for_best_model = 'f1',

In your Trainer():

Click to copy

trainer = Trainer(
    model,
    args,
    ...
    compute_metrics=compute_metrics,
    callbacks = [EarlyStoppingCallback(early_stopping_patience=3)]
)

Of course, when you use compute_metrics(), for example it can be a function like:

Click to copy

def compute_metrics(p):    
    pred, labels = p
    pred = np.argmax(pred, axis=1)
    accuracy = accuracy_score(y_true=labels, y_pred=pred)
    recall = recall_score(y_true=labels, y_pred=pred)
    precision = precision_score(y_true=labels, y_pred=pred)
    f1 = f1_score(y_true=labels, y_pred=pred)    
return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}

The return of the compute_metrics() should be a dictionary and you can access whatever metric you want/compute inside the function and return.

Note: In newer transformers version, the usage of Enum IntervalStrategy.steps is recommended (see TrainingArguments()) instead of plain steps string, the latter being soon subject to deprecation.

137

answered Oct 26 '22 07:10

Timbus Calin

Related questions
                            
                                Django get min and max value from PostgreSQL specific ArrayField holding IntegerField(s)
                            
                                How to raise every element of a vector to the power of every element of another vector?
                            
                                Cannot install pyaudio in google colab
                            
                                How to order an array and count it in Python?
                            
                                Software based on Python 3.9 is not working on Windows 7
                            
                                filter class/subfolder with pytorch ImageFolder
                            
                                Use lazy % formatting in logging functions pylint error message
                            
                                Numpy matrix multiplication but instead of multiplying it XOR's elements
                            
                                Julia symbolic and numeric performance vs Python
                            
                                Apply a function to each cell of a pandas dataframe using information from a particular column
                            
                                How to populate rows of pandas dataframe column based with previous row based on a multiple conditions?
                            
                                Pythonic way to apply multiple class methods to list of objects
                            
                                How to scatter randomly points on a sphere
                            
                                Seaborn scatterplot can't get hue_order to work
                            
                                Converting a recursion problem code from Python to Common Lisp
                            
                                Special text to latin characters in python
                            
                                model.predict_classes is deprecated - What to use instead?
                            
                                Why my Python code is extracting the same data for all the elements in my list?
                            
                                Spliting a list into n uneven buckets with all combinations
                            
                                Python - Is there a shorthand for [eg]: print(f'type(var) = {type(var)}')

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Early stopping in Bert Trainer instances

Tags:

python

neural-network

deep-learning

bert-language-model

huggingface-transformers

soulwreckedyouth

People also ask

Video Answer

1 Answers

Timbus Calin

Recent Activity

Donate For Us