I am using keras with a custom loss function like below: <pre class="prettyprint"><code>def custom_fn(y_true, y_pred): # changing y_true, y_pred values systematically return mean_absolute_percentage_error(y_true, y_pred) </code></pre> Then I am calling <code>model.compile(loss=custom_fn)</code> and <code>model.fit(X, y,..validation_data=(X_val, y_val)..)</code> Keras is then saving <code>loss</code> and <code>val_loss</code> in model history. As a sanity check, when the model finishes training, I am using <code>model.predict(X_val)</code> so I can calculate validation loss manually with my <code>custom_fn</code> using the trained model. I am saving the model with the best epoch using this callback: <pre class="prettyprint"><code>callbacks.append(ModelCheckpoint(path, save_best_only=True, monitor='val_loss', mode='min')) </code></pre> so after calculating this, the validation loss should match keras' <code>val_loss</code> value of the best epoch. But this is not happening. As another attempt to figure this issue out, I am also doing this: <pre class="prettyprint"><code> model.compile(loss=custom_fn, metrics=[custom_fn]) </code></pre> And to my surprise, <code>val_loss</code> and <code>val_custom_fn</code> do not match (neither <code>loss</code> or <code>loss_custom_fn</code> for that matter). This is really strange, my <code>custom_fn</code> is essentially keras' built in <code>mape</code> with the <code>y_true</code> and <code>y_pred</code> slightly manipulated. what is going on here? PS: the layers I am using are <code>LSTM</code> layers and a final <code>Dense</code> layer. But I think this information is not relevant to the problem. I am also using regularisation as hyperparameter but not dropout. <h3>Update</h3> Even removing <code>custom_fn</code> and using keras' built in <code>mape</code> as a loss function and metric like so: <pre class="prettyprint"><code>model.compile(loss='mape', metrics=['mape']) </code></pre> and for simplicity, removing <code>ModelCheckpoint</code> callback is having the same effect; <code>val_loss</code> and <code>val_mape</code> for each epoch are not equivalent. This is extremely strange to me. I am either missing something or there is a bug in Keras code..the former might be more realistic.

This blog post suggests that keras adds any regularisation used in the training when calculating the validation loss. And obviously, when calculating the metric of choice no regularisation is applied. This is why it occurs with any loss function of choice as stated in the question. This is something I could not find any documentation on from Keras. However, it seems to hold up since when I remove all regularisation hyperparameters, the <code>val_loss</code> and <code>val_custom_fn</code> match exactly in each epoch. An easy workaround is to either use the <code>custom_fn</code> as a metric and save the best model based on the metric (<code>val_custom_fn</code>) than on the <code>val_loss</code>. Or else Loop through each epoch manually and calculate the correct <code>val_loss</code> manually after training each epoch. The latter seems to make more sense since there is no reason to include <code>custom_fn</code> both as a metric and as a loss function. If anyone can find any evidence of this in the Keras documentation that would be helpful.

Keras loss and metrics values do not match with same function in each

Tags:

python

tensorflow

deep-learning

keras

I am using keras with a custom loss function like below:

def custom_fn(y_true, y_pred):
   # changing y_true, y_pred values systematically
   return mean_absolute_percentage_error(y_true, y_pred)

Then I am calling model.compile(loss=custom_fn) and model.fit(X, y,..validation_data=(X_val, y_val)..)

Keras is then saving loss and val_loss in model history. As a sanity check, when the model finishes training, I am using model.predict(X_val) so I can calculate validation loss manually with my custom_fn using the trained model.

I am saving the model with the best epoch using this callback:

callbacks.append(ModelCheckpoint(path, save_best_only=True, monitor='val_loss', mode='min'))

so after calculating this, the validation loss should match keras' val_loss value of the best epoch. But this is not happening.

As another attempt to figure this issue out, I am also doing this:

    model.compile(loss=custom_fn, metrics=[custom_fn])

And to my surprise, val_loss and val_custom_fn do not match (neither loss or loss_custom_fn for that matter).

This is really strange, my custom_fn is essentially keras' built in mape with the y_true and y_pred slightly manipulated. what is going on here?

PS: the layers I am using are LSTM layers and a final Dense layer. But I think this information is not relevant to the problem. I am also using regularisation as hyperparameter but not dropout.

Update

Even removing custom_fn and using keras' built in mape as a loss function and metric like so:

model.compile(loss='mape', metrics=['mape'])

and for simplicity, removing ModelCheckpoint callback is having the same effect; val_loss and val_mape for each epoch are not equivalent. This is extremely strange to me. I am either missing something or there is a bug in Keras code..the former might be more realistic.

781

asked Aug 18 '20 08:08

bcsta

Video Answer

1 Answers

This blog post suggests that keras adds any regularisation used in the training when calculating the validation loss. And obviously, when calculating the metric of choice no regularisation is applied. This is why it occurs with any loss function of choice as stated in the question.

This is something I could not find any documentation on from Keras. However, it seems to hold up since when I remove all regularisation hyperparameters, the val_loss and val_custom_fn match exactly in each epoch.

An easy workaround is to either use the custom_fn as a metric and save the best model based on the metric (val_custom_fn) than on the val_loss. Or else Loop through each epoch manually and calculate the correct val_loss manually after training each epoch. The latter seems to make more sense since there is no reason to include custom_fn both as a metric and as a loss function.

If anyone can find any evidence of this in the Keras documentation that would be helpful.

181

answered Sep 21 '22 10:09

bcsta

Related questions
                            
                                NotImplementedError: Learning rate schedule must override get_config
                            
                                python asyncio - RuntimeError: await wasn't used with future
                            
                                Pytorch RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
                            
                                Ansible no longer works
                            
                                how to change string matrix to a integer matrix
                            
                                Pip install fails with “connection error" ssl problem
                            
                                Get current zoom and center from mapbox in dash
                            
                                Automatically refactor python lambdas to named functions
                            
                                How to fix "WARNING: Hidden import "pygame._view" not found!" when converting .py to .exe using PyInstaller?
                            
                                How can I copy DataFrames with datetimes from Stack Overflow into Python?
                            
                                Can't use Image.putalpha() on a png file from PIL lib. OSError: cannot write mode PA as PNG
                            
                                Write a readable test-case for a diff which includes "\n"
                            
                                Bot only takes one command
                            
                                Python 3.6 type hinting for a function accepting generic class type and instance type of the same generic type
                            
                                How do I make a circular tree with multiple root trees
                            
                                How to implement single sign-on django auth in azure ad?
                            
                                Shift "nan" to the beginning of an array in python [duplicate]
                            
                                To what extent does Google Colab support Python typing?
                            
                                Python Turtle Write Value in Containing Box
                            
                                What form of imports should I use in __main__.py and then how should I run the project?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Keras loss and metrics values do not match with same function in each

Tags:

python

tensorflow

deep-learning

keras

Update

bcsta

People also ask

Video Answer

1 Answers

bcsta

Recent Activity

Donate For Us