I have defined a custom RMSE metric function:
def rmse(y_pred, y_true):
    return K.sqrt(K.mean(K.square(y_pred - y_true)))
I was evaluating it against the mean squared error provided by Keras:
keras.losses.mean_squared_error(y_true, y_pred)
The values I get for the MSE and RMSE metrics on the same predictions are:
mse: 115.7218 - rmse: 8.0966
Now, when I take the square root of the MSE, I get 10.7574, which is obviously higher than the value the custom RMSE function outputs. I haven't been able to figure out why this is so, nor have I found any related posts on this particular topic. Is there maybe a mistake in the RMSE function that I'm simply not seeing? Or is it somehow related to how Keras defines axis=-1 in the MSE function (the purpose of which I haven't fully understood yet)?
Here is where I invoke the RMSE and MSE:
model.compile(loss="mae", optimizer="adam", metrics=["mse", rmse])
So I would expect the root of MSE to be the same as the RMSE.
I originally asked this question on Cross Validated but it was put on hold as off-topic.
RMSE is the square root of MSE. MSE is measured in units that are the square of the target variable, while RMSE is measured in the same units as the target variable. Due to its formulation, MSE, just like the squared loss function that it derives from, effectively penalizes larger errors more severely.
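For illustration (the numbers here are made up, not from the original post), a small NumPy example shows the units relationship between the two metrics and how squaring magnifies large errors:

```python
import numpy as np

# Hypothetical residuals, in the same units as the target variable
errors = np.array([1.0, 2.0, 10.0])

mse = np.mean(errors ** 2)   # squared units of the target
rmse = np.sqrt(mse)          # back in the target's own units

# Squaring lets the single large error dominate the score:
# the 10-unit error contributes 100 of the 105 total squared error.
print(mse)    # 35.0
print(rmse)   # ~5.916
```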
Is there maybe a mistake in the RMSE function that I'm simply not seeing? Or is it somehow related to how Keras defines axis=-1 in the MSE function (the purpose of which I haven't fully understood yet)?
When Keras computes the loss, the batch dimension is retained, which is the reason for axis=-1: the mean is taken over the last axis only, so the returned value is a tensor holding one loss per sample rather than a single scalar. This is because the loss for each sample may have to be weighted before the final mean is taken, depending on whether certain arguments, such as sample_weight, are passed to the fit() method.
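As a sketch of that behavior (using NumPy in place of the Keras backend, with made-up values), reducing over axis=-1 keeps the batch dimension, yielding one loss per sample that fit() can then weight:

```python
import numpy as np

y_true = np.array([[1.0, 2.0], [3.0, 4.0]])   # batch of 2 samples
y_pred = np.array([[1.5, 2.5], [2.0, 6.0]])

# Mimics Keras's mean_squared_error: reduce over the last axis only,
# keeping the batch dimension -> one loss value per sample.
per_sample_mse = np.mean((y_true - y_pred) ** 2, axis=-1)
print(per_sample_mse.shape)   # (2,)

# fit() can then apply per-sample weights before the final reduction:
sample_weight = np.array([1.0, 2.0])          # hypothetical weights
weighted_loss = np.average(per_sample_mse, weights=sample_weight)
```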
I get the same result with both approaches.
from tensorflow import keras
from tensorflow.keras import backend as K  # use the tf.keras backend, not the standalone keras package
import numpy as np

def rmse(y_pred, y_true):
    return K.sqrt(K.mean(K.square(y_pred - y_true)))

l1 = keras.layers.Input(shape=(32,))  # shape must be a tuple
l2 = keras.layers.Dense(10)(l1)
model = keras.Model(inputs=l1, outputs=l2)

train_examples = np.random.randn(5, 32)
train_labels = np.random.randn(5, 10)
MSE approach
model.compile(loss='mse', optimizer='adam')
model.evaluate(train_examples, train_labels)
RMSE approach
model.compile(loss=rmse, optimizer='adam')
model.evaluate(train_examples, train_labels)
Output
5/5 [==============================] - 0s 8ms/sample - loss: 1.9011
5/5 [==============================] - 0s 2ms/sample - loss: 1.3788
sqrt(1.9011) ≈ 1.3788, so on this single evaluation batch the RMSE is exactly the square root of the MSE.
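As for the discrepancy in the question (10.7574 vs 8.0966): a plausible explanation is that during training Keras reports each metric averaged over the batches of the epoch, and the average of per-batch RMSEs is not the square root of the average of per-batch MSEs. A NumPy sketch with hypothetical per-batch values:

```python
import numpy as np

# Hypothetical MSE values for three batches in one epoch
batch_mse = np.array([4.0, 100.0, 16.0])

epoch_mse = batch_mse.mean()             # 40.0 -> what the "mse" metric would show
epoch_rmse = np.sqrt(batch_mse).mean()   # (2 + 10 + 4) / 3 ~ 5.333 -> the "rmse" metric

# Taking sqrt of the averaged MSE gives a larger number than the
# averaged per-batch RMSE, matching the direction seen in the question.
print(np.sqrt(epoch_mse))   # ~6.325
print(epoch_rmse)           # ~5.333
```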