I am learning and experimenting with neural networks and would like the opinion of someone more experienced on the following issue:
When I train an autoencoder in Keras ('mean_squared_error' loss function and SGD optimizer), the validation loss gradually goes down and the validation accuracy goes up. So far so good.
However, after a while the loss keeps decreasing but the accuracy suddenly falls back to a much lower level.
See images:
Loss: (green = val, blue = train)
Accuracy: (green = val, blue = train)
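Roughly, the setup looks like this (the layer sizes and data below are placeholders for illustration, not my actual model):

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

n_features = 20  # placeholder input dimension

autoencoder = keras.Sequential([
    layers.Dense(8, activation='relu', input_shape=(n_features,)),  # encoder
    layers.Dense(n_features, activation='sigmoid'),                 # decoder
])

autoencoder.compile(optimizer='sgd',
                    loss='mean_squared_error',
                    metrics=['accuracy'])

X = np.random.rand(1000, n_features)  # placeholder data
history = autoencoder.fit(X, X,       # autoencoder: input is also the target
                          epochs=100,
                          validation_split=0.2,
                          verbose=0)
```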
UPDATE: The comments below pointed me in the right direction and I think I understand it better now. It would be nice if someone could confirm that the following is correct:
The accuracy metric measures the percentage of cases where y_pred == Y_true, and thus only makes sense for classification.
My data is a combination of real-valued and binary features. The reason the accuracy graph rises very steeply and then falls back, while the loss continues to decrease, is that around epoch 5000 the network probably predicted roughly 50% of the binary features correctly. As training continues, around epoch 12000, the prediction of the real and binary features together improves, hence the decreasing loss, but the prediction of the binary features alone becomes slightly less correct. Therefore the accuracy falls while the loss decreases (see the sketch below).
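To illustrate with made-up values (not my actual data): with plain exact-match accuracy, a prediction only counts as correct when it equals the target exactly, which real-valued features almost never do, while MSE still rewards close predictions:

```python
import numpy as np

# 4 binary features followed by 2 real-valued features (made-up values)
y_true = np.array([0.0, 1.0, 0.0, 1.0, 0.37, 0.82])
y_pred = np.array([0.0, 1.0, 0.0, 1.0, 0.40, 0.80])

accuracy = np.mean(y_pred == y_true)   # 4/6 ~ 0.67: only exact matches count
mse = np.mean((y_pred - y_true) ** 2)  # ~ 0.0002: rewards the close real values

print(accuracy, mse)
```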
There are three elements to using early stopping: monitoring model performance, a trigger to stop training, and the choice of which model to use.
With the EarlyStopping class, the metric to be monitored would be 'loss' and the mode would be 'min'. A model.fit() training loop will check at the end of every epoch whether the loss is no longer decreasing, taking min_delta and patience into account if applicable. Once it is found to be no longer decreasing, model.stop_training is set to True and training terminates.
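A minimal sketch of that callback in tf.keras (the parameter values here are illustrative, not prescribed):

```python
from tensorflow.keras.callbacks import EarlyStopping

early_stop = EarlyStopping(
    monitor='loss',             # metric to watch ('val_loss' is also common)
    mode='min',                 # stop when it is no longer decreasing
    min_delta=1e-4,             # smallest change counted as an improvement
    patience=10,                # epochs without improvement before stopping
    restore_best_weights=True,  # roll back to the best weights seen
)

# model.fit(X, X, epochs=20000, callbacks=[early_stop])
```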
People usually focus on the accuracy metric while training a model, but the loss deserves equal attention. By definition, accuracy is the number of correct predictions obtained, while loss values indicate how far the predictions are from the desired target state(s).
In machine learning, early stopping is a form of regularization used to avoid overfitting when training a learner with an iterative method, such as gradient descent. Such methods update the learner so as to make it better fit the training data with each iteration.
If the prediction is real-valued, or the data is continuous rather than discrete, use MSE (mean squared error), because the values are real numbers.
But in the case of discrete values (i.e. classification or clustering), use accuracy, because the values given are only 0 or 1. Here the concept of MSE is not applicable; instead use accuracy = number of correct predictions / total predictions * 100.
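A quick sketch of both cases with made-up values:

```python
import numpy as np

# Continuous targets: MSE measures how close the predictions are
y_true_real = np.array([2.5, 0.0, 2.1])
y_pred_real = np.array([2.4, 0.1, 2.0])
mse = np.mean((y_pred_real - y_true_real) ** 2)     # 0.01

# Discrete targets: accuracy measures the fraction of correct labels
y_true_bin = np.array([0, 1, 1, 0])
y_pred_bin = np.array([0, 1, 0, 0])
accuracy = np.mean(y_pred_bin == y_true_bin) * 100  # 75.0 %
```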