Keras: Why does my val_acc suddenly drop at epoch 42/50?

Tags: tensorflow, keras

I'm training a CNN on around 27,000 image samples. Performance is very good, but all of a sudden, at epoch 42, the validation accuracy drops dramatically (from val_acc: 0.9982 to val_acc: 0.0678). Any idea why? Should I just stop training at the maximum val_acc? It's also odd that the validation accuracy is always higher than the training accuracy.

    Using TensorFlow backend.
    ...
    27091/27067 [==============================] - 2645s - loss: 0.0120 - acc: 0.9967 - val_loss: 0.0063 - val_acc: 0.9982
    Epoch 33/50
    27091/27067 [==============================] - 2674s - loss: 0.0114 - acc: 0.9971 - val_loss: 0.0145 - val_acc: 0.9975
    Epoch 34/50
    27091/27067 [==============================] - 2654s - loss: 0.0200 - acc: 0.9962 - val_loss: 0.0063 - val_acc: 0.9979
    Epoch 35/50
    27091/27067 [==============================] - 2649s - loss: 0.0137 - acc: 0.9964 - val_loss: 0.0069 - val_acc: 0.9985
    Epoch 36/50
    27091/27067 [==============================] - 2663s - loss: 0.0161 - acc: 0.9962 - val_loss: 0.0117 - val_acc: 0.9978
    Epoch 37/50
    27091/27067 [==============================] - 2680s - loss: 0.0155 - acc: 0.9959 - val_loss: 0.0039 - val_acc: 0.9993
    Epoch 38/50
    27091/27067 [==============================] - 2660s - loss: 0.0145 - acc: 0.9965 - val_loss: 0.0117 - val_acc: 0.9973
    Epoch 39/50
    27091/27067 [==============================] - 2647s - loss: 0.0111 - acc: 0.9970 - val_loss: 0.0127 - val_acc: 0.9982
    Epoch 40/50
    27091/27067 [==============================] - 2644s - loss: 0.0112 - acc: 0.9970 - val_loss: 0.0092 - val_acc: 0.9984
    Epoch 41/50
    27091/27067 [==============================] - 2658s - loss: 0.0131 - acc: 0.9967 - val_loss: 0.0057 - val_acc: 0.9982
    Epoch 42/50
    27091/27067 [==============================] - 2662s - loss: 0.0114 - acc: 0.7715 - val_loss: 1.1921e-07 - val_acc: 0.0678
    Epoch 43/50
    27091/27067 [==============================] - 2661s - loss: 1.1921e-07 - acc: 0.0714 - val_loss: 1.1921e-07 - val_acc: 0.0653
    Epoch 44/50
    27091/27067 [==============================] - 2668s - loss: 1.1921e-07 - acc: 0.0723 - val_loss: 1.1921e-07 - val_acc: 0.0664
    Epoch 45/50
    27091/27067 [==============================] - 2669s - loss: 1.1921e-07 - acc: 0.0731 - val_loss: 1.1921e-07 - val_acc: 0.0683

asked Apr 13 '17 09:04

Dídac Royo

1 Answer

Thanks to Marcin Możejko for pointing me in the right direction.

This can happen at very high learning rates: the loss can start increasing after some epochs, as described here. Reducing the learning rate, as described in the Keras callbacks documentation, fixed it for me.

Example:

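    from keras.callbacks import ReduceLROnPlateau

    # Multiply the learning rate by 0.2 when val_loss has not improved for 5 epochs.
    reduce_lr = ReduceLROnPlateau(monitor='val_loss', factor=0.2,
                                  patience=5, min_lr=0.001)
    model.fit(X_train, Y_train, callbacks=[reduce_lr])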
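
To make it clearer what the callback does with `factor`, `patience` and `min_lr`, here is a minimal pure-Python sketch of the reduce-on-plateau rule. It is a simplified illustration, not Keras's actual implementation: it ignores the cooldown and minimum-delta options, and the function name `reduce_on_plateau` and the loss values are made up for the example.

```python
def reduce_on_plateau(val_losses, lr, factor=0.2, patience=5, min_lr=0.001):
    """Return the learning rate in effect after each epoch.

    Simplified sketch of a reduce-on-plateau schedule:
    no cooldown period and no minimum-improvement threshold.
    """
    best = float("inf")
    wait = 0  # epochs since val_loss last improved
    history = []
    for loss in val_losses:
        if loss < best:
            # New best validation loss: reset the patience counter.
            best = loss
            wait = 0
        else:
            wait += 1
            if wait >= patience:
                # Only shrink the rate if it is still above the floor.
                if lr > min_lr:
                    lr = max(lr * factor, min_lr)
                wait = 0
        history.append(lr)
    return history


# Hypothetical validation losses: improving for three epochs, then stalling.
losses = [0.50, 0.30, 0.20, 0.25, 0.24, 0.26, 0.23, 0.22, 0.21]
rates = reduce_on_plateau(losses, lr=0.01, factor=0.2, patience=5, min_lr=0.001)
print(rates)
```

In this sketch the rate stays at its starting value until five consecutive epochs pass without a new best `val_loss`, and only then is it multiplied by `factor`. Note that `min_lr` is a floor: if the optimizer already starts at or below 0.001, a schedule like this cannot lower the rate any further, so `min_lr` should be chosen relative to the initial learning rate.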

answered Oct 19 '22 03:10

Dídac Royo
