Overfitting after first epoch

I am using convolutional neural networks (via Keras) as my model for facial expression recognition (55 subjects). My data set is quite hard, with around 450k samples across 7 classes. I have balanced my training set per subject and per class label.

I implemented a very simple CNN architecture (with real-time data augmentation):

# Keras 1.x API, as used at the time of the question.
# borderMode, initialization and nb_output are defined elsewhere in the script.
from keras.models import Sequential
from keras.layers import Convolution2D, MaxPooling2D, Flatten, Dense, Dropout, Activation
from keras.layers.advanced_activations import PReLU
from keras.layers.normalization import BatchNormalization

model = Sequential()
model.add(Convolution2D(32, 3, 3, border_mode=borderMode, init=initialization, input_shape=(48, 48, 3)))
model.add(BatchNormalization())
model.add(PReLU())
model.add(MaxPooling2D(pool_size=(2, 2)))

model.add(Flatten())
model.add(Dense(256))
model.add(BatchNormalization())
model.add(PReLU())
model.add(Dropout(0.5))

model.add(Dense(nb_output))
model.add(Activation('softmax'))

After the first epoch, my training loss decreases steadily while my validation loss increases. Can overfitting happen that early? Or is there a problem with my data being confusing? Should I also balance my test set?

Renz asked Oct 09 '16 14:10

People also ask

Does Epoch cause overfitting?

So, updating the weights with a single pass, or one epoch, is not enough. One epoch leads to underfitting. As the number of epochs increases, the weights in the neural network are updated more times, and the fitted curve goes from underfitting, to optimal, to overfitting.

Does number of epochs affect overfitting?

In general, too many epochs may cause your model to overfit the training data: the model does not learn the data, it memorizes it. You have to track the accuracy on validation data at each epoch (or iteration) to investigate whether it overfits.
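As a sketch of that per-epoch check, here is a minimal, framework-free helper (the loss values and the `first_overfit_epoch` name are made up for illustration) that reports the epoch after which validation loss stopped improving, i.e. where overfitting likely began:

```python
def first_overfit_epoch(val_losses, patience=1):
    """Return the 1-based epoch after which validation loss stopped improving."""
    best, best_epoch = float("inf"), 0
    for epoch, loss in enumerate(val_losses, start=1):
        if loss < best:
            best, best_epoch = loss, epoch
        elif epoch - best_epoch >= patience:
            # Loss has not improved for `patience` epochs: stop here.
            return best_epoch
    return best_epoch

# Validation loss turns up after epoch 1 -- the pattern in the question.
val_losses = [0.92, 0.97, 1.05, 1.21]
print(first_overfit_epoch(val_losses))  # 1
```

Frameworks offer the same idea built in (e.g. Keras has an `EarlyStopping` callback that monitors a validation metric with a `patience` parameter).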

When can overfitting happen?

Overfitting is a concept in data science, which occurs when a statistical model fits exactly against its training data. When this happens, the algorithm unfortunately cannot perform accurately against unseen data, defeating its purpose.

Is training 1 epoch enough?

A single epoch in training is not enough and leads to underfitting.


1 Answer

It could be that the task is easy to solve and after one epoch the model has learned enough to solve it, and training for more epochs just increases overfitting.

But if you have balanced the train set and not the test set, what may be happening is that you are training for one task (expression recognition on evenly distributed data) and then you are testing on a slightly different task, because the test set is not balanced.
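One alternative to physically re-balancing a set is to weight each class by its inverse frequency. A minimal sketch, using a hypothetical 3-class label list for brevity (Keras `fit` accepts such a dict via its `class_weight` argument):

```python
from collections import Counter

# Hypothetical unbalanced label list standing in for a test/train set.
labels = ["happy"] * 6 + ["sad"] * 3 + ["angry"] * 1

counts = Counter(labels)
n, k = len(labels), len(counts)

# Inverse-frequency weights: w_c = n / (k * count_c).
# Rare classes get a weight above 1, common classes below 1.
weights = {c: n / (k * counts[c]) for c in counts}
print(weights)
```

This keeps the train and test tasks consistent without discarding samples from the majority classes.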

Guillem Cucurull answered Oct 10 '22 23:10