Why neural network tends to output 'mean value'?

Tags:

I am using keras to build a simple neural network for a regression task. But the output is always tends to the 'mean value' of ground truth y data. See the first figure, blue is ground truth, red is predicted value (very close to the constant mean of ground truth).

Also the model stops learning very early even though I set a learning epoch=100.

Anyone have ideas under what kinds of conditions the neural network will stop learning early and why the regression output tends to 'the mean' of ground truth?

Thanks! blue-ground truth; red-predicted value

Learning rate

646

asked Oct 05 '16 00:10

mira67

2 Answers

Possibly because the data are unpredictable....? Do you know for certain that the data set has N order predictability of some kind?

Just eyeballing your data set, it lacks periodicity, lacks homoscedasticity, it lacks any slope or skew or trend or pattern... I can't really tell if there is anything wrong with your 'net. In the absence of any pattern, the mean is always the best prediction... and it is entirely possible (although not certain) that the neural net is doing its job.

I suggest you find an easier data set, and see if you can tackle that first.

100

answered Sep 28 '22 07:09

John Wu

The model is not learning from the data. Think of a basic linear regression - the 'null' prediction, the prediction if you didn't have any predictors at all, is just the expected value; i.e. the mean. It could be caused by many different issues, but initialization comes to mind - bad initialization leads to no learning. This blog post has good practical advice that may help.

answered Sep 28 '22 09:09

Jeremiah Johnson

Related questions
                            
                                Ordinary Least Squares Regression in Vowpal Wabbit
                            
                                Error in `contrasts<-`(`*tmp*`, value = contr.funs[1 + isOF[nn]]) : contrasts can be applied only to factors with 2 or more levels
                            
                                sklearn: Regression models on sparse data?
                            
                                Ridge Regression Grid Search with Pipeline
                            
                                Comparing two linear models with anova() in R [closed]
                            
                                Multi-level regression model on multiply imputed data set in R (Amelia, zelig, lme4)
                            
                                How does R handle ordinal predictors in lm()?
                            
                                Logistic regression on One-hot encoding
                            
                                plot linear regressions lines without interaction in ggplot2
                            
                                Elastic net regression or lasso regression with weighted samples (sklearn)
                            
                                Using the glmulti package in R for exhaustive search multiple regression for akaike weights
                            
                                Plot the results of a multivariate logistic regression model in R
                            
                                Lasso r code - what is wrong with it?
                            
                                How obtain the true residual deviance and degrees of freedom in R of a glm model when a set of parameters gets pasted() as a vector
                            
                                Output each factor level as dummy variable in stargazer summary statistics table
                            
                                ggplot2: Plotting regression lines with different intercepts but with same slope
                            
                                Python Multiple Linear Regression using OLS code with specific data?
                            
                                How to set the Coefficient Value in Regression; R
                            
                                Need SQL Server Query to solve 3rd Order Polynomial Regression
                            
                                How to do 2SLS IV regression using statsmodels python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why neural network tends to output 'mean value'?

Tags:

keras

regression

mira67

People also ask

2 Answers

John Wu

Jeremiah Johnson

Recent Activity

Donate For Us