I am trying to predict a continuous value (using a neural network for the first time). I have normalized the input data, but I can't figure out why I am getting a loss: nan
output starting from the first epoch.
I read and tried many suggestions from previous answers to the same question, but none of them helped me. My training data shape is (201917, 64).
Here's my code:
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense

model = Sequential()
model.add(Dense(100, input_dim=X.shape[1], activation='relu'))
model.add(Dense(100, activation='relu'))
model.add(Dense(100, activation='relu'))
# Output layer: a single linear unit for the continuous target
model.add(Dense(1, activation='linear'))
# Compile the model with MSE loss and the Adam optimizer
model.compile(loss='mean_squared_error', optimizer='Adam')
# Train the model
model.fit(X_train, y_train, epochs=10, batch_size=32,
          shuffle=True, verbose=2)
Faulty loss function: sometimes the computation of the loss in the loss layers causes NaNs to appear, for example feeding an InfogainLoss layer with non-normalized values, or using a custom loss layer with bugs.
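As an illustration (this is not the asker's code, just a hypothetical buggy loss), a log-based custom loss produces NaN as soon as a prediction is non-positive, and clipping the inputs to the log fixes it:

import tensorflow as tf

def unsafe_msle(y_true, y_pred):
    # log() returns NaN for negative inputs and -inf at zero,
    # which poisons the loss from the first batch onward
    return tf.reduce_mean(tf.square(tf.math.log(y_true) - tf.math.log(y_pred)))

def safe_msle(y_true, y_pred):
    # clip both tensors away from zero before taking the log
    y_true = tf.clip_by_value(y_true, 1e-7, 1e7)
    y_pred = tf.clip_by_value(y_pred, 1e-7, 1e7)
    return tf.reduce_mean(tf.square(tf.math.log(y_true) - tf.math.log(y_pred)))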
If you accumulate the loss for every single sample as you loop through the data inside a batch, it will most probably lead to NaN. Instead, compute your loss once per batch forwarded through the network, i.e., accumulate one averaged loss value per batch; a minimal sketch follows.
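In a manual training loop this looks like the sketch below, where the built-in loss already averages over the batch. Here model and train_dataset are assumed to exist (train_dataset being a tf.data.Dataset of (X, y) batches):

import tensorflow as tf

loss_fn = tf.keras.losses.MeanSquaredError()
optimizer = tf.keras.optimizers.Adam()

epoch_loss = 0.0
num_batches = 0
for x_batch, y_batch in train_dataset:
    with tf.GradientTape() as tape:
        y_pred = model(x_batch, training=True)
        batch_loss = loss_fn(y_batch, y_pred)  # already averaged over the batch
    grads = tape.gradient(batch_loss, model.trainable_variables)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    epoch_loss += float(batch_loss)  # accumulate once per batch, not per sample
    num_batches += 1
print("mean epoch loss:", epoch_loss / num_batches)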
A constant learning rate is the default schedule in all Keras optimizers; in the SGD optimizer, for example, it defaults to 0.01. To use a custom learning rate, instantiate the optimizer yourself and pass the learning_rate argument. A learning rate that is too high for your data can make the gradients, and therefore the loss, explode to NaN, so lowering it is often the first thing to try.
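For example, to try a smaller learning rate than the defaults (Adam defaults to 0.001, SGD to 0.01), pass an optimizer instance to compile instead of the string name; the value 0.0001 here is just an example to experiment with:

from tensorflow.keras.optimizers import Adam

# a lower learning rate often stops the loss from exploding to NaN
model.compile(loss='mean_squared_error',
              optimizer=Adam(learning_rate=0.0001))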
These are the steps that you can take to find the cause of your problem:
Make sure that your dataset is what it should be: check that it contains no NaN or infinite values (see the sketch after this list).
Regularize your model using Dropout or BatchNormalization, add L1/L2 regularization, change your batch_size, or scale your data to another range (e.g. [-1, 1]).
Reduce the size of your network.
Change other hyper-parameters (e.g. optimizer or activation function).
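Putting the first two steps into code, a quick data sanity check plus a smaller, regularized variant of the model might look like this (the layer sizes and dropout rate are arbitrary choices to experiment with, not prescribed values):

import numpy as np
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Dropout, BatchNormalization

# 1. Verify the data contains no NaN or infinite values
assert not np.isnan(X_train).any(), "NaNs in the input features"
assert not np.isinf(X_train).any(), "Infs in the input features"
assert not np.isnan(y_train).any(), "NaNs in the targets"

# 2. A smaller, regularized network
model = Sequential()
model.add(Dense(64, input_dim=X_train.shape[1], activation='relu'))
model.add(BatchNormalization())
model.add(Dropout(0.2))
model.add(Dense(64, activation='relu'))
model.add(Dense(1, activation='linear'))
model.compile(loss='mean_squared_error', optimizer='adam')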