I am using Keras with TensorFlow backend to train CNN models. What is the between <code>model.fit()</code> and <code>model.evaluate()</code>? Which one should I ideally use? (I am using <code>model.fit()</code> as of now). I know the utility of <code>model.fit()</code> and <code>model.predict()</code>. But I am unable to understand the utility of <code>model.evaluate()</code>. Keras documentation just says: <blockquote> It is used to evaluate the model. </blockquote> I feel this is a very vague definition.

<code>fit()</code> is for training the model with the given inputs (and corresponding training labels). <code>evaluate()</code> is for evaluating the already trained model using the validation (or test) data and the corresponding labels. Returns the loss value and metrics values for the model. <code>predict()</code> is for the actual prediction. It generates output predictions for the input samples. Let us consider a simple regression example: <pre class="prettyprint"><code># input and output x = np.random.uniform(0.0, 1.0, (200)) y = 0.3 + 0.6*x + np.random.normal(0.0, 0.05, len(y)) </code></pre> <img src="https://i.stack.imgur.com/HSB8Y.png" alt="enter image description here"> Now lets apply a regression model in keras: <pre class="prettyprint"><code># A simple regression model model = Sequential() model.add(Dense(1, input_shape=(1,))) model.compile(loss='mse', optimizer='rmsprop') # The fit() method - trains the model model.fit(x, y, nb_epoch=1000, batch_size=100) Epoch 1000/1000 200/200 [==============================] - 0s - loss: 0.0023 # The evaluate() method - gets the loss statistics model.evaluate(x, y, batch_size=200) # returns: loss: 0.0022612824104726315 # The predict() method - predict the outputs for the given inputs model.predict(np.expand_dims(x[:3],1)) # returns: [ 0.65680361],[ 0.70067143],[ 0.70482892] </code></pre>

What is the difference between model.fit() an model.evaluate() in Keras?

2 Answers

fit() is for training the model with the given inputs (and corresponding training labels).

evaluate() is for evaluating the already trained model using the validation (or test) data and the corresponding labels. Returns the loss value and metrics values for the model.

predict() is for the actual prediction. It generates output predictions for the input samples.

Let us consider a simple regression example:

# input and output x = np.random.uniform(0.0, 1.0, (200)) y = 0.3 + 0.6*x + np.random.normal(0.0, 0.05, len(y))

enter image description here

Now lets apply a regression model in keras:

# A simple regression model model = Sequential() model.add(Dense(1, input_shape=(1,))) model.compile(loss='mse', optimizer='rmsprop')  # The fit() method - trains the model model.fit(x, y, nb_epoch=1000, batch_size=100)  Epoch 1000/1000 200/200 [==============================] - 0s - loss: 0.0023  # The evaluate() method - gets the loss statistics model.evaluate(x, y, batch_size=200)      # returns: loss: 0.0022612824104726315  # The predict() method - predict the outputs for the given inputs model.predict(np.expand_dims(x[:3],1))  # returns: [ 0.65680361],[ 0.70067143],[ 0.70482892]

186

answered Oct 10 '22 14:10

vijay m

In Deep learning you first want to train your model. You take your data and split it into two sets: the training set, and the test set. It seems pretty common that 80% of your data goes into your training set and 20% goes into your test set.

Your training set gets passed into your call to fit() and your test set gets passed into your call to evaluate(). During the fit operation a number of rows of your training data are fed into your neural net (based on your batch size). After every batch is sent the fit algorithm does back propagation to adjust the weights in your neural net.

After this is done your neural net is trained. The problem is sometimes your neural net gets overfit which is a condition where it performs well for the training set but poorly for other data. To guard against this situation you run the evaluate() function to send new data (your test set) through your neural net to see how it performs with data it has never seen. There is no training occurring, this is purely a test. If all goes well then the score from training is similar to the score from testing.

answered Oct 10 '22 13:10

rancidfishbreath

Related questions
                            
                                Working with multiple graphs in TensorFlow
                            
                                Multilabel Text Classification using TensorFlow
                            
                                What does tf.gather_nd intuitively do?
                            
                                How do Monitored Training Sessions work?
                            
                                Difference between installation libraries of Tensorflow GPU vs CPU
                            
                                Getting the current learning rate from a tf.train.AdamOptimizer
                            
                                tensorflow Mac OS gpu support
                            
                                Tensorflow: How to convert .meta, .data and .index model files into one graph.pb file
                            
                                What is difference frozen_inference_graph.pb and saved_model.pb?
                            
                                TensorFlow - Read all examples from a TFRecords at once?
                            
                                This model has not yet been built error on model.summary()
                            
                                How do I specify nvidia runtime from docker-compose.yml?
                            
                                ImportError: Failed to import any qt binding, Python - Tensorflow
                            
                                Update TensorFlow
                            
                                Machine Learning : Tensorflow v/s Tensorflow.js v/s Brain.js [closed]
                            
                                How can I make tensorflow run on a GPU with capability 2.x?
                            
                                Visualizing output of convolutional layer in tensorflow
                            
                                How to understand loss acc val_loss val_acc in Keras model fitting
                            
                                What is the meaning of the "None" in model.summary of KERAS?
                            
                                How to use tf.while_loop() in tensorflow

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is the difference between model.fit() an model.evaluate() in Keras?

Tags:

tensorflow

model

keras

evaluate

Abhijit Balaji

People also ask

2 Answers

vijay m

rancidfishbreath

Recent Activity

Donate For Us