Meaning of batch_size in model.evaluate()

I am building a plain vanilla FNN and want to evaluate my model after training. I was wondering what impact the batch_size has when evaluating the model on a test set. Of course it is relevant for training, as it determines the number of samples fed to the network before computing the next gradient. It is also clear that it can be needed when predicting values with a (stateful) RNN. But it is not clear to me why it is needed when evaluating a model, especially an FNN. Furthermore, I get slightly different values when I evaluate the model on the same test set but with different batch sizes. Consider the following toy example:

import numpy as np
from keras.models import Sequential
from keras.layers import Dense, Activation
from keras.optimizers import SGD

# function to be learned
def f(x):
    return x[0] + x[1] + x[2]

# sample training and test points on a rectangular grid
x_train = np.random.uniform(low = -10, high = 10, size = (50,3))
y_train = np.apply_along_axis(f, 1, x_train).reshape(-1,1)

x_test = np.random.uniform(low = -10, high = 10, size = (50,3))
y_test = np.apply_along_axis(f, 1, x_test).reshape(-1,1)

model = Sequential()
model.add(Dense(20, input_dim = 3, activation = 'tanh'))
model.add(Dense(1))

sgd = SGD(lr=0.01, decay=1e-6, momentum=0.9, nesterov=True)
model.compile(loss='mse', optimizer=sgd)
model.fit(x_train, y_train, batch_size = 10, epochs = 30, verbose = 0)

model.evaluate(x_test, y_test, batch_size = 10)
model.evaluate(x_test, y_test, batch_size = 20)
model.evaluate(x_test, y_test, batch_size = 30)
model.evaluate(x_test, y_test, batch_size = 40)
model.evaluate(x_test, y_test, batch_size = 50)

The values are very similar but nevertheless different. Where does this come from? Shouldn't the following always be true?

from sklearn.metrics import mean_squared_error as mse
0 == model.evaluate(x_test, y_test) - mse(model.predict(x_test), y_test)
asked Jun 06 '18 by lbf_1994

People also ask

What is Batch_size in model fit?

The batch size is the number of samples processed before the model is updated. The number of epochs is the number of complete passes through the training dataset. The size of a batch must be at least one and at most the number of samples in the training dataset.
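As a quick worked example (hypothetical numbers, not taken from the question above): with 1,000 training samples and batch_size = 32, each epoch performs ceil(1000 / 32) = 32 weight updates.

import math

# Hypothetical numbers, purely for illustration.
num_samples = 1000   # size of the training set
batch_size = 32      # samples per gradient update
epochs = 30          # complete passes over the training data

updates_per_epoch = math.ceil(num_samples / batch_size)  # 32
total_updates = updates_per_epoch * epochs               # 960
print(updates_per_epoch, total_updates)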

What does Batch_size mean?

Batch size is a term used in machine learning that refers to the number of training examples used in one iteration. The batch size can be one of three options: batch mode, where the batch size equals the size of the whole dataset, making each iteration a full epoch; mini-batch mode, where the batch size is larger than one but smaller than the dataset; and stochastic mode, where the batch size is one.

What is Batch_size in CNN?

Put simply, the batch size is the number of samples that will be passed through the network at one time. Note that a batch is also commonly referred to as a mini-batch.

What is Batch_size in LSTM?

Batch size is the number of samples we send to the model at a time. In this example we have batch size = 2, but you could use 4, 8, 16, 32, 64, etc., depending on the available memory (typically a power of 2).


1 Answer

No, they don't have to be the same. If you combine floating point math with parallelism, you don't get reproducible results, because (a + b) + c is not necessarily the same as a + (b + c).
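For instance, here is a minimal illustration in plain Python (my own sketch, independent of Keras):

a, b, c = 1e16, -1e16, 1.0

print((a + b) + c)   # 1.0
print(a + (b + c))   # 0.0 -- the 1.0 is lost when it is added to -1e16 first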

The evaluate function of Model has a batch_size argument simply to speed up evaluation: the network can process multiple samples at a time, and with a GPU this makes evaluation much faster. I think the only way to reduce the effect of this would be to set batch_size to one.
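As a rough sketch of how to check this (continuing the toy model from the question; the exact numbers will vary from run to run), you can compare evaluate() at several batch sizes against a single reference MSE computed from predict():

from sklearn.metrics import mean_squared_error as mse

# Reference loss from one forward pass over the whole test set.
y_pred = model.predict(x_test, batch_size=len(x_test))
reference = mse(y_test, y_pred)

# evaluate() accumulates the loss batch by batch, so the summation order
# depends on batch_size and the results differ only by rounding noise.
for bs in (1, 10, 25, 50):
    loss = model.evaluate(x_test, y_test, batch_size=bs, verbose=0)
    print("batch_size=%2d  loss=%.8f  diff=%+.2e" % (bs, loss, loss - reference))

The differences printed this way should sit near float32 precision rather than indicate a real discrepancy.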

answered Oct 20 '22 by Dr. Snoopy