I am trying to use Keras for binary classification of an image.
My CNN model is well trained on the training data (giving ~90% training accuracy and ~93% validation accuracy). But during prediction, if I set the batch size to 15000 I get Figure I as the output, and if I set the batch size to 50000 I get Figure II as the output. Can someone please tell me what is wrong? The prediction should not depend on the batch size, right?
Code I am using for prediction:
y = model.predict_classes(patches, batch_size=50000, verbose=1)
y = y.reshape((256, 256))
My model:
from keras.models import Sequential
from keras.layers import Dense, Dropout, Activation, Flatten
from keras.layers import Convolution2D, MaxPooling2D
from keras.optimizers import SGD

model = Sequential()
model.add(Convolution2D(32, 3, 3, border_mode='same',
                        input_shape=(img_channels, img_rows, img_cols)))
model.add(Activation('relu'))
model.add(Convolution2D(32, 3, 3))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(Dropout(0.25))

model.add(Convolution2D(64, 3, 3, border_mode='same'))
model.add(Activation('relu'))
model.add(Convolution2D(64, 3, 3))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(Dropout(0.25))

model.add(Flatten())
model.add(Dense(512))
model.add(Activation('relu'))
model.add(Dropout(0.5))
model.add(Dense(nb_classes))
model.add(Activation('softmax'))

# let's train the model using SGD + momentum (how original).
sgd = SGD(lr=0.01, decay=1e-6, momentum=0.9, nesterov=True)
model.compile(loss='categorical_crossentropy',
              optimizer=sgd,
              metrics=['accuracy'])
During training, the batch size limits how many samples are shown to the network before a weight update is performed. The same notion carries over to prediction: when you call predict on the fitted model, the batch size controls how many samples are pushed through the network at a time.
The default prediction batch size is 32, which can make prediction slow on a large input. You can specify any batch size you like, even one as large as 10,000, e.g. model.predict(X, batch_size=10000). Just remember that the larger the batch size, the more data has to be held in RAM at once, so test what works on your hardware.
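As a quick sanity check (a minimal sketch, using the model and patches from the question), you can compare the probabilities returned for two different batch sizes; for a deterministic, batch-independent forward pass they should match up to floating-point noise:

import numpy as np

probs_small = model.predict(patches, batch_size=15000, verbose=1)
probs_large = model.predict(patches, batch_size=50000, verbose=1)

# If the forward pass does not depend on the batch size, this prints True.
print(np.allclose(probs_small, probs_large, atol=1e-5))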
Our experiments showed that a higher batch size does not usually achieve higher accuracy, and that the learning rate and the optimizer used have a significant impact as well. Lowering the learning rate and decreasing the batch size allow the network to train better, especially when fine-tuning.
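For instance (a sketch only; the learning rate, batch size, epoch count, and the X_train/Y_train/X_val/Y_val names are illustrative, not taken from the question), fine-tuning with a lower learning rate and a smaller batch might look like:

from keras.optimizers import SGD

# Recompile with a lower learning rate for fine-tuning.
sgd_finetune = SGD(lr=0.001, decay=1e-6, momentum=0.9, nesterov=True)
model.compile(loss='categorical_crossentropy',
              optimizer=sgd_finetune,
              metrics=['accuracy'])

# Continue training with a smaller batch size.
model.fit(X_train, Y_train,
          batch_size=32,
          nb_epoch=10,
          validation_data=(X_val, Y_val))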
Batch prediction is useful when you want to generate predictions for a set of observations all at once, and then take action on a certain percentage or number of the observations. Typically, you do not have a low latency requirement for such an application.
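For example (a sketch; the 10% cutoff and the assumption that the second softmax column is the positive class are mine, not from the question), you might score every patch in one pass and then act only on the most confident fraction:

import numpy as np

# Score all observations in a single batch-prediction pass.
probs = model.predict(patches, batch_size=50000, verbose=1)[:, 1]

# Keep the indices of the top 10% highest-scoring observations.
cutoff = np.percentile(probs, 90)
selected = np.where(probs >= cutoff)[0]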
Keras is standardizing the input automatically inside the predict function. The statistics needed for standardization are computed on a batch, which is why your outputs might depend on the batch size. You may solve this in a couple of ways; the code for the second solution is given in the update below.
UPDATE: here is the code for the 2nd solution:
import theano

input = model.layers[0].input    # gets the model's input Theano tensor
output = model.layers[-1].output # gets the model's output Theano tensor
model_theano = theano.function([input], output)  # compiles a Theano function (note the list around the inputs)

# Now model_theano is a function which behaves exactly like your classifier
predicted_score = model_theano(example)  # returns the predicted score for an example argument
Now if you want to use this new model_theano, you should standardize the main dataset on your own (e.g. by subtracting the mean and dividing by the standard deviation of every pixel in your images) and apply model_theano to obtain scores for the whole dataset (you could do this in a loop iterating over examples, or with the numpy.apply_along_axis or numpy.apply_over_axes functions). A sketch of this is given below.
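A minimal sketch of that manual pipeline (the standardization statistics and the per-example loop are my assumptions; patches is the array of image patches from the question):

import numpy as np

# Standardize every pixel using statistics computed over the whole dataset.
mean = patches.mean(axis=0)
std = patches.std(axis=0) + 1e-8   # small epsilon to avoid division by zero
patches_std = (patches - mean) / std

# Apply the compiled Theano function one example at a time.
scores = np.array([model_theano(p[np.newaxis, ...])[0] for p in patches_std])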
UPDATE 2: in order to make this solution work, change
model.add(Dense(nb_classes))
model.add(Activation('softmax'))
to:
model.add(Dense(nb_classes, activation="softmax"))