Understanding batch_size in CNNs

Question

Say that I have a CNN model in Pytorch and 2 inputs of the following sizes:

input_1: [2, 1, 28, 28]
input_2: [10, 1, 28, 28]

Notes:

To reiterate, input_1 is batch_size == 2 and input_2 is batch_size == 10.
Input_2 is a superset of input_1. That is, input_2 contains the 2 images in input_1 in the same position.

My question is: how does the CNN process the images in both inputs? I.e. does the CNN process every image in the batch sequentially? Or does it concatenate all of the images in the batch size and then perform convolutions per usual?

The reason I ask is because:

The output of CNN(input_1) != CNN(input_2)[:2]

That is, the difference in batch_size results in slightly different CNN outputs for both inputs for the same positions.

Jatentaki · Accepted Answer

CNN is a general term for convolutional neural networks. Depending on the particular architecture it may do different things. The main building blocks of CNNs are convolutions which do not cause any "crosstalk" between items in batch and pointwise nonlinearities like ReLU which do not either. However, most architectures also involve other operations, such as normalization layers - arguably the most popular is batch norm which does introduce crosstalk. Many models will also use dropout which behaves stochastically outside of eval mode (by default models are in train mode). Both above effects could lead to the observed outcome above, as well as other custom operations which could cause cross-talk across the batch.

Aside from that, because of numeric precision issues, your code may not give exactly the same results, even if it doesn't feature any cross-batch operations. This error is very minor but sufficient to manifest itself when checking with CNN(input_1) == CNN(input_2)[:2]. It is better to use allclose instead, with a suitable epsilon.

Understanding batch_size in CNNs

Tags:

python

pytorch

conv-neural-network

DSH

1 Answers

Jatentaki

Recent Activity

Donate For Us

Understanding batch_size in CNNs

Tags:

python

pytorch

conv-neural-network

DSH

1 Answers

Jatentaki

Related questions

Recent Activity

Donate For Us