Batch normalization uses a mini-batch mean and variance to normalize layer outputs. If I train a network with a batch size of, say, 100, but then want to use the trained network for single-shot predictions (batch size 1), should I expect to run into problems? Should I penalize the batch norm layer to converge towards the identity transform during training to avoid this?
No, you should not run into problems, and there is no need to push the layer towards the identity transform. At test time the batch normalization layer does not compute statistics from the incoming batch; it simply scales and shifts its inputs using fixed population estimates of the mean and variance (typically running averages accumulated during training) together with the learned scale and shift parameters. Because these factors are fixed, the layer behaves the same for any batch size, including 1.
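As a minimal sketch of this behaviour, assuming PyTorch (the toy network, layer sizes, and tensors below are illustrative, not from the question): in training mode the batch norm layer uses batch statistics and updates its running averages, while in evaluation mode it uses those stored averages, so a batch of size 1 works fine.

```python
import torch
import torch.nn as nn

# Hypothetical toy network containing a batch norm layer
net = nn.Sequential(
    nn.Linear(10, 32),
    nn.BatchNorm1d(32),
    nn.ReLU(),
    nn.Linear(32, 1),
)

# Training-mode forward pass with batch size 100:
# the layer normalizes with batch statistics and updates its running mean/var.
net.train()
_ = net(torch.randn(100, 10))

# Evaluation mode: the layer now scales and shifts with the stored running
# statistics and the learned affine parameters, so batch size 1 is no problem.
net.eval()
with torch.no_grad():
    single_prediction = net(torch.randn(1, 10))
print(single_prediction.shape)  # torch.Size([1, 1])
```

Note that forgetting to call `net.eval()` is the usual source of trouble here: in training mode a batch of size 1 would force the layer to normalize with degenerate batch statistics.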