How do I set weights of the batch normalization layer of Keras?
I am a bit confused by the documentation:
weights: Initialization weights. List of 2 Numpy arrays, with shapes: [(input_shape,), (input_shape,)] Note that the order of this list is [gamma, beta, mean, std]
Do we need all four [gamma, beta, mean, std]? Is there a way to set weights using only [gamma, beta]?
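For concreteness, this is roughly what I am trying to do (a minimal sketch assuming tf.keras, where set_weights takes the same list the docstring describes; the values are just placeholders):

import numpy as np
from tensorflow.keras import layers

n_features = 8
bn = layers.BatchNormalization()
bn.build((None, n_features))            # creates the four weight arrays

gamma    = np.ones(n_features)          # scale
beta     = np.zeros(n_features)         # shift
mean     = np.zeros(n_features)         # moving mean
variance = np.ones(n_features)          # moving variance (tf.keras stores the variance rather than the std)

bn.set_weights([gamma, beta, mean, variance])   # works
# bn.set_weights([gamma, beta])                 # what I would like to do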
Using batch normalization allows us to use much higher learning rates, which further increases the speed at which networks train. It also makes weights easier to initialize: weight initialization can be difficult, and it becomes even more difficult when creating deeper networks.
Layer normalization, by contrast, is independent of the batch size, so it can be applied to smaller batches as well. Batch normalization requires different processing at training and inference time.
Just like the parameters (e.g. weights, biases) of any other network layer, a Batch Norm layer has parameters of its own: two learnable parameters called gamma and beta, plus the non-learnable moving mean and standard deviation it tracks.
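You can see this by inspecting the layer directly. A minimal sketch, assuming tf.keras 2.x (the attribute and variable names are from that API):

from tensorflow.keras import layers

bn = layers.BatchNormalization()
bn.build((None, 4))   # create the layer's variables for 4 features

# gamma and beta are trained by backprop; the moving statistics are not
print([w.name for w in bn.trainable_weights])      # gamma, beta
print([w.name for w in bn.non_trainable_weights])  # moving_mean, moving_variance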
In practical coding, Batch Normalization is added either just before or just after a layer's activation function. Mostly, researchers have found good results when placing Batch Normalization after the activation layer, as in the sketch below.
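A minimal placement sketch, assuming tf.keras 2.x; the layer sizes are arbitrary placeholders:

from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Input(shape=(20,)),
    layers.Dense(64, activation="relu"),
    layers.BatchNormalization(),            # normalizes the activated outputs
    layers.Dense(10, activation="softmax"),
])
model.summary()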
Yes, you need all four values. Recall what batch normalization does: its goal is to normalize (i.e. to mean = 0 and standard deviation = 1) the inputs coming into each layer. For this you need (mean, std), which give the normalized activation x_norm = (x - mean) / std. That normalized activation can then be viewed as the input to a sub-network which applies a linear transformation:

y = gamma * x_norm + beta

(gamma, beta) are very important since they complement (mean, std): they let the network recover the original activations from the normalized ones. If you drop them, or change any one parameter without considering the others, you risk changing the semantic meaning of the activations. These original activations can now be processed by your next layer, and the process is repeated for all layers.
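A tiny numpy illustration of that point (not Keras-specific; the values are made up): with gamma = std and beta = mean, the linear transform exactly undoes the normalization.

import numpy as np

x = np.array([2.0, 4.0, 6.0, 8.0])      # some activations
mean, std = x.mean(), x.std()

x_norm = (x - mean) / std               # normalized: mean 0, std 1
gamma, beta = std, mean                 # one particular choice of the learnable parameters
y = gamma * x_norm + beta               # recovers the original activations

print(np.allclose(y, x))                # True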
Edit:
On the other hand, I think it would be worth trying to first compute the mean and std on a large number of images and use those as your mean and std. Take care that the images you compute the mean and std on come from the same distribution as your training data. I think this should work, as batch normalization usually has two modes for computing the mean: one is a running average maintained over batches, and the other is a global mean (at least in Caffe, see here).
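A rough sketch of that idea, assuming tf.keras 2.x; `images` is a hypothetical array standing in for data drawn from your training distribution:

import numpy as np
from tensorflow.keras import layers

images = np.random.rand(1000, 32, 32, 3)        # placeholder for your real images

channel_mean = images.mean(axis=(0, 1, 2))      # per-channel statistics
channel_var  = images.var(axis=(0, 1, 2))       # tf.keras stores variance rather than std

bn = layers.BatchNormalization()                # normalizes the last (channel) axis by default
bn.build((None, 32, 32, 3))

gamma, beta, _, _ = bn.get_weights()            # keep the learnable parameters as they are
bn.set_weights([gamma, beta, channel_mean, channel_var])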