How does mean image subtraction work?

To preface, I am new to the field of ML/CV, and am currently in the process of training a custom conv net using Caffe.

I am interested in mean image subtraction to achieve basic data normalization on my training images. However, I am confused as to how mean subtraction works and exactly what benefits it has.

I know that a "mean image" can be calculated from the training set, which is then subtracted from the training, validation, and testing sets to make the network less sensitive to differing background and lighting conditions.

Does this involve calculating the mean of all pixels in each image, and averaging these? Or, is the value from each pixel coordinate averaged across all images in the set (i.e. average values of pixels at location (1,1) for all images)? This may require that all images are the same size...

Also, for color images (3 channels), is the value for each channel averaged individually?

Any clarity would be appreciated.

asked Jun 27 '17 19:06

Mink

2 Answers

In deep learning, there are in fact different practices as to how to subtract the mean image.

Subtract mean image

The first way is to subtract the mean image, as @lejlot described. The catch is that the mean image is only defined when every image in the dataset has the same size, so you must bring all images to a common size before using this method (e.g., resize each original image and crop a patch of a fixed size from it). This approach is used in the original ResNet paper, see reference here.
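As a rough illustration of this first approach, here is a minimal NumPy sketch. The random arrays stand in for real training images, and the `center_crop` helper and the 32-pixel crop size are assumptions made for the example, not something taken from Caffe or the ResNet paper.

```python
import numpy as np

def center_crop(img, size):
    """Crop an (H, W, C) array to (size, size, C) around its center."""
    h, w = img.shape[:2]
    top = (h - size) // 2
    left = (w - size) // 2
    return img[top:top + size, left:left + size]

rng = np.random.default_rng(0)

# Hypothetical training images of differing sizes, pixel values in [0, 255].
shapes = [(40, 48, 3), (36, 36, 3), (50, 32, 3)]
train = [rng.integers(0, 256, size=s).astype(np.float32) for s in shapes]

# Bring every image to a common size so the mean image is well defined.
crop = 32
train_batch = np.stack([center_crop(im, crop) for im in train])  # (N, 32, 32, 3)

# Mean image: average over the image axis only, computed on training data.
mean_image = train_batch.mean(axis=0)  # (32, 32, 3)

# Subtract the same training mean image from every training image.
train_centered = train_batch - mean_image
```

Validation and test images would be cropped to the same size and have this same `mean_image` subtracted, rather than a mean computed from their own set.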

Subtract the per-channel mean

The second way, which is more popular, is to subtract a per-channel mean from the original image. With this approach you do not need to resize or crop the original images: you just calculate one mean value per channel from the training set. This is used widely in deep learning, e.g., Caffe: here and here. Keras: here. PyTorch: here. (PyTorch also divides the per-channel value by the standard deviation.)
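A minimal NumPy sketch of the per-channel variant follows. The `per_channel_stats` helper and the random images are assumptions for illustration, and this version weights every pixel equally, so larger images contribute more; averaging per-image means instead is another reasonable choice.

```python
import numpy as np

def per_channel_stats(images):
    """Return per-channel mean and std over every pixel of every image.

    Each image is an (H, W, 3) array; sizes may differ between images.
    """
    total = np.zeros(3, dtype=np.float64)
    total_sq = np.zeros(3, dtype=np.float64)
    count = 0
    for img in images:
        pixels = img.reshape(-1, 3).astype(np.float64)
        total += pixels.sum(axis=0)
        total_sq += (pixels ** 2).sum(axis=0)
        count += pixels.shape[0]
    mean = total / count
    std = np.sqrt(total_sq / count - mean ** 2)
    return mean, std

rng = np.random.default_rng(0)

# Hypothetical training images of differing sizes, pixel values in [0, 255].
shapes = [(20, 30, 3), (16, 16, 3), (25, 10, 3)]
train = [rng.integers(0, 256, size=s).astype(np.float32) for s in shapes]

mean, std = per_channel_stats(train)  # each has shape (3,)

# Broadcasting applies the three channel values to every pixel position.
centered = [img - mean for img in train]             # mean subtraction only
standardized = [(img - mean) / std for img in train]  # also divide by std
```

Because `mean` has shape `(3,)`, it broadcasts against an image of any height and width, which is why no resizing is needed here.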

answered Sep 26 '22 04:09

jdhao

The mean image is an image whose pixel at (i, j, c) is the average of the (i, j, c) pixels from all images. So you take the mean separately for each position and each color channel. This of course requires all images to have the same size; otherwise the mean image is not defined. Also, it is not really about being less sensitive to different conditions. It has nothing to do with that: it is literally just to keep the initial activations in a reasonable range, nothing else.
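To make the definition concrete, here is a tiny NumPy sketch with made-up values; the array layout `(N, H, W, C)` is an assumption chosen for the example. It also contrasts the mean image with the other reading raised in the question, one scalar mean per image.

```python
import numpy as np

# Two hypothetical 2x2 RGB "images" stacked as (N, H, W, C), values 0..23.
images = np.arange(2 * 2 * 2 * 3, dtype=np.float32).reshape(2, 2, 2, 3)

# Mean image: average over the image axis, one value per (i, j, c).
mean_image = images.mean(axis=0)  # shape (2, 2, 3)

# The pixel at (0, 0, 0) is the average of that pixel across both images.
manual = (images[0, 0, 0, 0] + images[1, 0, 0, 0]) / 2  # (0 + 12) / 2

# The other reading from the question: one scalar mean per image.
# This is a different quantity and is not the mean image.
per_image_mean = images.mean(axis=(1, 2, 3))  # shape (2,)
```

Here `mean_image` keeps the spatial and channel dimensions, whereas `per_image_mean` collapses each image to a single number.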

answered Sep 22 '22 04:09

lejlot

Tags:

image

machine-learning

computer-vision

caffe

conv-neural-network