Image convolution with even-sized kernel

Tags: image-processing, convolution

I want to perform a simple 2D image convolution, but my kernel is even-sized. Which indices should I pick for my kernel center? I tried googling for an answer and looking at existing code. People usually center their kernel so that there is one more sample before the new 0. So, for a 4x4 kernel, the centered indices along each axis would be -2 -1 0 +1. Is that correct? And if it is, why? Can someone explain why -2 -1 0 +1 is correct while -1 0 +1 +2 is not? Keep in mind that I want to perform the convolution without using FFT.

asked Jun 11 '13 03:06

AstrOne

2 Answers

If I understand your question correctly, then you are right: for even-sized kernels, the convention is to centre the kernel so that there is one more sample before the new zero.

So, for a kernel of width 4, the centred indices will be -2 -1 0 +1 as you say above.
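As a minimal sketch of that convention (the class and method names here are hypothetical, chosen only for illustration), the offsets can be derived from the kernel width with integer division, which puts the extra sample before zero when the width is even:

```java
import java.util.Arrays;

public class KernelOffsets {
    // Offsets of each kernel tap relative to the output sample, using the
    // convention that an even width puts the extra tap before zero.
    static int[] centredOffsets(int width) {
        int[] offsets = new int[width];
        int first = -(width / 2); // integer division: 4 -> -2, 3 -> -1
        for (int k = 0; k < width; k++) {
            offsets[k] = first + k;
        }
        return offsets;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(centredOffsets(4))); // [-2, -1, 0, 1]
        System.out.println(Arrays.toString(centredOffsets(3))); // [-1, 0, 1]
    }
}
```

For odd widths this gives the usual symmetric offsets, so the same formula covers both cases.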

However, this really is just a convention. An asymmetric convolution is very rarely used anyway, and the direction of the asymmetry (to the left or to the right) has no bearing on whether the result is "correct": the two choices produce the same output, shifted by one sample. I would imagine that most implementations behave this way so that they give comparable results for the same inputs.
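To illustrate that the choice only shifts the output, here is a rough 1D sketch. It assumes zero padding at the borders and applies the kernel without flipping it (the correlation form); `filter1D` and its `anchor` parameter are hypothetical names, where `anchor` is the kernel index aligned with the output sample.

```java
import java.util.Arrays;

public class AnchorShift {
    // Sliding-window sum with zero padding. `anchor` is the kernel index that
    // lines up with the output sample. The kernel is not flipped.
    static double[] filter1D(double[] signal, double[] kernel, int anchor) {
        double[] out = new double[signal.length];
        for (int i = 0; i < signal.length; i++) {
            double sum = 0.0;
            for (int k = 0; k < kernel.length; k++) {
                int j = i + k - anchor; // input sample under kernel tap k
                if (j >= 0 && j < signal.length) {
                    sum += kernel[k] * signal[j];
                }
            }
            out[i] = sum;
        }
        return out;
    }

    public static void main(String[] args) {
        double[] signal = {1, 2, 3, 4, 5, 6};
        double[] kernel = {1, 2, 3, 4};
        double[] extraBefore = filter1D(signal, kernel, 2); // offsets -2 -1 0 +1
        double[] extraAfter = filter1D(signal, kernel, 1);  // offsets -1 0 +1 +2
        System.out.println(Arrays.toString(extraBefore));
        System.out.println(Arrays.toString(extraAfter));
    }
}
```

With these definitions, `extraAfter[i]` equals `extraBefore[i + 1]` for every index where both exist, so the two conventions differ only in where each value is stored.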

When performing the convolution in the frequency domain, the kernel is padded to match the image size anyway, and you've already stated that you are performing the convolution in the spatial domain.

I'm much more intrigued as to why you need to use an even-sized kernel in the first place.

answered Oct 23 '22 19:10

Roger Rowland

The correct answer is to write the result pixel at the upper-left corner of the kernel window, regardless of whether your matrix is even-sized or not. Every read then lies at or ahead of the write position, so you can perform the operation in place in a simple scanline pass, and it requires no extra memory.

private static void applyBlur(int[] pixels, int stride) {
    // Horizontal pass: average each pixel with the two that follow it and
    // store the result at the leftmost position. Reads are always at or
    // ahead of the write position, so the pass works in place.
    // The window runs over the flat array, so the last two pixels of each
    // row blend with the start of the next row.
    for (int pos = 0; pos + 2 < pixels.length; pos++) {
        pixels[pos] = average3(pixels[pos], pixels[pos + 1], pixels[pos + 2]);
    }
    // Vertical pass: same idea, stepping one row (stride) at a time and
    // storing the result at the topmost position.
    for (int pos = 0; pos + 2 * stride < pixels.length; pos++) {
        pixels[pos] = average3(pixels[pos], pixels[pos + stride], pixels[pos + 2 * stride]);
    }
}

// Averages three packed RGB pixels channel by channel.
// The alpha byte is not preserved.
private static int average3(int v0, int v1, int v2) {
    int r = (((v0 >> 16) & 0xFF) + ((v1 >> 16) & 0xFF) + ((v2 >> 16) & 0xFF)) / 3;
    int g = (((v0 >> 8) & 0xFF) + ((v1 >> 8) & 0xFF) + ((v2 >> 8) & 0xFF)) / 3;
    int b = ((v0 & 0xFF) + (v1 & 0xFF) + (v2 & 0xFF)) / 3;
    return r << 16 | g << 8 | b;
}
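
The same top-left anchoring extends to a kernel of any size, even or odd. The following is only a sketch under assumptions not in the answer above: a single-channel image stored row-major as `double[]`, a non-separable kernel, and no border handling (positions where the window does not fit are left untouched). The names `filterInPlace`, `kw` and `kh` are hypothetical.

```java
import java.util.Arrays;

public class TopLeftFilter {
    // Filters a row-major grayscale image in place, storing each result at
    // the top-left pixel of its window. The kernel is applied without
    // flipping. Right and bottom margins keep their input values.
    static void filterInPlace(double[] pixels, int width, int height,
                              double[] kernel, int kw, int kh) {
        for (int y = 0; y + kh <= height; y++) {
            for (int x = 0; x + kw <= width; x++) {
                double sum = 0.0;
                for (int ky = 0; ky < kh; ky++) {
                    for (int kx = 0; kx < kw; kx++) {
                        sum += kernel[ky * kw + kx]
                             * pixels[(y + ky) * width + (x + kx)];
                    }
                }
                // Every tap read above is at or after (x, y) in raster
                // order, so no previously written result was read.
                pixels[y * width + x] = sum;
            }
        }
    }

    public static void main(String[] args) {
        double[] image = {
            1, 2, 3,
            4, 5, 6,
            7, 8, 9
        };
        double[] box2x2 = {0.25, 0.25, 0.25, 0.25}; // even-sized kernel
        filterInPlace(image, 3, 3, box2x2, 2, 2);
        System.out.println(Arrays.toString(image));
        // [3.0, 4.0, 3.0, 6.0, 7.0, 6.0, 7.0, 8.0, 9.0]
    }
}
```

Because the output is anchored at the corner rather than the centre, the result is shifted relative to a centred filter, which is harmless as long as the shift is accounted for when the image is used afterwards.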

answered Oct 23 '22 19:10

Tatarize
