I am trying to replicate the outcome of this link using linear convolution in spatial-domain. Images are first converted to 2d <code>double</code> arrays and then convolved. Image and kernel are of the same size. The image is padded before convolution and cropped accordingly after the convolution. <img src="https://i.stack.imgur.com/Nn5I8.png" alt="enter image description here"> As compared to the FFT-based convolution, the output is weird and incorrect. How can I solve the issue? Note that I obtained the following image output from Matlab which matches my C# FFT output: <img src="https://i.stack.imgur.com/7LltS.jpg" alt="enter image description here"> . Update-1: Following @Ben Voigt's comment, I changed the <code>Rescale()</code> function to replace <code>255.0</code> with <code>1</code> and thus the output is improved substantially. But, still, the output doesn't match the FFT output (which is the correct one). <img src="https://i.stack.imgur.com/zRMTf.png" alt="enter image description here"> . Update-2: Following @Cris Luengo's comment, I have padded the image by stitching and then performed spatial convolution. The outcome has been as follows: <img src="https://i.stack.imgur.com/wMGnQ.png" alt="enter image description here"> So, the output is worse than the previous one. But, this has a similarity with the 2nd output of the linked answer which means a circular convolution is not the solution. . Update-3: I have used the <code>Sum()</code> function proposed by @Cris Luengo's answer. The result is a more improved version of <code>**Update-1**</code>: <img src="https://i.stack.imgur.com/aqneV.png" alt="enter image description here"> But, it is still not 100% similar to the FFT version. . Update-4: Following @Cris Luengo's comment, I have subtracted the two outcomes to see the difference: <img src="https://i.stack.imgur.com/o2zSz.png" alt="enter image description here">, <img src="https://i.stack.imgur.com/u7XkX.png" alt="enter image description here"> 1. spatial minus frequency domain 2. frequency minus spatial domain Looks like, the difference is substantial which means, spatial convolution is not being done correctly. . Source Code: (Notify me if you need more source code to see.) <pre class="prettyprint"><code> public static double[,] LinearConvolutionSpatial(double[,] image, double[,] mask) { int maskWidth = mask.GetLength(0); int maskHeight = mask.GetLength(1); double[,] paddedImage = ImagePadder.Pad(image, maskWidth); double[,] conv = Convolution.ConvolutionSpatial(paddedImage, mask); int cropSize = (maskWidth/2); double[,] cropped = ImageCropper.Crop(conv, cropSize); return conv; } static double[,] ConvolutionSpatial(double[,] paddedImage1, double[,] mask1) { int imageWidth = paddedImage1.GetLength(0); int imageHeight = paddedImage1.GetLength(1); int maskWidth = mask1.GetLength(0); int maskHeight = mask1.GetLength(1); int convWidth = imageWidth - ((maskWidth / 2) * 2); int convHeight = imageHeight - ((maskHeight / 2) * 2); double[,] convolve = new double[convWidth, convHeight]; for (int y = 0; y < convHeight; y++) { for (int x = 0; x < convWidth; x++) { int startX = x; int startY = y; convolve[x, y] = Sum(paddedImage1, mask1, startX, startY); } } Rescale(convolve); return convolve; } static double Sum(double[,] paddedImage1, double[,] mask1, int startX, int startY) { double sum = 0; int maskWidth = mask1.GetLength(0); int maskHeight = mask1.GetLength(1); for (int y = startY; y < (startY + maskHeight); y++) { for (int x = startX; x < (startX + maskWidth); x++) { double img = paddedImage1[x, y]; double msk = mask1[x - startX, y - startY]; sum = sum + (img * msk); } } return sum; } static void Rescale(double[,] convolve) { int imageWidth = convolve.GetLength(0); int imageHeight = convolve.GetLength(1); double maxAmp = 0.0; for (int j = 0; j < imageHeight; j++) { for (int i = 0; i < imageWidth; i++) { maxAmp = Math.Max(maxAmp, convolve[i, j]); } } double scale = 1.0 / maxAmp; for (int j = 0; j < imageHeight; j++) { for (int i = 0; i < imageWidth; i++) { double d = convolve[i, j] * scale; convolve[i, j] = d; } } } public static Bitmap ConvolveInFrequencyDomain(Bitmap image1, Bitmap kernel1) { Bitmap outcome = null; Bitmap image = (Bitmap)image1.Clone(); Bitmap kernel = (Bitmap)kernel1.Clone(); //linear convolution: sum. //circular convolution: max uint paddedWidth = Tools.ToNextPow2((uint)(image.Width + kernel.Width)); uint paddedHeight = Tools.ToNextPow2((uint)(image.Height + kernel.Height)); Bitmap paddedImage = ImagePadder.Pad(image, (int)paddedWidth, (int)paddedHeight); Bitmap paddedKernel = ImagePadder.Pad(kernel, (int)paddedWidth, (int)paddedHeight); Complex[,] cpxImage = ImageDataConverter.ToComplex(paddedImage); Complex[,] cpxKernel = ImageDataConverter.ToComplex(paddedKernel); // call the complex function Complex[,] convolve = Convolve(cpxImage, cpxKernel); outcome = ImageDataConverter.ToBitmap(convolve); outcome = ImageCropper.Crop(outcome, (kernel.Width/2)+1); return outcome; } </code></pre>

Your current output looks more like the auto-correlation function than the convolution of Lena with herself. I think the issue might be in your <code>Sum</code> function. If you look at the definition of the convolution sum, you'll see that the kernel (or the image, doesn't matter) is mirrored: <pre class="prettyprint"><code>sum_m( f[n-m] g[m] ) </code></pre> For the one function, <code>m</code> appears with a plus sign, and for the other it appears with a minus sign. You'll need to modify your <code>Sum</code> function to read the <code>mask1</code> image in the right order: <pre class="prettyprint"><code>static double Sum(double[,] paddedImage1, double[,] mask1, int startX, int startY) { double sum = 0; int maskWidth = mask1.GetLength(0); int maskHeight = mask1.GetLength(1); for (int y = startY; y < (startY + maskHeight); y++) { for (int x = startX; x < (startX + maskWidth); x++) { double img = paddedImage1[x, y]; double msk = mask1[maskWidth - x + startX - 1, maskHeight - y + startY - 1]; sum = sum + (img * msk); } } return sum; } </code></pre> The other option is to pass a mirrored version of <code>mask1</code> to this function.

Image convolution in spatial domain

Tags:

c#

image-processing

convolution

I am trying to replicate the outcome of this link using linear convolution in spatial-domain.

Images are first converted to 2d double arrays and then convolved. Image and kernel are of the same size. The image is padded before convolution and cropped accordingly after the convolution.

enter image description here

As compared to the FFT-based convolution, the output is weird and incorrect.

How can I solve the issue?

Note that I obtained the following image output from Matlab which matches my C# FFT output:

enter image description here

^{Update-1: Following @Ben Voigt's comment, I changed the Rescale() function to replace 255.0 with 1 and thus the output is improved substantially. But, still, the output doesn't match the FFT output (which is the correct one).}
enter image description here

^{Update-2: Following @Cris Luengo's comment, I have padded the image by stitching and then performed spatial convolution. The outcome has been as follows:}
enter image description here

^{So, the output is worse than the previous one. But, this has a similarity with the 2nd output of the linked answer which means a circular convolution is not the solution.}

^{Update-3: I have used the Sum() function proposed by @Cris Luengo's answer. The result is a more improved version of **Update-1**:}
enter image description here

^{But, it is still not 100% similar to the FFT version.}

^{Update-4: Following @Cris Luengo's comment, I have subtracted the two outcomes to see the difference:}
enter image description here ,

^{1. spatial minus frequency domain

2. frequency minus spatial domain}

_{Looks like, the difference is substantial which means, spatial convolution is not being done correctly.}

Source Code:

^{(Notify me if you need more source code to see.)}

Click to copy

    public static double[,] LinearConvolutionSpatial(double[,] image, double[,] mask)
    {
        int maskWidth = mask.GetLength(0);
        int maskHeight = mask.GetLength(1);

        double[,] paddedImage = ImagePadder.Pad(image, maskWidth);

        double[,] conv = Convolution.ConvolutionSpatial(paddedImage, mask);

        int cropSize = (maskWidth/2);

        double[,] cropped = ImageCropper.Crop(conv, cropSize);

        return conv;
    } 
    static double[,] ConvolutionSpatial(double[,] paddedImage1, double[,] mask1)
    {
        int imageWidth = paddedImage1.GetLength(0);
        int imageHeight = paddedImage1.GetLength(1);

        int maskWidth = mask1.GetLength(0);
        int maskHeight = mask1.GetLength(1);

        int convWidth = imageWidth - ((maskWidth / 2) * 2);
        int convHeight = imageHeight - ((maskHeight / 2) * 2);

        double[,] convolve = new double[convWidth, convHeight];

        for (int y = 0; y < convHeight; y++)
        {
            for (int x = 0; x < convWidth; x++)
            {
                int startX = x;
                int startY = y;

                convolve[x, y] = Sum(paddedImage1, mask1, startX, startY);
            }
        }

        Rescale(convolve);

        return convolve;
    } 

    static double Sum(double[,] paddedImage1, double[,] mask1, int startX, int startY)
    {
        double sum = 0;

        int maskWidth = mask1.GetLength(0);
        int maskHeight = mask1.GetLength(1);

        for (int y = startY; y < (startY + maskHeight); y++)
        {
            for (int x = startX; x < (startX + maskWidth); x++)
            {
                double img = paddedImage1[x, y];
                double msk = mask1[x - startX, y - startY];
                sum = sum + (img * msk);
            }
        }

        return sum;
    }

    static void Rescale(double[,] convolve)
    {
        int imageWidth = convolve.GetLength(0);
        int imageHeight = convolve.GetLength(1);

        double maxAmp = 0.0;

        for (int j = 0; j < imageHeight; j++)
        {
            for (int i = 0; i < imageWidth; i++)
            {
                maxAmp = Math.Max(maxAmp, convolve[i, j]);
            }
        }

        double scale = 1.0 / maxAmp;

        for (int j = 0; j < imageHeight; j++)
        {
            for (int i = 0; i < imageWidth; i++)
            {
                double d = convolve[i, j] * scale;
                convolve[i, j] = d;
            }
        }
    } 

    public static Bitmap ConvolveInFrequencyDomain(Bitmap image1, Bitmap kernel1)
    {
        Bitmap outcome = null;

        Bitmap image = (Bitmap)image1.Clone();
        Bitmap kernel = (Bitmap)kernel1.Clone();

        //linear convolution: sum. 
        //circular convolution: max
        uint paddedWidth = Tools.ToNextPow2((uint)(image.Width + kernel.Width));
        uint paddedHeight = Tools.ToNextPow2((uint)(image.Height + kernel.Height));

        Bitmap paddedImage = ImagePadder.Pad(image, (int)paddedWidth, (int)paddedHeight);
        Bitmap paddedKernel = ImagePadder.Pad(kernel, (int)paddedWidth, (int)paddedHeight);

        Complex[,] cpxImage = ImageDataConverter.ToComplex(paddedImage);
        Complex[,] cpxKernel = ImageDataConverter.ToComplex(paddedKernel);

        // call the complex function
        Complex[,] convolve = Convolve(cpxImage, cpxKernel);

        outcome = ImageDataConverter.ToBitmap(convolve);

        outcome = ImageCropper.Crop(outcome, (kernel.Width/2)+1);

        return outcome;
    }

479

asked Jul 10 '18 10:07

user366312

2 Answers

Your current output looks more like the auto-correlation function than the convolution of Lena with herself. I think the issue might be in your Sum function.

If you look at the definition of the convolution sum, you'll see that the kernel (or the image, doesn't matter) is mirrored:

Click to copy

sum_m( f[n-m] g[m] )

For the one function, m appears with a plus sign, and for the other it appears with a minus sign.

You'll need to modify your Sum function to read the mask1 image in the right order:

Click to copy

static double Sum(double[,] paddedImage1, double[,] mask1, int startX, int startY)
{
    double sum = 0;

    int maskWidth = mask1.GetLength(0);
    int maskHeight = mask1.GetLength(1);

    for (int y = startY; y < (startY + maskHeight); y++)
    {
        for (int x = startX; x < (startX + maskWidth); x++)
        {
            double img = paddedImage1[x, y];
            double msk = mask1[maskWidth - x + startX - 1, maskHeight - y + startY - 1];
            sum = sum + (img * msk);
        }
    }

    return sum;
}

The other option is to pass a mirrored version of mask1 to this function.

171

answered Oct 07 '22 13:10

Cris Luengo

I have found the solution from this link. The main clue was to introduce an offset and a factor.

factor is the sum of all values in the kernel.
offset is an arbitrary value to fix the output further.

@Cris Luengo's answer also raised a valid point.

The following source code is supplied in the given link:

Click to copy

    private void SafeImageConvolution(Bitmap image, ConvMatrix fmat) 
    { 
        //Avoid division by 0 
        if (fmat.Factor == 0) 
            return; 

        Bitmap srcImage = (Bitmap)image.Clone(); 

        int x, y, filterx, filtery; 
        int s = fmat.Size / 2; 
        int r, g, b; 
        Color tempPix; 

        for (y = s; y < srcImage.Height - s; y++) 
        { 
            for (x = s; x < srcImage.Width - s; x++) 
            { 
                r = g = b = 0; 

                // Convolution 
                for (filtery = 0; filtery < fmat.Size; filtery++) 
                { 
                    for (filterx = 0; filterx < fmat.Size; filterx++) 
                    { 
                        tempPix = srcImage.GetPixel(x + filterx - s, y + filtery - s); 

                        r += fmat.Matrix[filtery, filterx] * tempPix.R; 
                        g += fmat.Matrix[filtery, filterx] * tempPix.G; 
                        b += fmat.Matrix[filtery, filterx] * tempPix.B; 
                    } 
                } 

                r = Math.Min(Math.Max((r / fmat.Factor) + fmat.Offset, 0), 255); 
                g = Math.Min(Math.Max((g / fmat.Factor) + fmat.Offset, 0), 255); 
                b = Math.Min(Math.Max((b / fmat.Factor) + fmat.Offset, 0), 255); 

                image.SetPixel(x, y, Color.FromArgb(r, g, b)); 
            } 
        } 
    }

answered Oct 07 '22 13:10

user366312

Related questions
                            
                                "skipped loading symbols for ngen binary" for C# dll
                            
                                Async-Await vs ThreadPool vs MultiThreading on High-Performance Sockets (C10k Solutions?)
                            
                                FileLoadException At InitializeComponent or x:Class=
                            
                                Are Stream.ReadAsync and Stream.WriteAsync supposed to alter the cursor position synchronously before returning or after the operation completes?
                            
                                Exceptions when rolling back a transaction - connection already closed?
                            
                                Is it possible to make separate dlls with MVC project?
                            
                                Newtonsoft.JSON v9.01 + FileNotFoundException (.NET Core Class library)
                            
                                Stream audio from PC to smartphones?
                            
                                ConfuserEx: System.TypeInitializationException on Mono
                            
                                Unit testing a Web API controller
                            
                                How to use APNs Auth Key (.p8 file) in C#?
                            
                                DbUpdateException: Which field is causing "String or binary data would be truncated"
                            
                                AutoMapper define mapping level
                            
                                OneSignal: How to Handle notificationOpened in AppDelegate of a Xamarin.Forms app?
                            
                                ASP NET Core 2 with Full Framework
                            
                                Is specifying the listening HTTP port via UseUrls the correct way?
                            
                                Application icon is blank when started from Process.Start
                            
                                What exactly happens call async method without await keyword?
                            
                                Visual Studio shows 'Configure settings to improve performance' notification for ReSharper
                            
                                VSTO custom taskpane on multi DPI system shows content twice

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Image convolution in spatial domain

Tags:

c#

image-processing

convolution

user366312

People also ask

2 Answers

Cris Luengo

user366312

Recent Activity

Donate For Us