So I am aware that a convolution by FFT has a lower computational complexity than a convolution in real space. But what are the downsides of an FFT convolution? Does the kernel size always have to match the image size, or are there functions that take care of this, for example in pythons numpy and scipy packages? And what about anti-aliasing effects?

FFT convolutions are based on the convolution theorem, which states that given two functions <code>f</code> and <code>g</code>, if <code>Fd()</code> and <code>Fi()</code> denote the direct and inverse Fourier transform, and <code>*</code> and <code>.</code> convolution and multiplication, then: <pre class="prettyprint"><code>f*g = Fi(Fd(d).Fd(g)) </code></pre> To apply this to a signal <code>f</code> and a kernel <code>g</code>, there are some things you need to take care of: <ul> <li> <code>f</code> and <code>g</code> have to be of the same size for the multiplication step to be possible, so you need to zero-pad the kernel (or input, if the kernel is longer than it).</li> <li>When doing a DFT, which is what FFT does, the resulting frequency domain representation of the function is periodic. This means that, by default, your kernel wraps around the edge when doing the convolution. If you want this, then all is great. But if not, you have to add an extra zero-padding the size of the kernel to avoid it.</li> <li>Most (all?) FFT packages only work well (performance-wise) with sizes that do not have any large prime factors. Rounding the signal and kernel size up to the next power of two is a common practice that may result in a (very) significant speed-up.</li> </ul> If your signal and kernel sizes are <code>f_l</code> and <code>g_l</code>, doing a straightforward convolution in time domain requires <code>g_l * (f_l - g_l + 1)</code> multiplications and <code>(g_l - 1) * (f_l - g_l + 1)</code> additions. For the FFT approach, you have to do 3 FFTs of size at least <code>f_l + g_l</code>, as well as <code>f_l + g_l</code> multiplications. For large sizes of both <code>f</code> and <code>g</code>, the FFT is clearly superior with its <code>n*log(n)</code> complexity. For small kernels, the direct approach may be faster. <code>scipy.signal</code> has both <code>convolve</code> and <code>fftconvolve</code> methods for you to play around. And <code>fftconvolve</code> handles all the padding described above transparently for you.

What are the downsides of convolution by FFT compared to realspace convolution?

1 Answers

FFT convolutions are based on the convolution theorem, which states that given two functions f and g, if Fd() and Fi() denote the direct and inverse Fourier transform, and * and . convolution and multiplication, then:

f*g = Fi(Fd(d).Fd(g))

To apply this to a signal f and a kernel g, there are some things you need to take care of:

f and g have to be of the same size for the multiplication step to be possible, so you need to zero-pad the kernel (or input, if the kernel is longer than it).
When doing a DFT, which is what FFT does, the resulting frequency domain representation of the function is periodic. This means that, by default, your kernel wraps around the edge when doing the convolution. If you want this, then all is great. But if not, you have to add an extra zero-padding the size of the kernel to avoid it.
Most (all?) FFT packages only work well (performance-wise) with sizes that do not have any large prime factors. Rounding the signal and kernel size up to the next power of two is a common practice that may result in a (very) significant speed-up.

If your signal and kernel sizes are f_l and g_l, doing a straightforward convolution in time domain requires g_l * (f_l - g_l + 1) multiplications and (g_l - 1) * (f_l - g_l + 1) additions.

For the FFT approach, you have to do 3 FFTs of size at least f_l + g_l, as well as f_l + g_l multiplications.

For large sizes of both f and g, the FFT is clearly superior with its n*log(n) complexity. For small kernels, the direct approach may be faster.

scipy.signal has both convolve and fftconvolve methods for you to play around. And fftconvolve handles all the padding described above transparently for you.

109

answered Sep 27 '22 22:09

Jaime

Related questions
                            
                                Directly export a query to CSV using SQL Developer
                            
                                How to set global environment variables for PHP
                            
                                Disable dropup feature using Bootstrap Select
                            
                                Does SASS support adding !important to all properties in a mixin?
                            
                                In App Purchaes with Android Studio unable to find IInAppBillingService
                            
                                django admin inlines: get object from formfield_for_foreignkey
                            
                                Is it possible to load doctrine fixtures without deleting database
                            
                                Is qDebug() thread-safe?
                            
                                lambda expression join multiple tables with select and where clause
                            
                                Run exe file with parameters in a batch file
                            
                                Openpyxl 1.8.5: Reading the result of a formula typed in a cell using openpyxl
                            
                                Incomprehensible function signature - Return reference to an array of N objects

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What are the downsides of convolution by FFT compared to realspace convolution?

Tags:

ABDreverhaven

People also ask

1 Answers

Jaime

Recent Activity

Donate For Us