Whenever I'm plotting the values obtained by a programme using the cuFFT and comparing the results with that of Matlab, I'm getting the same shape of graphs and the values of maxima and minima are getting at the same points. However, the values resulting by the cuFFT are much greater than those resulting from Matlab. The Matlab code is <pre class="prettyprint"><code>fs = 1000; % sample freq D = [0:1:4]'; % pulse delay times t = 0 : 1/fs : 4000/fs; % signal evaluation time w = 0.5; % width of each pulse yp = pulstran(t,D,'rectpuls',w); filt = conj(fliplr(yp)); xx = fft(yp,1024).*fft(filt,1024); xx = (abs(ifft(xx))); </code></pre> and the CUDA code with the same input is like: <pre class="prettyprint"><code>cufftExecC2C(plan, (cufftComplex *)d_signal, (cufftComplex *)d_signal, CUFFT_FORWARD); cufftExecC2C(plan, (cufftComplex *)d_filter_signal, (cufftComplex *)d_filter_signal, CUFFT_FORWARD); ComplexPointwiseMul<<<blocksPerGrid, threadsPerBlock>>>(d_signal, d_filter_signal, NX); cufftExecC2C(plan, (cufftComplex *)d_signal, (cufftComplex *)d_signal, CUFFT_INVERSE); </code></pre> The cuFFT performs also a <code>1024</code> points FFT with batch size of <code>2</code>. With the scaling factor of <code>NX=1024</code>, the values are not coming correct. Please tell what to do.

This is a late answer to remove this question from the unanswered list. You are not giving enough information to diagnose your problem, since you are missing to specify the way you are setting up the cuFFT plan. You are even not specifying whether you have exactly the same shape for the Matlab's and cuFFT's signals (so you have just a scaling) or you have approximately the same shape. However, let me make the following two observations: <ol> <li>The <code>yp</code> vector has <code>4000</code> elements; opposite to thatm by <code>fft(yp,1024)</code>, you are performing an FFT by truncating the signal to <code>1024</code> elements;</li> <li>The inverse cuFFT does not perform the scaling by the number of vector elements.</li> </ol> For the sake of convenience (it could be useful to other users), I'm reporting below a simple FFT-IFFT scheme which includes also the scaling performed by using the CUDA Thrust library. <pre class="prettyprint"><code>#include <cufft.h> #include <thrust/host_vector.h> #include <thrust/device_vector.h> /*********************/ /* SCALE BY CONSTANT */ /*********************/ class Scale_by_constant { private: float c_; public: Scale_by_constant(float c) { c_ = c; }; __host__ __device__ float2 operator()(float2 &a) const { float2 output; output.x = a.x / c_; output.y = a.y / c_; return output; } }; int main(void){ const int N=4; // --- Setting up input device vector thrust::device_vector<float2> d_vec(N,make_cuComplex(1.f,2.f)); cufftHandle plan; cufftPlan1d(&plan, N, CUFFT_C2C, 1); // --- Perform in-place direct Fourier transform cufftExecC2C(plan, thrust::raw_pointer_cast(d_vec.data()),thrust::raw_pointer_cast(d_vec.data()), CUFFT_FORWARD); // --- Perform in-place inverse Fourier transform cufftExecC2C(plan, thrust::raw_pointer_cast(d_vec.data()),thrust::raw_pointer_cast(d_vec.data()), CUFFT_INVERSE); thrust::transform(d_vec.begin(), d_vec.end(), d_vec.begin(), Scale_by_constant((float)(N))); // --- Setting up output host vector thrust::host_vector<float2> h_vec(d_vec); for (int i=0; i<N; i++) printf("Element #%i; Real part = %f; Imaginary part: %f\n",i,h_vec[i].x,h_vec[i].y); getchar(); } </code></pre>

Scaling in inverse FFT by cuFFT

Tags:

cuda

matlab

scaling

fft

cufft

Whenever I'm plotting the values obtained by a programme using the cuFFT and comparing the results with that of Matlab, I'm getting the same shape of graphs and the values of maxima and minima are getting at the same points. However, the values resulting by the cuFFT are much greater than those resulting from Matlab. The Matlab code is

fs = 1000;                              % sample freq
D = [0:1:4]';                           % pulse delay times
t = 0 : 1/fs : 4000/fs;                 % signal evaluation time
w = 0.5;                                % width of each pulse
yp = pulstran(t,D,'rectpuls',w);
filt = conj(fliplr(yp));
xx = fft(yp,1024).*fft(filt,1024);
xx = (abs(ifft(xx)));

and the CUDA code with the same input is like:

cufftExecC2C(plan, (cufftComplex *)d_signal, (cufftComplex *)d_signal, CUFFT_FORWARD);
cufftExecC2C(plan, (cufftComplex *)d_filter_signal, (cufftComplex *)d_filter_signal,     CUFFT_FORWARD);
ComplexPointwiseMul<<<blocksPerGrid, threadsPerBlock>>>(d_signal, d_filter_signal, NX);
cufftExecC2C(plan, (cufftComplex *)d_signal, (cufftComplex *)d_signal, CUFFT_INVERSE);

The cuFFT performs also a 1024 points FFT with batch size of 2.

With the scaling factor of NX=1024, the values are not coming correct. Please tell what to do.

451

asked Jan 21 '13 14:01

Ani

1 Answers

This is a late answer to remove this question from the unanswered list.

You are not giving enough information to diagnose your problem, since you are missing to specify the way you are setting up the cuFFT plan. You are even not specifying whether you have exactly the same shape for the Matlab's and cuFFT's signals (so you have just a scaling) or you have approximately the same shape. However, let me make the following two observations:

The yp vector has 4000 elements; opposite to thatm by fft(yp,1024), you are performing an FFT by truncating the signal to 1024 elements;
The inverse cuFFT does not perform the scaling by the number of vector elements.

For the sake of convenience (it could be useful to other users), I'm reporting below a simple FFT-IFFT scheme which includes also the scaling performed by using the CUDA Thrust library.

#include <cufft.h>
#include <thrust/host_vector.h>
#include <thrust/device_vector.h>

/*********************/
/* SCALE BY CONSTANT */
/*********************/
class Scale_by_constant
{
    private:
        float c_;

    public:
        Scale_by_constant(float c) { c_ = c; };

        __host__ __device__ float2 operator()(float2 &a) const
        {
            float2 output;

            output.x = a.x / c_;
            output.y = a.y / c_;

            return output;
        }

};

int main(void){

    const int N=4;

    // --- Setting up input device vector    
    thrust::device_vector<float2> d_vec(N,make_cuComplex(1.f,2.f));

    cufftHandle plan;
    cufftPlan1d(&plan, N, CUFFT_C2C, 1);

    // --- Perform in-place direct Fourier transform
    cufftExecC2C(plan, thrust::raw_pointer_cast(d_vec.data()),thrust::raw_pointer_cast(d_vec.data()), CUFFT_FORWARD);

    // --- Perform in-place inverse Fourier transform
    cufftExecC2C(plan, thrust::raw_pointer_cast(d_vec.data()),thrust::raw_pointer_cast(d_vec.data()), CUFFT_INVERSE);

    thrust::transform(d_vec.begin(), d_vec.end(), d_vec.begin(), Scale_by_constant((float)(N)));

    // --- Setting up output host vector    
    thrust::host_vector<float2> h_vec(d_vec);

    for (int i=0; i<N; i++) printf("Element #%i; Real part = %f; Imaginary part: %f\n",i,h_vec[i].x,h_vec[i].y);

    getchar();
}

answered Oct 20 '22 14:10

Vitality

Related questions
                            
                                Digital image processing with MATLAB using 3 techniques
                            
                                Matlab - usage of workspace variables
                            
                                Filter 'rows' in a Matlab structure
                            
                                Passing pointer argument in MATLAB to a C-DLL function foo(char**)
                            
                                Compare two vectors of unequal lengths to get a logical array
                            
                                MATLAB: Combinations of an arbitrary number of cell arrays
                            
                                sort columns in Matlab
                            
                                Putting certain tick labels in boldface (but not all of them)?
                            
                                How can I throw an exception in Matlab?
                            
                                MATLAB search cell array for string subset
                            
                                Matlab parallel computing toolbox, dynamic allocation of work in parfor loops
                            
                                Numbers smaller than realmin
                            
                                About finding pupil in a video
                            
                                Matlab VS Python - eig(A,B) VS sc.linalg.eig(A,B)
                            
                                Matlab: Stacking of various plots
                            
                                Calculating MD5 Hash (RFC 1321 conform) in Matlab via Java
                            
                                Getting stuck on Matlab's subplot mechanism for matching images' points for vlfeat
                            
                                Matlab equivalent to calling inside static class
                            
                                How can I plot from a plot handler?
                            
                                Non-uniform axis of imagesc() in Matlab

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With