CUDA FFT vs Matlab FFT CUDA FFT Library

bluestorm · November 5, 2009, 6:42pm

Hi!

I hope someone can help me with a problem I am having.

I am trying to do 1D FFT in a 1024*1000 array (one column at a time). I am trying to move my code from Matlab to CUDA. The Matlab fft() function does 1dFFT on the columns and it gives me a different answer that CUDA FFT and I am not sure why…I have tried all I can think off but it still does the same… :wacko:

Is the CUDA FFT library different? Is this result expected?

My code is here:

[codebox]

#define ROWS 1024

#define COLUMNS 1000

// CUFFT plan

cufftHandle plan;

cufftSafeCall(cufftPlan1d(&plan,ROWS,CUFFT_R2C,COLUMNS));

cufftSafeCall(cufftExecR2C(plan, (cufftReal *)d_image_buff, (cufftComplex *)d_result_buff));

[/codebox]

where the d_image_buff contains the 1024*1000 elements array. Is this the way I should be using the library?

Any help is greatly appreciated!

Thanks!!

mfatica · November 5, 2009, 7:11pm

Matlab and CUFFT use two different formats for complex arrays.
In Matlab, you have all the real components, followed by the imaginary components.
On CUFFT they are interleaved. You will need to shuffle them.

bluestorm · November 5, 2009, 9:09pm

Thanks a lot for your help.

I found out a small bad assumption I was making. I was indeed using the cufftComplex data types to take care of the interleaved data.

The problem was more in the sense that the Matlab FFT returns a 1024 array out of a 1024 point FFT which is rather interesting…as far as I understand we should get only half the size (meaning 512 points out of an 1024 point FFT). CUDA was indeed doing this correctly but I was expecting the 1024 points and hence the data won’t match. CUDA returns 512 out of a 1024 point FFT as it should be. I still get slightly different results (e-4 order) but I guess htis is related to the single point precision of CUDA vs the double point procesion of Matlab. I will try with the double point precision libraries to see what I get

Thanks for your help again!

mfatica · November 6, 2009, 5:26am

The transform of 1024 real elements will be 513 complex elements ( N/2 +1).

Topic		Replies	Views
CUDA FFT different from Matlab FFT CUDA Programming and Performance	32	9581	March 29, 2011
Matlab FFT vs CUDA FFT GPU-Accelerated Libraries	1	1352	July 6, 2017
CUFFT Library GPU-Accelerated Libraries	1	585	October 4, 2017
Can somebody explain this? (cufft strange result!) CUDA Programming and Performance	3	2101	September 18, 2009
Cuda fft vs. Matlab fft fft CUDA Programming and Performance	4	3310	September 1, 2010
Strange results with CUFFT 3D CUDA Programming and Performance	1	5000	July 2, 2009
CUFFT gives wrong results? the results from MATLAB and CUFFT differ... CUDA Programming and Performance	5	9609	June 15, 2009
Differences between cufft and matlab fft? CUDA Programming and Performance	2	2566	July 6, 2009
Differences between cufft plan2d matlab fft2? CUDA Programming and Performance	2	3014	July 20, 2009
3D FFT C2C result is different with matlab fftn GPU-Accelerated Libraries	2	543	October 23, 2019

CUDA FFT vs Matlab FFT CUDA FFT Library

Related topics