Different results between FFTW and CUFFT

dcd16043 · April 9, 2010, 7:19pm

Hello.

I’m replacing FFTW3 for CUFFT and I get different results with floats.

Plans:

[codebox]

// p = fftwf_plan_dft_r2c_3d(global_grid_size,global_grid_size,glob

al_grid_size,static_grid, (fftwf_complex *)static_grid, FFTW_MEASURE);

cufftPlan3d (&p_cufft, global_grid_size, global_grid_size, global_grid_size, CUFFT_R2C);

// pinv = fftwf_plan_dft_c2r_3d(global_grid_size,global_grid_size,glob

al_grid_size,multiple_fsg, (fftwf_real *)multiple_fsg, FFTW_MEASURE);

cufftPlan3d (&pinv_cufft, global_grid_size, global_grid_size, global_grid_size, CUFFT_C2R);

[/codebox]

and the FT:

[codebox]

// fftwf_execute_dft_r2c(p,static_grid,(fftwf_complex *)static_grid);

CHECK_CUDA(cudaMemcpy(static_grid_d, static_grid, sizeof_grid, cudaMemcpyHostToDevice));

cufftExecR2C( p_cufft, static_grid_d, (cufftComplex * )static_grid_d );

CHECK_CUDA(cudaMemcpy(static_grid, static_grid_d, sizeof_grid, cudaMemcpyDeviceToHost));

[/codebox]

The results start to be slightly different but the error is bigger in successive iterations.

Any help?

Thank you

y09 · April 10, 2010, 1:36am

I haven’t used CUFFT since 2.3, so I don’t know anything about 3.0, but back then CUFFT implemented no appropriate FFT routines for data sizes with large prime factors but used direct DFTs instead whose error is a lot worse. If at all possible, try to use power-of-two data sizes or sizes with small prime factors (2,3,5) - for those, CUFFT results should be reasonably accurate.

dcd16043 · April 10, 2010, 8:52am

Hello,

global_grid_size is 128 so I suppose CUFFT is using a FFT routine, isn’t it?

Thank you

y09 · April 10, 2010, 6:33pm

Yes, it should. How big exactly is your error (L1/relative)?

dcd16043 · April 10, 2010, 6:57pm

Well, here we have some values using “fftwf_execute_dft_r2c” and “cufftExecR2C” respectively, where input is a 3D array initialized to 0.0f:

CPU:
-168608.00000000000000000000000000000000000000000000000000
0.00000000000000000000000000000000000000000000000000
129608.38281250000000000000000000000000000000000000000000
4217.92529296875000000000000000000000000000000000000000
-47863.76171875000000000000000000000000000000000000000000
-5714.29687500000000000000000000000000000000000000000000
-10428.89746093750000000000000000000000000000000000000000
2505.26733398437500000000000000000000000000000000000000
17181.33984375000000000000000000000000000000000000000000
4267.99316406250000000000000000000000000000000000000000
1140.93835449218750000000000000000000000000000000000000

GPU:
-168608.00000000000000000000000000000000000000000000000000
0.00000000000000000000000000000000000000000000000000
129608.35937500000000000000000000000000000000000000000000
4217.91015625000000000000000000000000000000000000000000
-47863.75781250000000000000000000000000000000000000000000
-5714.28222656250000000000000000000000000000000000000000
-10428.90234375000000000000000000000000000000000000000000
2505.25830078125000000000000000000000000000000000000000
17181.34375000000000000000000000000000000000000000000000
4267.98291015625000000000000000000000000000000000000000
1140.94860839843750000000000000000000000000000000000000

What do you think?
Thanks

dcd16043 · April 15, 2010, 12:19pm

Sorry, I edited my last post instead of writing a new one.

dcd16043 · April 15, 2010, 12:19pm

Sorry, I edited my last post instead of writing a new one.

Topic		Replies	Views
CUFFT run wrong CUDA Programming and Performance	16	2981	May 23, 2013
complex cuFFT fails for length 59200 cuFFT bug for cuda 3.0 or greater CUDA Programming and Performance	6	5093	August 27, 2010
CUFFT appears to give errors for vectors > 1024 CUDA Programming and Performance	6	8859	April 12, 2007
cufftExecC2C incorrect for certain FFT sizes CUDA Programming and Performance	5	3761	February 4, 2012
result of cufft is different with fftw CUDA Programming and Performance	0	541	June 1, 2014
CUFFT and FFTW Numeric Accuracy CUDA Programming and Performance	9	20558	May 28, 2009
3D CUFFT strange effect on volume dimensions 3D CUFFT strange effect on volume dimens CUDA Programming and Performance	1	2638	April 25, 2008
cufft doubt comparing r2c and c2c 2D FFTs CUDA Programming and Performance	28	13750	October 27, 2010
Performance of CuFFT 3.1 library CUDA Programming and Performance	0	3296	July 8, 2011
CUFFT_INTERNAL_ERROR during creation of a 1D Plan in CUFFT GPU-Accelerated Libraries cuda , cufft	11	4133	October 19, 2022

Different results between FFTW and CUFFT

Related topics