cuFFTDx: inverse FFT behaves like forward FFT

Hi,
I’m using cuFFTDx to perform a convolution.
I got some unreasonable results, so I tried to figure out where the problem comes from.
I commented out part of the code, simplified the process, and found the problem:
I have a data vector of 1024 complex floating-point elements.
I filled the vector with the same number, -40 + 0j, so I have 1024 copies of the same complex number.
After executing the forward FFT with cuFFTDx, I printed the data and found it is all zeros except the first element, which is -40960 + 0j, as expected.
But after executing the IFFT, I printed the data and found every element is -40960 + 0j instead of -40 + 0j as expected.
This looks like the result of applying the forward FFT twice instead of an FFT followed by an IFFT.
What can I do? Am I missing something?

Thank you in advance,
Ori

My code looks like this:

// Host Code:

static constexpr unsigned int fft_size1      = 1024; 

using FFT_base     = decltype(Block() + Size<fft_size1>() + Type<fft_type::c2c>() + Precision<float>() +
		                              ElementsPerThread<2>() + FFTsPerBlock<1>() + SM<750>());
using FFT          = decltype(FFT_base() + Direction<fft_direction::forward>());
using IFFT         = decltype(FFT_base() + Direction<fft_direction::inverse>());

cudaFuncSetAttribute(
    my_kernel<FFT, IFFT>,
    cudaFuncAttributeMaxDynamicSharedMemorySize,
    FFT::shared_memory_size);

my_kernel<FFT, IFFT><<<GridSizeKernel, FFT::block_dim, FFT::shared_memory_size>>>(data);

// Device Code:

template<class FFT, class IFFT>
__launch_bounds__(FFT::max_threads_per_block)
__global__ void my_kernel(complex *data){

   extern __shared__ complex shared_mem[];

   // load data into shared memory

   FFT().execute(shared_mem);
   __syncthreads();
   // print data

   IFFT().execute(shared_mem);
   __syncthreads();
   // print data
}




Now I see: maybe the IFFT in cuFFTDx is defined without the 1/N factor?

Correct. cuFFT doesn’t normalize FFTs. That is up to the user.