Problem with CUFFT Z2Z

_Sayan · February 4, 2012, 7:32pm

Hi,

I am performing FFT (Z2Z) on an image of NXN size; as far as I understand, if I am doing an in-place C2C or Z2Z, then I do not need to pad my last dimension. But when I do an IFFT on the image generated by the real data (upon doing FFT), then I do not get the same image back. I am dividing by the number of elements (N*N) after getting the results from the inverse transform. Here are the steps followed by the CUFFT_Z2Z function, please help me in finding the bug and clearing any misconceptions. May be the bug is related to data allocation.

//Read image into NXN double array - img

//Allocate NXN cufftDoubleComplex for host_data and device_data arrays

//Transfer image data to host structure

host_data.x[...] = img[...]

host_data.y[...] = 0.0

//Memcpy and kernel launch

//Memcpy device_data to host_data

//Transfer the host_data back to img

img[...] = host_data[...].x 

//Write an image from the contents of the img array

/*************CUFFT Z2Z*****************/

void Z2Z_gpu (cufftDoubleComplex *data, unsigned int nx, unsigned int ny, int dir)

{

    cufftHandle     plan;

    /* Create a 2D FFT plan */

    cudasafe( cufftPlan2d(&plan, nx, ny, CUFFT_Z2Z));

    cudasafe( cufftSetCompatibilityMode ( plan , CUFFT_COMPATIBILITY_NATIVE ));

/* Forward transform the signal in place  */

    if ( dir )

        cudasafe( cufftExecZ2Z ( plan, data, data, CUFFT_FORWARD ));

/* Inverse transform the signal in place */

    if ( !dir )

        cudasafe( cufftExecZ2Z ( plan , data , data , CUFFT_INVERSE ));

/* Destroy the CUFFT plan */

    cufftDestroy ( plan ) ;

}

Cross post to stackoverflow - http://stackoverflow.com/questions/9137416/array-dimensions-in-2d-cufft-z2z

Thanks.

struct · February 6, 2012, 3:15pm

Hi,

I am performing FFT (Z2Z) on an image of NXN size; as far as I understand, if I am doing an in-place C2C or Z2Z, then I do not need to pad my last dimension. But when I do an IFFT on the image generated by the real data (upon doing FFT), then I do not get the same image back. I am dividing by the number of elements (N*N) after getting the results from the inverse transform. Here are the steps followed by the CUFFT_Z2Z function, please help me in finding the bug and clearing any misconceptions. May be the bug is related to data allocation.
//Read image into NXN double array - img

//Allocate NXN cufftDoubleComplex for host_data and device_data arrays

//Transfer image data to host structure

host_data.x[...] = img[...]

host_data.y[...] = 0.0

//Memcpy and kernel launch

//Memcpy device_data to host_data

//Transfer the host_data back to img

img[...] = host_data[...].x 

//Write an image from the contents of the img array

/*************CUFFT Z2Z*****************/

void Z2Z_gpu (cufftDoubleComplex *data, unsigned int nx, unsigned int ny, int dir)

{

    cufftHandle     plan;

    /* Create a 2D FFT plan */

    cudasafe( cufftPlan2d(&plan, nx, ny, CUFFT_Z2Z));

    cudasafe( cufftSetCompatibilityMode ( plan , CUFFT_COMPATIBILITY_NATIVE ));

/* Forward transform the signal in place  */

    if ( dir )

        cudasafe( cufftExecZ2Z ( plan, data, data, CUFFT_FORWARD ));

/* Inverse transform the signal in place */

    if ( !dir )

        cudasafe( cufftExecZ2Z ( plan , data , data , CUFFT_INVERSE ));

/* Destroy the CUFFT plan */

    cufftDestroy ( plan ) ;

}
Cross post to stackoverflow - http://stackoverflow.com/questions/9137416/array-dimensions-in-2d-cufft-z2z

Thanks.

Sayan,

This is expected behavior. The FFT / IFFT results need to be normalized at some point by the number of elements to get back the original results. Most people normalized the results of IFFT. Some normalize FFT and IFFT each by sqrt(number of elements).

_Sayan · February 6, 2012, 9:41pm

Thank you Pavan. I was normalizing the IFFT results with NXN, and upon some more testing, I found that the error increases with the variance in pixel values. Apart from NXN, I have also tried sqrt(NXN) on FFT and IFFT as you suggested, but again for some other images it fails. Perhaps I need to understand more about the normalization step to come up with a general solution.

pasoleatis · February 7, 2012, 7:12am

The normalization is the toal number of points (NxN). You can apply the normalization at anytime, because it is a multiplication or divition by the same number. What does the

cufftSetCompatibilityMode ( plan , CUFFT_COMPATIBILITY_NATIVE )

command do?

I think the problem it is not in the Z2Z_gpu function, but in the parts where you copy and replot the image. You should check these lines

//Transfer image data to host structure

host_data.x[...] = img[...]

host_data.y[...] = 0.0

//Memcpy and kernel launch

//Memcpy device_data to host_data

//Transfer the host_data back to img

img[...] = host_data[...].x 

//Write an image from the contents of the img array

Just do a simple test (only forward and backward transform of a complex matrix in a separate program, try first with a matrix which is all 1).

Topic		Replies	Views
Problem with CUFFT CUDA Programming and Performance	7	4902	May 16, 2018
2D cuFFT using C2C CUDA Programming and Performance cuda	2	731	April 30, 2020
CUFFT run wrong CUDA Programming and Performance	16	2808	May 23, 2013
Problem with CUFFT R2C+C2R returning NaNs CUDA Programming and Performance	3	1615	June 25, 2012
Help cufft execution CUDA Programming and Performance	1	719	November 28, 2017
2D CUFFT wrong result GPU-Accelerated Libraries cufft	8	3098	November 7, 2023
3D CUFFT fails CUDA Programming and Performance	5	1015	February 9, 2012
Wrong results in CUFFT! CUDA Programming and Performance	4	5465	March 22, 2011
cufftExecC2C incorrect for certain FFT sizes CUDA Programming and Performance	5	3647	February 4, 2012
Problem using cuFFT CUDA Programming and Performance	3	3560	May 31, 2011

Problem with CUFFT Z2Z

Related topics