cuFFT function cufftExecR2C How to get full result instead of only the first half?

lehuyduc4 · July 26, 2022, 9:57am

Function cufftExecR2C has this in its description:

cufftExecR2C() (cufftExecD2Z()) executes a single-precision (double-precision) real-to-complex, implicitly forward, cuFFT transform plan. cuFFT uses as input data the GPU memory pointed to by the idata parameter. This function stores the nonredundant Fourier coefficients in the odata array.

As a result, the output only contains the first half of the result. For example, the output is this:

15 + 0
-2.5 + 3.44095
-2.5 + 0.812299
0 + 0 // second half is zero-filled
0 + 0 // it's symmetric with the first half, is there any parameter to fill it too?

Expected:

    (+1.500e+01,+0.000e+00)
    (-2.500e+00,+3.441e+00)
    (-2.500e+00,+8.123e-01)
    (-2.500e+00,-8.123e-01)
    (-2.500e+00,-3.441e+00)

We can write an extra kernel to fill the second half of the output, but that’s an extra step. Is there any built-in method in cuFFT to fill the whole array instead of just the first half?

Thanks!

Robert_Crovella · July 26, 2022, 2:26pm

There is no built-in method in CUFFT to provide any other kind of output from the R2C transform. You could just use a C2C transform if you want the “full” output. This will require modified formatting of your input data, of course.

lehuyduc4 · July 27, 2022, 3:01am

Hmm, strange that they didn’t add one for quality-of-life improvement.

C2C transform is probably slower (is there any benchmark results online?) . So I guess I have to write a separate kernel.

Robert_Crovella · August 6, 2022, 7:22pm

Yes, in my experience, typically a C2C transform is slower than an equivalent R2C. If you have a specific case in mind, it should be trivial for you to benchmark the actual difference.

benchmark data is linked from the cufft landing page. Look for the “Learn More” link and click on it. The cufft data starts on slide 16/17 but I don’t see anything there that compares R2C to equivalent C2C

Topic		Replies	Views
cuFFT cufftPlan1d and cufftExecR2C issues GPU-Accelerated Libraries	4	2459	July 13, 2016
cufftExecR2C only gives half the answer..?! CUDA Programming and Performance	2	4304	July 24, 2009
cufft functionality CUDA Programming and Performance	7	1319	January 12, 2012
2D cuFFT Real2Complex How to retrieve "missing" coefficients? CUDA Programming and Performance	2	2690	November 14, 2008
Newbie to cuFFT - how to do real-to-real transforms GPU-Accelerated Libraries	5	2200	February 19, 2019
Cufft_R2C and Cufft_C2R are inaccurate GPU-Accelerated Libraries	2	1798	April 11, 2014
Trouble with cudaFFT real to complex CUDA Programming and Performance	2	2645	October 24, 2007
cufft R2C results only in one quadrant GPU-Accelerated Libraries	1	906	May 15, 2019
2D CUFFT wrong result GPU-Accelerated Libraries cufft	8	3244	November 7, 2023
2D CUFFT problem CUDA Programming and Performance	1	757	February 9, 2012

cuFFT function cufftExecR2C How to get full result instead of only the first half?

Related topics