3DFFT efficiency

fabbraun · June 1, 2011, 6:50pm

Hi,

It’s been quite a while since my last (big) CUDA project. (We were using CUDA toolkit/SDK version 2.3 at the time, two years ago.)
But I remember quite well reading statements from several people saying that the 3D FFT implementation in the CUDA libraries is rather inefficient compared to the 2D version.
That is, compared to the quite optimized FFT implementations in Matlab, the 2D CUDA implementation shows significant/“breathtaking” speedups, whereas the 3D version doesn’t, or rather didn’t.

Is this still true today?
Or can one say that the FFT implementations in CUDA, whether 2D or 3D, are quite well optimized and show significant speedups compared to CPU/Matlab implementations?

To be precise, I’m planning to implement a fast MRI reconstruction algorithm (gridding, 3DFFT, …) on a CUDA GPU.
But before I ask annoying questions about this, I’m going to read through the “MRI on CUDA” section right here!

Thanks for your help in advance!
Fab

fabbraun · June 8, 2011, 12:00pm

so far i’ve found the NukadaFFT library (see its homepage or the thread in this forum).

as mentioned in their paper, the 3D CUFFT shows low performance, especially for non-power-of-two transform sizes.
BUT their timing results look pretty promising!

so i guess i will go in this direction (using the NukadaFFT library) for my project.
does anyone have good arguments against using this library?

thanks, fab
