Query related to CUFFT

Rajesh1973 · July 24, 2008, 5:26pm

I am using FFT for an Image Processing application. I am planning to replace my existing CPU based FFT (which is based on Cooley Tuckey algorithm) with CUFFT.

I have few questions regarding CUFFT.

( 1 ) Which is the FFT algorithm used internally by CUFFT. Is it Cooley Tuckey?
Then only I can compare the performance.

( 2 ) How much speed up can I expect for an Image of size 5k * 5k? A rough idea.

My CPU application takes around 71 seconds to complete.

Thanks in advance

E.D_Riedijk · July 24, 2008, 6:05pm

you can download the CUFFT source to check out which algorithm is used :)

iceberg · July 28, 2008, 12:20pm

Could you tell me where I can download the CUFFT source code, please?

Thank you!

E.D_Riedijk · July 28, 2008, 12:49pm

Cuda announcements & news

http://forums.nvidia.com/index.php?showforum=63

XFer · July 28, 2008, 4:16pm

On a G92, for a Complex2Complex Forward transform + Backward transform (that is, you get your original image back) of a 2048x2048 grayscale image, I measured around 100 milliseconds (0.1 seconds), including interleaving/deinterleaving (Re + Im ↔ complex) and Host<->GPU data transfers.

For comparison, FFTW takes around 0.3 seconds (3x the time) for the same task, and that’s with a 3.0 GHz quad-core using the Single Precision SSE version of FFTW, multi-threaded (NThreads = 4).

So I’d say, CudaFFT is pretty fast in this situation.

Please note that I can’t try 5Kx5K, since on my G92 512MB I can only go as far as 3Kx3K or something around that (I think a 1.5GB setup should do 5Kx5K).

Your mileage may vary if your image has rows# and/or cols# not multiple of the number of Stream Multiprocessors your GPU has (because of uncoalesced memory access and/or bank conflicts, if I recall well).

More benchmarks here:

http://forums.nvidia.com/index.php?showtop…56&#entry413956

and here

http://forums.nvidia.com/index.php?showtopic=42482&st=0

Fernando

Topic		Replies	Views
Query regarding CUFFT CUDA Programming and Performance	1	1830	July 24, 2008
Writing custom FFT for sizes other than powers of 2 CUDA Programming and Performance	2	5115	September 29, 2010
cufft doubt comparing r2c and c2c 2D FFTs CUDA Programming and Performance	28	13584	October 27, 2010
CUFFT: calculation time CUDA Programming and Performance	6	2707	April 21, 2012
cufft performance CUDA Programming and Performance	2	12891	March 10, 2011
Can CUFFT do more fast on small size img? CUDA Programming and Performance	5	5553	December 17, 2007
CUFFT 2D source code CUDA Programming and Performance	4	5508	April 28, 2010
CUFFT Implementation CUDA Programming and Performance	3	7438	July 2, 2007
CUFFT and image treatment CUDA Programming and Performance	1	1245	March 12, 2010
2D FFT CUDA Programming and Performance	0	272	January 18, 2018

Query related to CUFFT

Related topics