Cufft shipped with CUDA 11 results on Ampere/Volta != Pascal

ungar · July 8, 2021, 4:56pm

2D FFTs (real or complex float32) get slightly different results on a GTX1080Ti (Pascal) vs (V100, A40, A100) in CUDA 11. Arrays are in the 1024x1024 to 4096x4096 range. We saw exactly the same results for Pascal vs. Volta using the cufft that shipped with CUDA 9.2.88p1. Is this expected and is there some setting that can be used to get exactly the same results across all three architectures?

mnicely · July 8, 2021, 5:17pm

Identical results are not guaranteed between architectures due to implementations and optimizations specific to each architecture, within updated software stacks.

ungar · July 8, 2021, 5:20pm

Yes, of course, yet we did get binary-identical results for Pascal and Volta, and now Volta and Ampere, suggesting it might be possible for all three to get the same results…

mnicely · July 8, 2021, 5:22pm

It’s possible Volta+ kernels were updated in CTK 11.

ungar · July 8, 2021, 5:28pm

You are likely right about that. I’ll ask via a bug/problem report. Thanks for your quick response!

Topic		Replies	Views
The cufftEstimate2d has different result on GTX1080 and V100 GPU-Accelerated Libraries	2	575	December 25, 2019
CUDA FFT low accurary compared with FFTW using single precision float GPU-Accelerated Libraries cufft	1	828	May 5, 2021
cufftExecR2C generate different results between Geforce series 10 and series 20 GPU-Accelerated Libraries cufft	3	604	May 25, 2022
the FFT done by GPU gives different results from Labview CUDA Programming and Performance	1	1146	January 11, 2011
Half precision cuFFT Transforms GPU-Accelerated Libraries	12	6109	March 29, 2021
cufftExecR2C and cufftExecC2R API calls generates different results in different CUDA tool kit versions GPU-Accelerated Libraries cufft	1	1562	August 9, 2021
CUFFT accuracy in EMULATIOn CUDA Programming and Performance	0	4198	September 15, 2009
Parallel computing on two GPUs GPU-Accelerated Libraries	1	376	June 12, 2019
Different GPUs on different workstations (compatibility) CUDA Setup and Installation	3	586	September 5, 2017
Matlab FFT vs CUDA FFT GPU-Accelerated Libraries	1	1302	July 6, 2017

Cufft shipped with CUDA 11 results on Ampere/Volta != Pascal

Related topics