cublasSdot_v2() gives different results when running on different GPU types,

israel2 · July 5, 2022, 8:53am

Hi,
I’m running the same binary code on computers with different GPU types, with the same input parameters.
The code only calls cublasSdot_v2(), the input values are 32 bit floats, and the results on the 2 machines are different by 0.0000152587890625 or 0.000030517578125.

Is this a known issue?
Is there a way to receive binary-exact results on 2 different GPU types?

Thanks,
Icey.

njuffa · July 5, 2022, 9:17am

Generally speaking, GPU architecture specific kernels in CUBLAS are a thing. Whether SDOT is one of the BLAS functions affected, I do not know.

Architecture-specific kernels usually do use a different order of floating-point operations, and since algebraically identical computation is usually not identical in finite-precision floating-point, bit-wise identical BLAS results are not guaranteed across GPU architecture of CUDA versions.

SDOT is a function that is subject to a numerical phenomenon called subtractive cancellation, and when that occurs, relative error can get almost arbitrarily large. Whether this explains the observation reported we cannot tell, because the question does not include a minimal self-contained example code that reproduces the observation.

You may also want to double-check the assertion that the input data to the SDOT call is bit-wise identical on both platforms.

israel2 · July 5, 2022, 12:54pm

thanks

system · July 19, 2022, 12:55pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
cublasSgemm produces non-trivially different results in CUDA 9.1 vs CUDA 8.0 GPU-Accelerated Libraries	9	1128	February 19, 2019
double precision CUDA Programming and Performance	1	258	May 29, 2019
Is this a BUG of CuBLAS output not consistent for each run CUDA Programming and Performance	8	3184	July 29, 2010
sgemm precision wrong results cublasSgemm vs MKL sgemm CUDA Programming and Performance	4	5339	December 22, 2007
64bit vs. 32bit floats CUDA Programming and Performance	6	22149	January 16, 2009
cublas_sdot bug ? sdot should be single precision CUDA Programming and Performance	1	1097	January 21, 2009
Problem using double precision arithmetic on GT200 Incorrect results using double precision CUDA Programming and Performance	0	4273	May 30, 2008
CUBLAS vs GOTOBLAS2 CUDA Programming and Performance	20	10672	September 15, 2010
Question regarding Precision Issues in BLAS CUDA Programming and Performance	9	8516	March 4, 2010
[CUDA - CUBLAS] Deviation of computation results increase when calculating larger data CUDA Programming and Performance	2	1981	October 12, 2011

cublasSdot_v2() gives different results when running on different GPU types,

Related topics