Choosing known source code implementations of transcendental functions – such as those frequently posted by Norbert Juffa (njuffa) on this forum – could be a reasonable workaround for undocumented changes that NVIDIA has made to the official implementations.
Changes to optimizations affecting floating-point arithmetic are typically minor. There should be no expectation of bitwise identical results across different versions of a tool chain, on any platform. Have you read NVIDIA's floating-point whitepaper for background?
If relatively small changes in the tool chain’s handling of floating-point computation lead to significantly different final results, this is a pretty good indication that your software implementation lacks numerical stability, something you might want to investigate.
Orthogonal to that effort, in order to recommend mitigation steps, you would have to first narrow down which section of code is the root cause of the observed differences. Two common scenarios in the context of CUDA are: (1) compiler changes affecting contraction of FMUL followed by FADD into FMA (fused multiply-add); (2) accuracy improvements to transcendental functions in the standard math library.
If you use floating-point atomics, the order of the operations is indeterminate, and because floating-point addition is not associative, results may differ from run to run. I am pretty sure that is spelled out in the whitepaper I mentioned.