double precision atomicAdd() problem

HDYi · July 1, 2015, 9:31am

CUDA version : 6.5
GPU : Tesla K40c
compute capability : 3.5

For double-precision atomicAdd() I use this code:

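__device__ double atomicAdd(double* address, double val) {
    unsigned long long int* address_as_ull = (unsigned long long int*)address;
    unsigned long long int old = *address_as_ull, assumed;

    // Retry until no other thread has changed *address between our read and the CAS.
    do {
        assumed = old;
        old = atomicCAS(address_as_ull, assumed,
                        __double_as_longlong(val + __longlong_as_double(assumed)));
    } while (assumed != old);  // integer comparison, so a NaN cannot cause an endless loop

    return __longlong_as_double(old);  // value that was stored before this add
}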

However, the result of atomicAdd() differs from the result of the CPU code beyond the 10th decimal place.
Are these differences inevitable?

Robert_Crovella · July 1, 2015, 1:16pm

They might be “inevitable”.
People who expect exact duplication of floating point results between host and device computations are frequently disappointed.

Floating point calculations may produce different results depending on the actual order of operations. Since parallel code running on the device will execute a given algorithm with possibly a different order of operations than the “same” algorithm running on the host, these differences pop up.
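As a rough illustration of that point, here is a minimal host-side sketch in plain C++ (no GPU involved). The function names and the three sample values are made up for this example; the reverse loop merely stands in for "some other order", such as the order in which atomic adds from different threads happen to land. It assumes IEEE 754 doubles with the default round-to-nearest mode and no fast-math compiler flags.

```cpp
#include <cstddef>
#include <vector>

// Sum left to right, the way a simple serial CPU loop would.
double sum_forward(const std::vector<double>& v) {
    double s = 0.0;
    for (std::size_t i = 0; i < v.size(); ++i) {
        s += v[i];
    }
    return s;
}

// Sum the same values right to left: same inputs, different order.
double sum_reverse(const std::vector<double>& v) {
    double s = 0.0;
    for (std::size_t i = v.size(); i-- > 0;) {
        s += v[i];
    }
    return s;
}

// Hypothetical data chosen to make the effect obvious: near 1e16 the
// spacing between adjacent doubles is 2, so adding 1.0 there is lost.
std::vector<double> sample_values() {
    return {1e16, -1e16, 1.0};
}
```

Under those assumptions, sum_forward(sample_values()) gives 1.0 while sum_reverse(sample_values()) gives 0.0, because in the reverse order the 1.0 is rounded away when it is added to -1e16. Real data usually shows a much smaller discrepancy, in the last few digits, which is consistent with a difference appearing only after the 10th decimal place.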

If you google “what every computer scientist should know about floating-point arithmetic” you may get some interesting information.

usmansyed606 · January 29, 2024, 5:11am

@Robert_Crovella: I am facing the same problem. If I print the results for only two nodes, I get the same double values. But if I print all the values, I get a very weird order and results. The reason, of course, is that each calculation is based on results from different nodes (its neighbors). The problem I am solving is Louvain. Even after applying atomic operations I am getting the same weird order.

Robert_Crovella · February 1, 2024, 8:53pm

There is not enough information here for me to be able to make any further comments.
