shared memory atomics vs volatile does volatile eliminate the need for atomics in shared mem?

thanasio · March 23, 2012, 11:57pm

Hi

do i get this right. For latest version and cards of Cuda, if shared memory is declared as volatile, there is no need for atomics?
In other words, in the following example

volatile shared float memSlot[256];

…

memsLot[7] = 3;
…
//thread 5

memslot[7]++;

…
//thread 6 at the same time as thread
memslot[7]++;

//will result in memslot[7] to be guaranteed to be 5 after these concurrent operations without atomics?

Cheers,
Than

RezaRob3 · March 24, 2012, 5:41am

No, atomics is something else. If two different threads do atomic increments, the increments will be completely distinct from each other(each thread “checks out” the variable and locks it exclusively.)

With volatile, the two threads may do simultaneous reads, and after that, writes of the incremented value, but only one of the writes(the “last one” so-to-speak) gets to modify the variable, the end result being just one increment.

Atomics must be supported at the hardware level. volatile, on the other hand, is a signal to the compiler that says, “spill the registers or other temporary cached copies immediately.”

Topic		Replies	Views
Unsynchronized shared memory access CUDA Programming and Performance	5	1036	April 11, 2017
Atomic operation in shared memory CUDA Programming and Performance	1	3820	August 12, 2008
Useful Arbitrary Atomic Operation Hack CUDA Programming and Performance	0	10065	July 20, 2008
How's atomic operations in CUDA implemented? CUDA Programming and Performance cuda , kernel , programming	8	3498	March 26, 2024
Questions with shared memory CUDA Programming and Performance	3	1662	June 21, 2011
Force flush to global memory on grid level in cooperative kernels CUDA Programming and Performance	5	1258	August 13, 2019
Atomic functions and volatile shared memory declarations. CUDA Programming and Performance	6	14082	December 14, 2013
Shared memory write conflicts Looking for a little help... CUDA Programming and Performance	5	4917	September 7, 2007
volatile and __syncthreads and in warp... but still not what I expect Question on how to use shared CUDA Programming and Performance	1	1083	February 28, 2011
Atomic addition for floats in shared memory application: weighted histogram CUDA Programming and Performance	1	1905	February 20, 2008

shared memory atomics vs volatile does volatile eliminate the need for atomics in shared mem?

Related topics