Question about atomic operation

thisdisplaynameisalreadyi · June 18, 2017, 3:38am

From the book, I see that when we perform an atomic operation, the hardware guarantees that no other thread can read or write the value at address addr while the operation is in progress. But this confuses me. For instance:

Suppose I have two threads (thread0 and thread1) and a buffer called buf of size two (buf[0] and buf[1]). The address of buf[0] is 10000 and the address of buf[1] is 10001.

Now I call atomicAdd(&buf[threadIdx.x], 1), so thread0 adds 1 to buf[0] and thread1 adds 1 to buf[1]. My question is: do these two operations happen at the same time (since addresses 10000 and 10001 are different), or does thread1 perform its operation only after thread0 has finished?

Thanks for your help

BulatZiganshin · June 18, 2017, 8:16am

It is performed similarly to a simple load/store, i.e. in SIMD fashion, with coalescing for the global memory space and banking for shared memory.
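To make the case from the question concrete, here is a minimal sketch in which every thread atomically increments its own element, so no two threads ever target the same address. The names add_distinct and run_distinct are made up for illustration, and the code only demonstrates the result (each element ends up incremented exactly once); it does not measure whether the two updates complete in the same cycle, which depends on the hardware.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Each thread updates its own element, so the atomics never collide
// on a single address.
__global__ void add_distinct(int *buf)
{
    atomicAdd(&buf[threadIdx.x], 1);
}

// Hypothetical host helper (not from the thread): launches one block of
// n threads on a zeroed buffer and copies the result back to host_buf.
// Returns 0 on success, 1 on any CUDA error.
int run_distinct(int *host_buf, int n)
{
    int *dev_buf = nullptr;
    if (cudaMalloc(&dev_buf, n * sizeof(int)) != cudaSuccess) return 1;
    if (cudaMemset(dev_buf, 0, n * sizeof(int)) != cudaSuccess) {
        cudaFree(dev_buf);
        return 1;
    }

    add_distinct<<<1, n>>>(dev_buf);
    cudaError_t err = cudaGetLastError();   // catches launch errors
    if (err == cudaSuccess) {
        // cudaMemcpy waits for the kernel to finish before copying.
        err = cudaMemcpy(host_buf, dev_buf, n * sizeof(int),
                         cudaMemcpyDeviceToHost);
    }
    cudaFree(dev_buf);
    return err == cudaSuccess ? 0 : 1;
}

int main()
{
    int buf[2] = {0, 0};
    if (run_distinct(buf, 2) != 0) {
        fprintf(stderr, "CUDA error\n");
        return 1;
    }
    printf("buf[0] = %d, buf[1] = %d\n", buf[0], buf[1]);
    return 0;
}
```

Since thread0 and thread1 work on different addresses, neither atomic has to wait for the other in order to be correct.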

thisdisplaynameisalreadyi · June 18, 2017, 8:02pm

I'm confused. From what I understand, you said:

All of the threads in a half-warp access shared memory at the same time, which means thread0 accesses buf[0] and thread1 accesses buf[1] at the same time. Then they change buf[0] and buf[1] at the same time. Is that right?

BulatZiganshin · June 18, 2017, 10:07pm

Yes.

Btw, half-warp access was used in 10-year-old GPUs, AFAIR, so you may need to read newer books :)

thisdisplaynameisalreadyi · June 18, 2017, 11:43pm

Thank you, really helpful : )

BulatZiganshin · June 19, 2017, 12:35pm

Note that your case is simple. In more complex cases, coalescing/banking rules apply.
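One less simple case, sketched here as an assumption about what "more complex" might include: many threads doing atomicAdd on the same address. There the updates cannot all land independently, and the result is as if they were applied one after another, so no increment is lost. The names add_same and run_same are hypothetical, and the sketch shows only the final count, not how the hardware orders or combines the updates internally.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Every thread increments the same counter, so all atomics target
// one address and contend with each other.
__global__ void add_same(int *counter)
{
    atomicAdd(counter, 1);
}

// Hypothetical host helper (not from the thread): launches one block of
// n threads on a zeroed counter and copies the final value to *total.
// Returns 0 on success, 1 on any CUDA error.
int run_same(int *total, int n)
{
    int *dev_counter = nullptr;
    if (cudaMalloc(&dev_counter, sizeof(int)) != cudaSuccess) return 1;
    if (cudaMemset(dev_counter, 0, sizeof(int)) != cudaSuccess) {
        cudaFree(dev_counter);
        return 1;
    }

    add_same<<<1, n>>>(dev_counter);
    cudaError_t err = cudaGetLastError();   // catches launch errors
    if (err == cudaSuccess) {
        // cudaMemcpy waits for the kernel to finish before copying.
        err = cudaMemcpy(total, dev_counter, sizeof(int),
                         cudaMemcpyDeviceToHost);
    }
    cudaFree(dev_counter);
    return err == cudaSuccess ? 0 : 1;
}

int main()
{
    int total = 0;
    if (run_same(&total, 256) != 0) {
        fprintf(stderr, "CUDA error\n");
        return 1;
    }
    // With atomicAdd, the total equals the number of threads launched.
    printf("total = %d\n", total);
    return 0;
}
```

Replacing atomicAdd(counter, 1) with a plain (*counter)++ would make this a data race, and the final value would typically be smaller than the thread count.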
