atomicCAS() doesn't work!

GiulioPU · July 20, 2010, 2:47pm

I am trying to use atomicCAS() to sum elements stored in global memory inside a kernel(the vector is the result of a parallel reduction…), but it doesnt work!

the code is easy:

[codebox]device int lock=0;

device_ float square_norm=0;

global mykernel(…){

…

if(tid==0){

do{}while(atomicCAS(&lock,0,1));//setlock

square_norm += temp[0]

__threadfence();//waitforwritecompletion

lock=0;//freelock

}

[/codebox]

Do you find where is the problem?

Thanks

Sarnath · July 20, 2010, 4:50pm

This can deadlock due to warp-divergence and the code’s reliance on un-defined behavior.
All these have been discussed long long time back. Try searching…

LSChien · July 21, 2010, 4:19am

(1)

don’t not use “lock=0” when fee lock.

try

atomicCAS(&lock,1,0)); // free lock

(2) if above modification does not work, then try to allocate lock outside the kernel.

GiulioPU · July 21, 2010, 8:44am

Whou! I have found the answer(…maybe…) in a topic after 20 replies!

[url=“http://forums.nvidia.com/index.php?showtopic=98444”]The Official NVIDIA Forums | NVIDIA

LSChien, thanks for your reply but I guess your code doesnt work, check the above topic.

Sarnath, PLEASE :( can you post the final WORKING code of a spinlock in CUDA? Many thanks!

Sarnath · July 22, 2010, 5:31pm

GiulioPU,
Unfortunately, I dont have that code now… I think ‘tmurray’ posted it in that topic… Check out…
Its difficult and tiresome…You may need to spend 1 or 2 days to get it working. Good Luck!

Topic		Replies	Views
questions about using atomicCAS as a lock CUDA Programming and Performance	0	1351	November 10, 2011
atomicCAS issue (possible deadlock) CUDA Programming and Performance	5	3307	October 26, 2011
atomiccas usage Legacy PGI Compilers	2	3724	December 25, 2014
atomicCAS for mutiple blocks & mutiple threads - CUDA 3.2 - Fedora 10 CUDA Programming and Performance	7	2570	April 25, 2011
Implementing mutual exclusion lock using atomicCAS() CUDA Programming and Performance	2	2408	August 5, 2009
Problem with lock using atomicCAS CUDA Programming and Performance	3	3614	July 19, 2014
atomicCAS CUDA Programming and Performance	8	4047	July 4, 2011
atomic locks CUDA Programming and Performance	15	13038	January 27, 2012
atomicCAS() doesn't compile! CUDA Programming and Performance	7	6156	April 20, 2011
why this deadlocks? try to invoke a critical area CUDA Programming and Performance	11	6195	November 6, 2009

atomicCAS() doesn't work!

Related topics