Hi,
I have tried to use spinlocks in my kernel code. Similarly to this article, I used the following code:
void lock(__global int *the_lock)
{
    // spin until we are the work-item that swaps a 1 in while the lock held 0
    int occupied = atom_xchg(the_lock, 1);
    while (occupied) {
        occupied = atom_xchg(the_lock, 1);
    }
}
void unlock(__global int *the_lock)
{
    // release the lock by storing a 0 back
    atom_xchg(the_lock, 0);
}
and initialized the spinlock to 0, issued a barrier, and simply locked and unlocked inside the kernel. The code works as it should when run with both global and local work size 1, but with 2 it simply hangs, as if the kernel looped forever. If I instead lock and unlock first under the condition get_local_id(0) == 0 and then under get_local_id(0) == 1, it works. With the spinlock in local memory the problem repeats, and adding memory fences wherever possible has not helped.
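For concreteness, the failing kernel is essentially the following sketch (the kernel name and the counter buffer are mine, made up for illustration; the real code just does some work in the critical section):

// assumes the lock()/unlock() helpers above and
// cl_khr_global_int32_base_atomics being available
__kernel void spinlock_test(__global int *the_lock, __global int *counter)
{
    if (get_global_id(0) == 0)
        *the_lock = 0;               // initialize the spinlock to 0
    barrier(CLK_GLOBAL_MEM_FENCE);   // barrier so everyone sees the 0

    lock(the_lock);
    *counter += 1;                   // trivial critical section
    unlock(the_lock);
}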
After some time I realized what the problem is: the first work-item successfully locks the spinlock, and the second finds it locked (so far everything is OK). However, because of the SIMD fashion in which instructions execute, the loop spun by the second work-item also blocks the first one (there is no scheduling as on a CPU; all work-items execute the same instructions). The first work-item therefore never unlocks the spinlock, and the kernel hangs.
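Spelling that out on the lock() code above, with comments tracing two work-items of the same warp (this is just my reading of what happens, not something I have verified at the ISA level):

void lock(__global int *the_lock)
{
    // work-item 0: swaps a 1 in and gets the old value 0 back -> owns the lock
    // work-item 1: swaps a 1 in and gets 1 back -> has to spin
    int occupied = atom_xchg(the_lock, 1);

    // the two work-items have diverged, but they sit in the same warp
    // and run in lockstep: while work-item 1 keeps spinning here,
    // work-item 0 is merely masked off and never advances to its
    // critical section, so unlock() is never reached -> deadlock
    while (occupied) {
        occupied = atom_xchg(the_lock, 1);
    }
}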
Is this explanation correct? How should I implement the spinlock correctly? I tried to rewrite it this way:
for (;;) {
    if (atom_xchg(the_lock, 1) == 0) {
        // critical section
        atom_xchg(the_lock, 0);
        break;
    }
}
so that the unlock code ends up inside the if, but it does not work either (God knows what the compiler does with that). What should I do?
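For what it's worth, my best guess at why this version also hangs, under the same lockstep assumption as above, is that the branch still diverges and the warp is free to re-run the spinning side first (the wrapper function below is mine, just to make the sketch self-contained):

void locked_increment(__global int *the_lock, __global int *counter)
{
    for (;;) {
        if (atom_xchg(the_lock, 1) == 0) {
            // only the lock holder takes this path; if the warp chooses
            // to re-execute the loop for the spinning work-items before
            // running this (masked) branch, the 0 is never stored back
            // and the whole warp spins forever
            *counter += 1;           // critical section
            atom_xchg(the_lock, 0);
            break;
        }
    }
}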
To provide enough information: I am running 64-bit Red Hat 6.0 Linux with an Nvidia GeForce GTX 580, driver version 260.19.26 (as reported by an OpenCL info utility).