How to implement lock on the gpu?

Austin · January 12, 2010, 6:28am

Hello,guys,
Is there someone do anything about the lock on gpu? In my program, I want to use some threads in one thread block, and these threads will contact with each other threads through the shared memory, i.e., threads in one thread block may write or read the same space, my question is : Is there CUDA APIs work as pthread_mutex_lock that can lock some code to guarantee the exclusive access to the same code piece? Or some other alternative methods to solve this situation?
I has read the atomic functions supplied by Nvidia, but the shared data is not just one int or other primitive type variable, it is a struct that contains some fields.
Assume there is a struct named ExampleStruct defined like this:

struct ExampleStruct {
int first;
int second;
int third;
};

ExampleStruct example;
using pthread_mutex_lock would like this:

pthread_mutex_lock(&lock);
example.first += 1;
example.second += 1;
example.third += 1;
pthread_mutex_unlock(&lock);

but the atomic functions can only assure that one variable (a memory space) access exclusively.

code like the following is not right:
atomicAdd(&example.first,1);
atomicAdd(&example.second,1);
atomicAdd(&example.third,1);

Any ideas,guys? :unsure:

Sarnath · January 12, 2010, 8:53am

Implementation of locks is non-trivial in CUDA.

The general idea is to

Make 1 representative thread inside a block contend for a block-level lock in global memory
Once a block gets the lock, threads inside the block should contend for a intra-block lock present in shared memory
Do critical section and Release Shared Memory lock and global memory lock and then (all threads in the block) go to step 1.

It is common to see that “spinning while loop” deadlocks due to WARP-Divergence. So, the spinning loops need to be written VERY carefully. Check out the thread. [url=“http://forums.nvidia.com/index.php?showtopic=98444”]The Official NVIDIA Forums | NVIDIA

NCC-1701D · January 13, 2010, 8:58am

You can check the histogram example SDK, it has a quite elegant implementation of software lock for shared memory access

Austin · January 18, 2010, 1:50am

OK, Thanks very much, guy! I will check it.

Austin · January 18, 2010, 6:10am

Sarnath, thank you for you tips, and I also read the topic thread, It’s so long a discussion, but still a bitter confused, now I come back to this topic again(last week I focused my attention on the other thing), and I plan to read the topic again. Hope I can get it.

Thanks again!

hehe

Sarnath · January 19, 2010, 10:09am

I suggest not to read it completely. SOrry I should have told you b4. The topic takes lot of deviations and one can easily get confused…

Try implementing the lock according to what I said – like, get 1 representative thread (threadIdx.x == 0) fight for the block, then among threads inside the block…

While you do that you might hit “deadlocks”… Read the topic (browse it peripherally to locate your region of interest)… Most of the deadlocks come from “spinning” loops…

Good Luck,

Topic		Replies	Views
How to implement a lock? CUDA Programming and Performance	2	18312	March 13, 2011
A problem of implementing mutex in CUDA CUDA Programming and Performance	6	1559	June 29, 2017
Try to use lock and unlock in CUDA CUDA Programming and Performance	1	19268	June 14, 2017
Problem with lock using atomicCAS CUDA Programming and Performance	3	3539	July 19, 2014
Simultaneous write Multiple threads writing to the same memory location? CUDA Programming and Performance	2	1134	June 6, 2010
Spinlock functionality? CUDA Programming and Performance	6	780	December 27, 2021
How to pass a struct to a kernel? CUDA Programming and Performance	8	2332	March 19, 2019
why this deadlocks? try to invoke a critical area CUDA Programming and Performance	11	6096	November 6, 2009
atomicCAS mutex not working on 2080ti? CUDA Programming and Performance	3	1042	November 30, 2019
Critical Sections on GPU (Sort of a Repeat) Implementing equivalents to pthread_cond_wait & pthr CUDA Programming and Performance	4	1800	April 17, 2010

How to implement lock on the gpu?

Related topics