Critical section and ballot

akss · June 16, 2022, 2:54am

This SO answer implements the critical section with using the ballot function. However, this code is deprecated in the current CUDA. How would this code look nowadays?

__device__ bool warp_lock(int req){
  return ((__ffs(__ballot(req))) == ((threadIdx.x & 31)+1));
}

rs277 · June 16, 2022, 4:14am

__ballot_sync() is the replacement, I believe:

akss · June 16, 2022, 6:19pm

Thanks for the quick link. The new version uses another set of parameters. What those parameters should be? This link seems more informative.

I’m not sure my understanding that __ballot(req) can be simply changed to __ballot_sync(FULL_MASK, result); is correct.

rs277 · June 16, 2022, 8:09pm

If you meant, “__ballot_sync(FULL_MASK, req);” above, then I believe this will do the job, taking into consideration both the SO code and the “Warp Level Primatives” post.

The difference between the two is that there is no synchronisation guarantee between all active threads at this point, in the __ballot(req) case and FULLMASK to capture all threads is a valid choice.

I’m assuming “FULLMASK” == 0xFFFFFFFF

Topic		Replies	Views
Using CUDA Warp-Level Primitives Technical Blog	20	1959	April 15, 2024
can anybody explain warp vote functions CUDA Programming and Performance	9	11312	February 11, 2011
Is there a block vote (analogous to warp vote?) CUDA Programming and Performance	7	20671	July 20, 2009
WARP Voting function CUDA Programming and Performance	6	6492	March 25, 2010
Critical Sections on GPU (Sort of a Repeat) Implementing equivalents to pthread_cond_wait & pthr CUDA Programming and Performance	4	1800	April 17, 2010
__ballot_sync inside for loop causes kernel to hang in sm_75 CUDA Programming and Performance	9	1274	October 12, 2021
Warp Vote Functions..When are they useful? CUDA Programming and Performance	1	1392	August 20, 2013
Ballot Based Reduction CUDA Programming and Performance	2	5448	December 6, 2010
Ballot Based Reduction CUDA Programming and Performance	0	1150	December 6, 2010
why this deadlocks? try to invoke a critical area CUDA Programming and Performance	11	6096	November 6, 2009

Critical section and ballot

Related topics