How do bar.sync and __syncthreads interact?

isaaclee2313 · December 9, 2018, 6:48am

Which barrier resource do __syncthreads() and the other CUDA runtime-level synchronization functions use? Is it fixed to 0?
If bar.sync uses the same barrier resource as __syncthreads(), how do the two interact?

tera · December 9, 2018, 1:11pm

__syncthreads() used to translate to a bar.sync instruction on barrier #0. I am not sure whether that is still the case with CUDA 10.0 and compute capability 7.x, but it would be a simple experiment to compile a kernel to PTX and check for yourself.
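
A minimal sketch of that experiment, in case it helps. The file name, kernel name, block size, and the host-side check are all made up for illustration; the only part that matters is the __syncthreads() call in the kernel. Compile with nvcc's -ptx option and search the generated file for a bar.sync (or barrier.sync) line to see which barrier number, if any, shows up for your toolkit and target architecture.

```cuda
// sync_probe.cu (hypothetical file name)
// Emit PTX:  nvcc -arch=sm_70 -ptx sync_probe.cu -o sync_probe.ptx
// Then search sync_probe.ptx for "bar" to see what __syncthreads() became.
#include <cuda_runtime.h>

#define BLOCK 256  // assumed block size for this sketch

// Reverses one BLOCK-sized chunk of data in place via shared memory.
__global__ void reverse_block(int *data)
{
    __shared__ int tile[BLOCK];
    unsigned base = blockIdx.x * BLOCK;
    tile[threadIdx.x] = data[base + threadIdx.x];
    __syncthreads();  // the barrier whose PTX form we want to inspect
    data[base + threadIdx.x] = tile[BLOCK - 1 - threadIdx.x];
}

// Host-side sanity check: launches one block and verifies the reversal.
bool reverse_block_ok()
{
    int h[BLOCK];
    for (int i = 0; i < BLOCK; ++i) h[i] = i;

    int *d = nullptr;
    if (cudaMalloc(&d, sizeof(h)) != cudaSuccess) return false;
    cudaMemcpy(d, h, sizeof(h), cudaMemcpyHostToDevice);
    reverse_block<<<1, BLOCK>>>(d);
    bool ok = cudaMemcpy(h, d, sizeof(h), cudaMemcpyDeviceToHost) == cudaSuccess;
    cudaFree(d);

    for (int i = 0; ok && i < BLOCK; ++i) ok = (h[i] == BLOCK - 1 - i);
    return ok;
}
```

Whatever the PTX shows only describes that one compiler version and target, so treat it as an observation rather than a guarantee.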

Robert_Crovella · December 9, 2018, 1:18pm

Since there is no specification of this that I am aware of, I view it as an implementation detail, and therefore a hazard to depend on any particular behavior, from a code correctness point of view.

Any time you have to ask for unpublished information, or disassemble code to inspect compiler behavior, a flag should be raised in your thought process that indicates that what you are observing may not be dependable behavior for code correctness.

isaaclee2313 · December 10, 2018, 1:12am

To tera,

Oh, I’m such a noob. Of course, I should have tried to look at the assembly.

Thanks.

Do you have any tips for reading assembly?

isaaclee2313 · December 10, 2018, 1:13am

To Robert_Crovella,

Thanks, I will keep that in mind. A safe approach would be to use only bar.sync, and never __syncthreads, in any kernel where I need bar.sync at least once.
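
A rough sketch of what a kernel that uses only bar.sync could look like, via inline PTX. The helper named_bar_sync, the kernel, and the choice of barrier numbers 1 and 2 are assumptions for illustration, not anything documented as the recommended approach. The barrier numbers simply steer clear of 0, the one __syncthreads() reportedly used in the past. The thread count passed to bar.sync must be a multiple of the warp size, and every thread of a participating warp has to reach the barrier.

```cuda
#include <cuda_runtime.h>

#define HALF 64  // threads per group; must be a multiple of the warp size (32)

// Hypothetical helper: named barrier through inline PTX.
// id is the barrier number, nthreads the number of threads expected at it.
__device__ __forceinline__ void named_bar_sync(unsigned id, unsigned nthreads)
{
    asm volatile("bar.sync %0, %1;" :: "r"(id), "r"(nthreads) : "memory");
}

// Launch with 2 * HALF threads in one block. Each half of the block reverses
// its own HALF-element segment and synchronizes only with its own half.
__global__ void reverse_halves(int *data)
{
    __shared__ int tile[2 * HALF];
    unsigned group = threadIdx.x / HALF;  // 0 or 1, uniform within each warp
    unsigned lane  = threadIdx.x % HALF;
    tile[threadIdx.x] = data[threadIdx.x];
    named_bar_sync(1 + group, HALF);      // barriers 1 and 2; barrier 0 is left alone
    data[threadIdx.x] = tile[group * HALF + (HALF - 1 - lane)];
}

// Host-side sanity check: launches one block and verifies both reversals.
bool reverse_halves_ok()
{
    const int n = 2 * HALF;
    int h[n];
    for (int i = 0; i < n; ++i) h[i] = i;

    int *d = nullptr;
    if (cudaMalloc(&d, sizeof(h)) != cudaSuccess) return false;
    cudaMemcpy(d, h, sizeof(h), cudaMemcpyHostToDevice);
    reverse_halves<<<1, n>>>(d);
    bool ok = cudaMemcpy(h, d, sizeof(h), cudaMemcpyDeviceToHost) == cudaSuccess;
    cudaFree(d);

    for (int i = 0; ok && i < n; ++i) {
        int group = i / HALF, lane = i % HALF;
        ok = (h[i] == group * HALF + (HALF - 1 - lane));
    }
    return ok;
}
```

This only shows the mechanics. Whether mixing it with __syncthreads() in the same kernel is safe is exactly the unspecified part, so the earlier caution about implementation details still applies.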
