Inter-threads communication

man82bs · June 30, 2009, 2:11am

I am having a problem with inter-thread communication.

I made only one block and grid.

void main()
{
dim3 block(32,1);
dim3 grid(1,1);

Test<<<grid, block>>>(x);

}

So I thought below code worked as I wanted.

globla void Test(int * x)
{
shared int temp = *x;

            __syncthreads();

            temp++;

            __syncthreads();

            *x = temp;

}

I expected *x gave me 32.

It only worked well in EmuDebug.

In Release, above code gave me 1.

I have no idea what the problem is.

Can you help me??

Thanks.

tmurray · June 30, 2009, 2:17am

that increment is a giant race condition. __syncthreads() is not a critical section, it’s a barrier synchronization. have you done parallel programming before?

man82bs · June 30, 2009, 5:33am

no…I thought __syncthreads() acts like critical section. But…it’s not…

Is there anything like critical section can be used in cuda??

avidday · June 30, 2009, 5:41am

Basically no. The closest thing is a set of atomic memory operations which can operate on global memory (and shared memory if you have a compute capability 1.3 card). So you could implement your kernel using an atomicAdd(), for example.

Topic		Replies	Views
__syncthreads() issue CUDA Programming and Performance	10	1167	February 10, 2011
IntelliSense: identifier "__syncthreads" is undefined CUDA Programming and Performance	1	7892	March 1, 2012
Early return and __syncthreads() function CUDA Programming and Performance synchronization	4	499	May 15, 2024
syncthreads error? CUDA Programming and Performance	16	33188	June 2, 2008
Simple kernel producing wrong results: CUDA Programming and Performance	2	674	May 3, 2014
How can I be certain my Kernel runs with 32 threads in one block and thus perfect synchrony? (ie. via __syncthreads()) CUDA Programming and Performance	15	77	August 21, 2024
cudaLaunchCooperativeKernel and syncthreads CUDA Programming and Performance	1	244	June 30, 2024
What happens when I call __syncthreads() in a warp group? CUDA Programming and Performance	6	87	June 27, 2025
Understanding a spinlock implementation by Robert Crovella CUDA Programming and Performance	6	1678	September 26, 2023
Race condition in for loop Help! CUDA Programming and Performance	8	3286	September 10, 2008

Inter-threads communication

Related topics