Set of lines of code executed by only one thread

sanf · February 1, 2012, 10:37am

Hi,

  How can I make specific lines of a CUDA kernel execute on only a single thread?

  For example, memory has to be allocated for 50 elements (nodes of a linked list) inside a CUDA kernel on which 50 threads are working, i.e. each thread works on one element.

  But the memory should be allocated only once.

  Is there any way to handle such a situation?

Thanks

tera · February 1, 2012, 12:28pm

__syncthreads();
if ((threadIdx.x == 0) && (threadIdx.y == 0) && (threadIdx.z == 0))
    {...}
__syncthreads();
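
To make the pattern above concrete for the linked-list case from the question, here is a minimal sketch, not a definitive implementation. It assumes a single block launched in one dimension, uses in-kernel malloc() (available on compute capability 2.0 and later), and the names Node, build_list and run_build_list are hypothetical, chosen only for illustration. The leading __syncthreads() is left out because no earlier code in this kernel touches the shared pointer.

```cuda
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical node type, for illustration only.
struct Node {
    int   value;
    Node *next;
};

// Assumes a 1D launch with a single block of n threads, e.g. <<<1, 50>>>.
__global__ void build_list(int *out_len, int n)
{
    // One pointer shared by every thread of the block.
    __shared__ Node *nodes;

    const bool is_first =
        (threadIdx.x == 0) && (threadIdx.y == 0) && (threadIdx.z == 0);

    if (is_first) {
        // Executed by exactly one thread of the block: allocate once.
        nodes = static_cast<Node *>(malloc(n * sizeof(Node)));
    }
    __syncthreads();  // the pointer is now visible to the whole block

    // Uniform exit: every thread reads the same shared pointer.
    if (nodes == nullptr) return;

    // Each thread initializes and links its own node.
    const int i = threadIdx.x;
    if (i < n) {
        nodes[i].value = i;
        nodes[i].next  = (i + 1 < n) ? &nodes[i + 1] : nullptr;
    }
    __syncthreads();  // all nodes are linked before anyone walks the list

    if (is_first) {
        // Again one thread only: count the nodes and release the memory.
        int len = 0;
        for (Node *p = nodes; p != nullptr; p = p->next) ++len;
        *out_len = len;
        free(nodes);  // device-heap memory must be freed on the device
    }
}

// Host helper: returns the list length counted on the device, or -1 on error.
int run_build_list(int n)
{
    int *d_len = nullptr;
    if (cudaMalloc(&d_len, sizeof(int)) != cudaSuccess) return -1;
    cudaMemset(d_len, 0, sizeof(int));

    build_list<<<1, n>>>(d_len, n);

    int len = -1;
    // cudaMemcpy waits for the kernel on the default stream to finish.
    if (cudaMemcpy(&len, d_len, sizeof(int), cudaMemcpyDeviceToHost) != cudaSuccess)
        len = -1;
    cudaFree(d_len);
    return len;
}
```

Calling run_build_list(50) from main() launches 50 threads, yet malloc() runs only once. With more than one block, each block's first thread would perform its own allocation.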

cmaster.matso · February 3, 2012, 8:17am

Will this work only for a single block? If we have more than one, the operations will be done by the first thread of every block, am I right?

pasoleatis · February 3, 2012, 9:02am

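For multiple blocks, you can use the __threadfence() function. Memory writes made by a thread before __threadfence() become visible to all blocks before any writes it makes after the fence. Note that it only orders memory writes; it does not make other blocks wait.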
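
One common way to run a section exactly once across the whole grid is to combine __threadfence() with an atomic counter, so that whichever block finishes last does the once-only step. The sketch below works under stated assumptions: the kernel, the counter blocks_done, and the host helper run_sum_blocks are hypothetical names, and the per-block work is a deliberately simple serial sum.

```cuda
#include <vector>
#include <cuda_runtime.h>

// Hypothetical counter of finished blocks; must be zero before each launch.
__device__ unsigned int blocks_done = 0;

// partial is volatile so the final reads come from memory, not a stale cache.
__global__ void sum_blocks(const int *in, volatile int *partial, int *total, int n)
{
    if (threadIdx.x != 0) return;  // one thread per block does the bookkeeping

    // Per-block work: sum this block's slice of the input.
    int begin = (int)(blockIdx.x * blockDim.x);
    int end   = begin + (int)blockDim.x;
    if (end > n) end = n;
    int s = 0;
    for (int i = begin; i < end; ++i) s += in[i];
    partial[blockIdx.x] = s;

    // Order the partial-sum write before the counter increment,
    // as observed by every other block.
    __threadfence();

    // atomicInc returns the old value; the last block sees gridDim.x - 1.
    unsigned int ticket = atomicInc(&blocks_done, gridDim.x);
    if (ticket == gridDim.x - 1) {
        // Executed by exactly one thread in the whole grid.
        int t = 0;
        for (unsigned int b = 0; b < gridDim.x; ++b) t += partial[b];
        *total = t;
        blocks_done = 0;  // reset so the kernel can be launched again
    }
}

// Host helper: sums n ones on the device; returns the total, or -1 on error.
int run_sum_blocks(int n, int threads_per_block)
{
    int blocks = (n + threads_per_block - 1) / threads_per_block;
    std::vector<int> h_in(n, 1);

    int *d_in = nullptr, *d_partial = nullptr, *d_total = nullptr;
    if (cudaMalloc(&d_in, n * sizeof(int)) != cudaSuccess) return -1;
    if (cudaMalloc(&d_partial, blocks * sizeof(int)) != cudaSuccess) {
        cudaFree(d_in);
        return -1;
    }
    if (cudaMalloc(&d_total, sizeof(int)) != cudaSuccess) {
        cudaFree(d_in);
        cudaFree(d_partial);
        return -1;
    }
    cudaMemcpy(d_in, h_in.data(), n * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemset(d_total, 0, sizeof(int));

    sum_blocks<<<blocks, threads_per_block>>>(d_in, d_partial, d_total, n);

    int total = -1;
    if (cudaMemcpy(&total, d_total, sizeof(int), cudaMemcpyDeviceToHost) != cudaSuccess)
        total = -1;
    cudaFree(d_in);
    cudaFree(d_partial);
    cudaFree(d_total);
    return total;
}
```

No block ever waits for another here: the fence only guarantees that once the last block reads the counter, the partial sums written by the earlier blocks are visible to it. If the once-only step has to run before the other blocks continue, a separate kernel launch is the simpler choice.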
