I am writing an algorithm that needs to execute its tasks in a specific order. To work within this restriction, I have been calling my kernel with 1 block and X threads. I then just increment a counter in the kernel to deal with the fact that I have fewer threads than vector elements.
In some tests, I found that if I increase the number of blocks to allow for 1 thread per element, the computation time is cut nearly in half (obviously…). Is there any way to make the blocks execute sequentially?
You can use atomicAdd(), described in Appendix B.11 of the CUDA C Programming Guide, to hand out block IDs in the order blocks are scheduled.
For example:
// Kernel skeleton: each thread block atomically claims a logical block ID,
// so IDs are handed out in the order the hardware actually schedules blocks.
// d_atomicBlockID must point to a device int initialized to 0 before launch.
__global__ void foo( int *d_atomicBlockID, int num_blocks,...)
{
__shared__ int s_blockId ;
if ( threadIdx.x == 0 ){
// One thread per block draws the next ID from the global counter.
s_blockId = atomicAdd( d_atomicBlockID, 1 ) ;
}
__syncthreads(); // barrier: make s_blockId visible to every thread in the block
int bid = s_blockId; // all threads in this thread block work on logical block bid
if (bid >= num_blocks ){
return ; // no work left for this block
}
// do your computation on block bid
}
// Host driver: allocates and zeroes the block-ID counter, launches the
// kernel, checks for both launch and asynchronous execution errors, and
// releases the allocation before exiting.
int main(void)
{
cudaError_t status ;
int *d_atomicBlockID = NULL ;
status = cudaMalloc((void**)&d_atomicBlockID, sizeof(int));
assert( cudaSuccess == status );
// Counter must start at zero so the first scheduled block receives ID 0.
status = cudaMemset(d_atomicBlockID, 0, sizeof(int));
assert( cudaSuccess == status );
// ... prepare other data, and set grid, threads, num_blocks ...
foo<<< grid, threads>>>( d_atomicBlockID, num_blocks /*, ... other args */ ); // fix: missing ';' in original
// cudaGetLastError() only catches launch-configuration errors.
status = cudaGetLastError();
assert( cudaSuccess == status );
// Synchronize to surface asynchronous errors from the kernel itself.
status = cudaDeviceSynchronize();
assert( cudaSuccess == status );
// fix: original leaked this allocation.
status = cudaFree(d_atomicBlockID);
assert( cudaSuccess == status );
// cudaThreadExit() is deprecated; cudaDeviceReset() is its replacement.
cudaDeviceReset();
return 0 ;
}