difference between threadIdx, blockIdx statements

Whitchurch · September 15, 2009, 6:39am

Hi,
can someone explain to me the difference between
threadIdx.x, threadIdx.y and its other variant

blockIdx.x , blockIdx.y and its other variant .

Vector11 · September 22, 2009, 4:00pm

The best way to understand these values is to look at some of the schematics in the Introduction to CUDA Programming document, but I’ll give an explanation a shot.

Basically, threadIdx.x and threadIdx.y are the numbers associated with each thread within a block. Let’s say you declare your block size to be one-dimensional with a size of 8 threads (normally you would want a multiple of 32, like 192 or 256, depending on your specific code). Within each block, threadIdx.x then takes the values 0,1,2,3,4,5,6 and 7, one value per thread. If you declared a two-dimensional block size (say (3,3)), then threadIdx.x would be 0,1,2 and you would now also have a threadIdx.y value of 0,1,2. There are actually nine threads associated with the (3,3) block size, one for each (threadIdx.x, threadIdx.y) pair. For instance, the thread indices (0,0), (0,1), (1,2), etc. refer to independent threads. This convention is very useful for two-dimensional applications like working with matrices. Remember, threadIdx.x starts at 0 for each block. Your block can have up to three dimensions, which allows for a threadIdx.z index as well.
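To make the (3,3) case concrete, here is a minimal sketch that launches a single 3x3 block and has every thread record its own threadIdx.x and threadIdx.y. The names recordThreadIdx and runRecordThreadIdx are made up for this illustration, and flattening the 2D index into a slot number is just one convenient way to give each thread its own output location.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Each of the nine threads writes its own (x, y) index into a flat array.
__global__ void recordThreadIdx(int *xs, int *ys)
{
    // Flatten the 2D thread index into one slot; x varies fastest.
    int slot = threadIdx.y * blockDim.x + threadIdx.x;
    xs[slot] = threadIdx.x;
    ys[slot] = threadIdx.y;
}

// Hypothetical helper (not part of CUDA): launches one (3,3) block and
// copies the recorded indices back to the host. Returns true on success.
bool runRecordThreadIdx(int *hostXs, int *hostYs)
{
    const size_t bytes = 9 * sizeof(int);
    int *dXs = nullptr, *dYs = nullptr;
    if (cudaMalloc(&dXs, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&dYs, bytes) != cudaSuccess) { cudaFree(dXs); return false; }

    dim3 block(3, 3);                      // 3 x 3 = 9 threads in the block
    recordThreadIdx<<<1, block>>>(dXs, dYs);

    // cudaMemcpy waits for the kernel to finish before copying.
    bool ok = cudaMemcpy(hostXs, dXs, bytes, cudaMemcpyDeviceToHost) == cudaSuccess
           && cudaMemcpy(hostYs, dYs, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    cudaFree(dXs);
    cudaFree(dYs);
    return ok;
}

int main()
{
    int xs[9], ys[9];
    if (!runRecordThreadIdx(xs, ys)) {
        fprintf(stderr, "CUDA call failed\n");
        return 1;
    }
    // Slot i holds threadIdx = (i % 3, i / 3).
    for (int i = 0; i < 9; ++i)
        printf("slot %d: threadIdx = (%d, %d)\n", i, xs[i], ys[i]);
    return 0;
}
```

All nine (x, y) pairs from (0,0) to (2,2) show up exactly once, which is the sense in which a (3,3) block has nine independent threads.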

blockIdx.x and blockIdx.y refer to the label associated with a block in a grid. You are allowed up to a 2-dimensional grid (allowing for blockIdx.x and blockIdx.y); devices of compute capability 2.0 and later also allow a third grid dimension, blockIdx.z. Basically, the blockIdx.x variable is similar to the thread index except it refers to the number associated with the block.

Let’s say you want 2 blocks in a 1D grid with 5 threads in each block. Your threadIdx.x would be 0, 1,…,4 for each block and your blockIdx.x would be 0 and 1 depending on the specific block.

Now, let’s say you want to load an array of 10 values into a kernel using these two blocks of 5 threads. How would you do this since your thread index only goes 0 - 4 for each block? You would use a third parameter given in CUDA – blockDim.x. This holds the size of the block (in this case blockDim.x = 5). You can refer to the specific element in the array by saying something like…

int idx = blockDim.x*blockIdx.x + threadIdx.x;

This makes idx = 0,1,2,3,4 for the first block because blockIdx.x for the first block is 0. The second block picks up where the first left off because blockIdx.x = 1 and blockDim.x = 5. This makes idx = 5,6,7,8,9 for the second block.
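As a rough sketch of the two-blocks-of-five example, the kernel below computes idx exactly as above and records which block and which thread handled each of the 10 elements. The function names recordGlobalIdx and runRecordGlobalIdx are invented for this illustration.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Each thread computes its global index and records which block and which
// thread-within-block it is.
__global__ void recordGlobalIdx(int *blockOf, int *threadOf)
{
    int idx = blockDim.x * blockIdx.x + threadIdx.x;
    blockOf[idx]  = blockIdx.x;
    threadOf[idx] = threadIdx.x;
}

// Hypothetical helper (not part of CUDA): launches 2 blocks of 5 threads
// over 10 elements and copies the results back. Returns true on success.
bool runRecordGlobalIdx(int *hostBlockOf, int *hostThreadOf)
{
    const int numBlocks = 2, threadsPerBlock = 5;
    const size_t bytes = numBlocks * threadsPerBlock * sizeof(int);
    int *dBlockOf = nullptr, *dThreadOf = nullptr;
    if (cudaMalloc(&dBlockOf, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&dThreadOf, bytes) != cudaSuccess) { cudaFree(dBlockOf); return false; }

    recordGlobalIdx<<<numBlocks, threadsPerBlock>>>(dBlockOf, dThreadOf);

    bool ok = cudaMemcpy(hostBlockOf, dBlockOf, bytes, cudaMemcpyDeviceToHost) == cudaSuccess
           && cudaMemcpy(hostThreadOf, dThreadOf, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    cudaFree(dBlockOf);
    cudaFree(dThreadOf);
    return ok;
}

int main()
{
    int blockOf[10], threadOf[10];
    if (!runRecordGlobalIdx(blockOf, threadOf)) {
        fprintf(stderr, "CUDA call failed\n");
        return 1;
    }
    // Elements 0-4 come from block 0, elements 5-9 from block 1.
    for (int i = 0; i < 10; ++i)
        printf("idx %d: blockIdx.x = %d, threadIdx.x = %d\n", i, blockOf[i], threadOf[i]);
    return 0;
}
```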

Once again, refer to the beginner manual for more on this subject. Hope this helps.

jdj · February 7, 2011, 8:05pm

Thanks a lot for that

ngarach · October 19, 2011, 2:44am

Thanks a lot that was very helpful.

sanf · October 19, 2011, 2:26pm

In my case I have 100 elements. I want to use 10 threads per block.

num_blocks = 100 / num_threads;

So num_blocks = 10, and the block indices (blockIdx.x) are 0,1,2,3,4,5,6,7,8,9.

And threadIdx.x will again be 0,1,2,3…8,9 in each block.

What is the value of blockDim.x? How do I find it out? Is it the same value as the number of threads per block, i.e. 10? (At least in a 1D block, 1D thread model.)

AnuragUnnikrishnan · December 22, 2015, 9:06pm

I am not sure if you are still following this but I am obligated to answer your question.

blockDim.‘?’ refers to the number of threads in the block along that dimension. Here, ‘?’ is either x, y or z, since a block can have a 3-D arrangement of threads. A grid (the bigger picture) was limited to a 2-D arrangement of blocks, i.e. only an ‘x’ and a ‘y’ dimension, on early hardware; devices of compute capability 2.0 and later also allow a ‘z’ dimension.
In the example cited above, blockDim.x is the same as the number of threads in the block, i.e. 10, since the block in question is 1-D and hence has only an ‘x’ dimension.
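One way to check this is to have every thread store the blockDim.x it sees. The sketch below does that for 100 elements with 10 threads per block; recordBlockDim and runRecordBlockDim are names made up for the illustration. It also rounds the block count up and guards the index, which is an addition on my part for the case where the element count is not an exact multiple of the block size (plain 100 / 10 is fine here because it divides evenly).

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Every thread writes the block size it observes into its own element.
__global__ void recordBlockDim(int *out, int n)
{
    int idx = blockDim.x * blockIdx.x + threadIdx.x;
    if (idx < n)                 // guard: the last block may have spare threads
        out[idx] = blockDim.x;
}

// Hypothetical helper (not part of CUDA): launches enough blocks to cover
// n elements. Returns the number of blocks launched, or -1 on a CUDA error.
int runRecordBlockDim(int *hostOut, int n, int threadsPerBlock)
{
    // Round up so that n need not be a multiple of threadsPerBlock.
    int numBlocks = (n + threadsPerBlock - 1) / threadsPerBlock;

    int *dOut = nullptr;
    if (cudaMalloc(&dOut, n * sizeof(int)) != cudaSuccess) return -1;

    // The second launch parameter is what the kernel sees as blockDim.x.
    recordBlockDim<<<numBlocks, threadsPerBlock>>>(dOut, n);

    bool ok = cudaMemcpy(hostOut, dOut, n * sizeof(int),
                         cudaMemcpyDeviceToHost) == cudaSuccess;
    cudaFree(dOut);
    return ok ? numBlocks : -1;
}

int main()
{
    int out[100];
    int numBlocks = runRecordBlockDim(out, 100, 10);
    if (numBlocks < 0) {
        fprintf(stderr, "CUDA call failed\n");
        return 1;
    }
    // With 100 elements and 10 threads per block: 10 blocks, blockDim.x == 10.
    printf("blocks launched: %d\n", numBlocks);
    printf("blockDim.x seen by element 0: %d, by element 99: %d\n", out[0], out[99]);
    return 0;
}
```

So blockDim.x is not something you look up separately: it is whatever threads-per-block value you passed in the kernel launch.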

dev10e12 · January 24, 2018, 2:26am

A good starting point for beginners is available here.
[link no longer available]

warren.yung.wang · May 30, 2018, 11:11pm

Great explanation! After watching a few basic CUDA videos on YouTube I was still a bit confused about their meanings, but this cleared everything up.

Thanks!

eddie.forgacs · October 19, 2022, 1:28am

This link is no longer valid. Judging by the name of the PDF file, however, I’m guessing it was supposed to be this: https://www.nvidia.com/docs/IO/116711/sc11-cuda-c-basics.pdf

trivedi.nagaraj · July 2, 2023, 7:43am

Very good explanation.

akhilgodvsdemon · May 14, 2024, 2:55am

Thanks, signed up only to like your comment
