2D grid and 1D Thread Block

onepieceking · August 20, 2008, 3:00am

Hi,

Is it possible to call a kernel call with 2D grid and 1D thread block?

I have an array that is very big and I have to multiply each element with a constant.

If I use a 1D grid and 1D thread block, I cannot fit everything in. I was thinking of 2D grid, but I don’t know how to get the thread Id.

For 1d grid and 1d thread block,
int idx = blockIdx.x * blockDim.x + threadId.x

For 2d grid and 1d thread block?

E.D_Riedijk · August 20, 2008, 5:28am

for example

idx = (blockIdx.y * gridDim.x + blockIdx.x) * blockDim.x + threadId.x

in fact you only have to be careful when using 2/3D blocks, grids is no problem

onepieceking · August 20, 2008, 7:44am

Then for 2d grid and 2d thread blocks?

AndyL · August 20, 2008, 7:51am

Well if you want to access a 1D vector type array then I use the following:

int id = threadIdx.x+blockDim.xthreadIdx.y+(blockIdx.xblockDim.xblockDim.y)+(blockIdx.yblockDim.x*blockDim.y);

Remember you need to define the blocks and grids as multidimensional using
uint3 tids = make_uint3(TIDSX, TIDSY, 1); or similar

VrahoK · August 20, 2008, 12:28pm

almost, just missing the gridDim.x for the blockIdx.y:

int id = threadIdx.x+blockDim.xthreadIdx.y+(blockIdx.xblockDim.xblockDim.y)+(blockIdx.yblockDim.xblockDim.ygridDim.x);

now you can access blockDim.xblockDim.ygridDim.x*gridDim.y elements in your 1D vector

AndyL · August 20, 2008, 12:52pm

Ahhhh… yes indeed. Thanks for the correction!

onepieceking · August 21, 2008, 3:36am

Thank alot… But, i still don’t quite understand how you guys get the “equation” to find the ID…

If possible, can explain abit?

VrahoK · August 21, 2008, 8:04am

Lets say you have a 2x2 block and a 3x3 grid.

So you have 4 elements per block and 9 blocks per grid making a total of 36 adressable elements

0th element: tid.x = tid.y = bid.x = bid.y = 0

id = 0 + 02 + 022 + 0223 = 0

1st element: tid.x = 1 all others = 0:

id = 1 + …(0) = 1

2nd element: tid.x is zero again but tid.y = 1:

id = 0 + 12 + 022 + 0223 = 2

and so on…

so you see you have 4 “directions” to count:

threadIdx.x (innermost): if it reaches the blockDim.x border, it is reset to zero and threadIdx.y is increased by one.

threadIdx.y (number of lines of blockDim.x elements): if this one reaches the blockDim.y border, it is reset to zero and the blockIdx.x is increased by one.

blockIdx.x (number of blocks or in the 1D case “lines” of blockDim.x*blockDim.y elements): if this one reaches the gridDim.x border, it is reset to zero and the blockIdx.y is increased by one.

blockIdx.y (number of blocks or in the 1D case “lines” of blockDim.xblockDim.ygridDim.x elements): if this one reaches the gridDim.y border, the kernel is finished.

so your order in the 2x2 and 3x3 case would be:

+---------------------+

|+-----++-----++-----+|

|| 0  1|| 4  5|| 8  9||

|| 2  3|| 6  7||10 11||

|+-----++-----++-----+|

|+-----++-----++-----+|

||12 13||16 17||20 21||

||14 15||18 19||22 23||

|+-----++-----++-----+|

|+-----++-----++-----+|

||24 25||28 29||32 33||

||26 27||30 31||34 35||

|+-----++-----++-----+|

+---------------------+

Topic		Replies	Views
hitting the grid size limitation CUDA Programming and Performance	5	1401	November 13, 2009
Quick Thread Question Regarding Calling a kernel CUDA Programming and Performance	13	3613	June 26, 2008
simple question about blockIdx/gridDim CUDA Programming and Performance	7	9499	May 18, 2009
How to define iDs grids, blocks & threads? CUDA Programming and Performance	4	5454	February 24, 2008
block and thread IDs? CUDA Programming and Performance	2	2238	April 20, 2008
3D threads index How do you index 3D threads?!!! CUDA Programming and Performance	2	4501	April 1, 2010
difference between threadIdx, blockIdx statements CUDA Programming and Performance	10	154938	May 14, 2024
Unique Block Index in 3D Grid CUDA Programming and Performance	6	1259	November 27, 2024
Whats wrong with this simple kernel call? Invalid Configuration Argument (with empty Kernel) CUDA Programming and Performance	16	9035	November 23, 2009
Thread id and thread index in 2D CUDA Programming and Performance	4	6138	November 20, 2014

2D grid and 1D Thread Block

Related topics