Why is the z dimension smaller than the total thread block size limit

j.jozefowicz · March 28, 2023, 11:10am

Hi, this might not be the exact same case on all gpus, but on my p1000 the max dimensions of a thread block are listed as (1024,1024,64) with max number of threads per block being 1024.

I initially thought that the 3d blocks are just an abstraction and can be treated as if they had a layout of a 1d array so for example for a warp size of 32 a threadblock of dimensions (16,2,1) would execute in 1 warp, same as a threadblock of dimensions (32,1,1). But if that’s the case then there shouldn’t be a reason that a threadblock of size (1,1,1024) wouldn’t be same as a threadblock of dimensions (1,1024,1) or (1024,1,1). So why is this limitation in place?

Robert_Crovella · March 28, 2023, 1:53pm

It’s a hardware limit. You can think of a 3D block as being an “abstraction” of a 1D block, but there is more to it than that. The hardware, for example, supports the retrieval of the thread indices, and this means retrieval of 3 dimensional thread indices. That is just one example of the way the hardware interacts with the code in this case. So the hardware has limits as to what it can support.

Topic		Replies	Views
Block dimensions in CUDA CUDA Programming and Performance	0	868	November 4, 2011
Question regarding maximum amount of blocks CUDA Programming and Performance	2	795	January 28, 2011
Question about grid/block/thread sizes CUDA Programming and Performance	3	12280	November 13, 2012
is there a limitation for total number of threads? CUDA Programming and Performance	5	5271	October 22, 2009
Confusion about thread per block CUDA Programming and Performance	1	792	July 24, 2009
CUDA - thread block confusion concept clearity sought CUDA Programming and Performance	6	3001	November 10, 2011
What is the maximum number of blocks I can use? CUDA Programming and Performance	3	2823	February 8, 2022
Thread Number Limitation CUDA Programming and Performance	3	3890	December 22, 2008
3D thread blocks and arrays CUDA Programming and Performance	3	3425	December 3, 2008
Maximum possible number of threads (Total) CUDA Programming and Performance	1	1009	December 28, 2009

Why is the z dimension smaller than the total thread block size limit

Related topics