What is the maximum number of blocks I can use?

user143091 · January 25, 2022, 9:06pm

So, I have been reading for a while, and I have not quite understood this yet.

My understanding as of now is this:

A single block can have at most 1024 threads. The block can be structured in 1, 2 or 3 dimensions, but the product of the dimension sizes must not exceed 1024. So an 8x8x8 block configuration would work, because it holds 512 threads, but a 16x16x16 configuration would not, because its 4096 threads exceed the maximum of 1024. In addition, the three dimensions must not exceed sizes of 1024, 1024 and 64, respectively. Is this correct?
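The rule described above can be written down as a quick check. This is only a sketch in Python (not CUDA), assuming the limits quoted in this post (1024, 1024 and 64 per dimension, 1024 threads in total); `block_shape_ok` is a made-up helper name, and the real values for a given GPU should be taken from deviceQuery.

```python
from math import prod

# Limits as quoted in the post; assumptions here, not queried from a device.
MAX_BLOCK_DIM = (1024, 1024, 64)   # per-dimension limits (x, y, z)
MAX_THREADS_PER_BLOCK = 1024       # limit on x * y * z


def block_shape_ok(shape):
    """Return True if a 1-, 2- or 3-D block shape satisfies both limits."""
    if not 1 <= len(shape) <= 3:
        return False
    # Pad missing dimensions with 1, the way dim3 defaults them.
    dims = tuple(shape) + (1,) * (3 - len(shape))
    per_dim_ok = all(1 <= d <= limit for d, limit in zip(dims, MAX_BLOCK_DIM))
    return per_dim_ok and prod(dims) <= MAX_THREADS_PER_BLOCK


print(block_shape_ok((8, 8, 8)))      # 512 threads in total
print(block_shape_ok((16, 16, 16)))   # 4096 threads in total
print(block_shape_ok((1, 1, 128)))    # z dimension is larger than 64
```

Both conditions have to hold at once: a shape like 1x1x128 has only 128 threads but still fails because of the per-dimension limit on z.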

As a block can be structured multi-dimensionally, so can the grid containing the blocks. But I am not sure what the maximum number of blocks I can use is.

Here is the result of the deviceQuery example on my computer:

It says that the grid dimensions must not exceed 2^31-1, 2^16-1 and 2^16-1, respectively. But what is the limit for the product of those dimensions' sizes? For the threads inside a block, it is 1024. What is the limit for the blocks inside the grid? And where can I see it in the output of deviceQuery?

Robert_Crovella · January 25, 2022, 9:19pm

Yes.

There is no published limit on the product (and therefore nothing reported in deviceQuery). As a test, you can try launching a kernel of maximal dimensions. As long as you don’t run into another resource limit, it should work. I’ve only done this with an empty kernel (to convince myself); if you tried this with a kernel that did anything “meaningful”, that kernel would possibly take “forever” to run.

Regarding “forever”: The product of those 3 numbers (2^31-1, 2^16-1, 2^16-1) is over 9 quintillion blocks (about 9.2 × 10^18). If the GPU required 1 nanosecond to process each block, on average, the kernel would require almost 300 years to run. So exploring the envelope is not really practical, IMO.
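The arithmetic behind that estimate is easy to reproduce. A rough sketch in Python, assuming the three grid limits given above and the same hypothetical cost of 1 nanosecond per block (the variable names are only illustrative):

```python
# Per-dimension grid limits (x, y, z) as stated in the reply.
MAX_GRID_DIM = (2**31 - 1, 2**16 - 1, 2**16 - 1)
NS_PER_BLOCK = 1  # assumed average processing time per block, in nanoseconds

# Number of blocks in a grid of maximal dimensions.
total_blocks = MAX_GRID_DIM[0] * MAX_GRID_DIM[1] * MAX_GRID_DIM[2]

# Convert the total run time from nanoseconds to years.
seconds = total_blocks * NS_PER_BLOCK / 1e9
years = seconds / (365.25 * 24 * 3600)

print(f"blocks: {total_blocks:.3e}")
print(f"years:  {years:.0f}")
```

This gives roughly 9.2 × 10^18 blocks and a run time of a little over 290 years, in line with the "almost 300 years" figure.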

user143091 · January 25, 2022, 9:27pm

Thanks for the quick response

