block numbers related to the number of SMs blocks in multiple SMs

guitarmas · December 1, 2009, 4:07am

Hi all,

While I was trying to write a CUDA program, I ran into a question about the number of blocks across multiple SMs.

From my understanding, each SM can have up to 8 blocks. The GeForce GTX 285 has 30 multiprocessors. The Programming Guide says that a device with more multiprocessors will automatically execute a kernel grid in less time than a device with fewer multiprocessors. Does this mean that I can assign more than 8 blocks when I choose the number of blocks? For example,

[codebox]dim3 dimBlock(256);

dim3 dimGrid(16);[/codebox]

then the number of threads is 256*16=2048 and the number of blocks is 16. Both exceed the constraints of 1024 threads per SM and 8 blocks per SM. Will each of the 30 SMs automatically take blocks while staying within the constraints? I am so confused… Could anyone help me to understand this?

Thank you for reading. I appreciate your time!

Chulho

avidday · December 1, 2009, 7:21am

You can have as many blocks as you need as long as you keep the grid size to no more than 65535x65535. As long as the per-block resource requirements of your kernel and execution parameters don’t exceed the per-multiprocessor limits described in Appendix A of the programming guide, the GPU will just keep running blocks until all are executed. The blocks in a grid don’t all have to run at the same time.
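To make the arithmetic concrete, here is a rough host-side sketch in plain C++ (no GPU needed). It assumes only the per-SM figures quoted in the question (8 blocks per SM, 1024 threads per SM, 30 SMs) and ignores register and shared-memory limits, which can lower the real number. The names `SmLimits`, `resident_blocks_per_sm` and `waves_for_grid` are made up for illustration and are not part of any CUDA API. The "waves" figure is an idealisation: the hardware hands out a new block whenever a resident one finishes rather than running in lockstep rounds.

```cpp
#include <algorithm>
#include <cassert>

// Per-SM limits as quoted in the question (assumed values, not queried
// from a device). Register and shared-memory limits are ignored here.
struct SmLimits {
    int max_blocks_per_sm;   // e.g. 8
    int max_threads_per_sm;  // e.g. 1024
    int sm_count;            // e.g. 30 on a GeForce GTX 285
};

// Blocks that can be resident on ONE SM at the same time: whichever of
// the block-count limit and the thread-count limit is hit first.
int resident_blocks_per_sm(const SmLimits& lim, int threads_per_block) {
    assert(threads_per_block > 0 &&
           threads_per_block <= lim.max_threads_per_sm);
    return std::min(lim.max_blocks_per_sm,
                    lim.max_threads_per_sm / threads_per_block);
}

// Idealised number of passes needed to run every block of the grid,
// given how many blocks the whole device can hold at once.
int waves_for_grid(const SmLimits& lim, int threads_per_block,
                   int grid_blocks) {
    assert(grid_blocks > 0);
    int concurrent =
        resident_blocks_per_sm(lim, threads_per_block) * lim.sm_count;
    return (grid_blocks + concurrent - 1) / concurrent;  // ceiling division
}
```

With the numbers from the question, `resident_blocks_per_sm({8, 1024, 30}, 256)` gives 4, so the device can hold 4*30=120 such blocks at once and the 16-block grid fits in a single pass. A 1000-block grid with the same block size would take 9 passes under this simplified model. The limits apply to what is resident on each SM at one moment, not to the grid as a whole.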

NVIDIA provides an occupancy calculation spreadsheet which lets you see what effect different kernel resource requirements and execution parameters will have on the occupancy of the GPU. There is a link in a sticky thread in the programming and development forum where you can download it, if you don’t already have a copy in the SDK.
