maximum threads per block

lwan61c1t3 · October 24, 2014, 5:21pm

Hi all,

Is there any function that can be used to get the maximum_threads_per_block for a specific CUDA kernel? I know there is maxThreadsPerBlock, but it seems for the device not for a specific kernel.

njuffa · October 24, 2014, 5:38pm

If I understand your question correctly, the new occupancy calculator API introduced with CUDA 6.5 should be helpful:

[url]http://devblogs.nvidia.com/parallelforall/cuda-pro-tip-occupancy-api-simplifies-launch-configuration/[/url]

susangao · October 29, 2014, 4:34pm

This is nice. Then technically, if I call this API to get the max active block number per SM, and set the grid size that is as same as the number of block that can be hosted by all SMs, then all blocks can be guaranteed to scheduled to a SM at the beginning instead of waiting for another block to finish. Is it correct?

Topic		Replies	Views
Maximum block per grid CUDA Programming and Performance cuda	4	4640	March 24, 2023
Setting block size and avoiding errors CUDA Programming and Performance	7	6394	November 15, 2008
Limit to Number of Blocks? Noob Question CUDA Programming and Performance	4	3100	May 16, 2008
how to determine max number of blocks per kernel CUDA Programming and Performance	10	17457	September 11, 2011
max number of block CUDA Programming and Performance	21	18229	April 20, 2010
Max gridDim.x ? CUDA Programming and Performance	7	4678	March 11, 2010
maximum threads per block not always used CUDA Programming and Performance	2	845	June 14, 2018
max thread per block and memory device question CUDA Programming and Performance	2	17074	January 9, 2009
False information from occupancy calculator? CUDA Programming and Performance	1	717	February 2, 2018
Maximum possible number of threads (Total) CUDA Programming and Performance	1	1077	December 28, 2009

maximum threads per block

Related topics