How to specific the number of SMs used in my program?

xiaodongyee · April 9, 2018, 3:44pm

Dear all,
If I want to launch a kernel with N threads, How can I make the gpu scheduler allocate SMs to my programs as more as possible?

Thanks for your reply

Robert_Crovella · April 9, 2018, 3:58pm

Other than via CUDA stream priorities, you have no control over the block scheduler in a GPU.

The heuristics of block scheduling are not published.

The GPU block scheduler will generally attempt to deliver blocks to SMs in such a way as to maximize throughput of your kernel. This generally means delivering blocks evenly to all available SMs.

You should strive for full occupancy of the GPU. As a target minimum, this means create kernels that contain at least 2048*(# of SMs in your GPU), total thread count (or more).

Topic		Replies	Views
Question about the number of SMs using in the program. CUDA Programming and Performance	3	797	April 9, 2018
Assign blocks to SMs CUDA Programming and Performance	5	1590	February 4, 2019
Is there any way to control the GPU block scheduler? CUDA Programming and Performance	1	236	May 27, 2024
Ensuring blocks per SM CUDA Programming and Performance	4	1085	February 20, 2012
hardware scheduling logic on the GPU CUDA Programming and Performance	2	730	December 7, 2012
understand the mapping of the block threads to SMs in GPU CUDA Programming and Performance	3	2725	August 2, 2018
how are blocks scheduled for execution? CUDA Programming and Performance	3	3433	December 9, 2016
Relation between SM and block CUDA Programming and Performance	1	5594	March 18, 2010
Number of blocks parameter for kernel when GPU has just one SM CUDA Programming and Performance	3	515	August 4, 2017
Limit number of (or allocate) SM on a per stream basis CUDA Programming and Performance	3	1482	November 14, 2023

How to specific the number of SMs used in my program?

Related topics