Distribution Threads by the SMs

luisgo · December 15, 2014, 4:04am

Dear All

 If I launch a kernel from host (that runs in a single SM, something like kernel<<<1,16>>>()) and then I launch another kernel from that kernel (inside the device), it will all run in the same SM, or not?

Thanks

Luis Gonçalves

Robert_Crovella · December 15, 2014, 5:05am

The behavior of the CUDA work distributor is mostly unspecified. If you launch two kernels, each of which only consist of one threadblock, most likely those two threadblocks will execute on separate SMs, assuming they are launched and run concurrently, and assuming your GPU has 2 or more SMs. But there is no guarantee of that behavior.

Topic		Replies	Views
Concurrent execution of kernels on the same SM CUDA Programming and Performance	1	554	October 28, 2021
Cuda multi stream schedule CUDA Programming and Performance	2	1571	October 11, 2023
Dynamic Parallelism CUDA Programming and Performance	1	372	December 13, 2017
Distribute copy (kernel) across multiple SM CUDA Programming and Performance	6	876	April 30, 2018
Multiple concurrent device processes using multiple concurrent host threads CUDA Programming and Performance	4	3770	January 26, 2009
Running CUDA kernels from two different pthreads CUDA Programming and Performance	7	2928	May 10, 2016
Run different kernels parallely on different SMs CUDA Programming and Performance	4	1126	June 22, 2018
Thread Block Scheduler uses disjoint SMs for 2 kernels in separate streams CUDA Programming and Performance	3	99	February 3, 2025
Can it occur that 2 kernels run at the same time if the 2 kernels are continuously launched? CUDA Programming and Performance	2	407	January 8, 2019
The speed of program run on multiple SMs is similar to the speed that run on single SM? CUDA Programming and Performance	1	404	September 25, 2021

Distribution Threads by the SMs

Related topics