Will the neighbor blocks be batched in the same SM?

jingzhengboy · June 18, 2009, 3:19am

As we know, several blocks can be batched in the same SM.

My question is: at the start, when all the SMs are idle, will neighboring blocks such as 1, 2, 3, … be batched onto the same SM?

That is:
1, 2, 3 → SM1
4, 5, 6 → SM2
…
(assume that one SM can contain three blocks)

Is that true?

Thank you for your answers!

cvnguyen · June 18, 2009, 4:54am

Not necessarily. How would that help you?

SPWorley · June 18, 2009, 5:08am

Not in general, no. An obvious counterexample is if you have a GTX280 with 30 SMs, and a kernel with a grid of 50 blocks. Every SM will end up getting an initial assignment of only 1 or 2 blocks, even though they could hold 3.

jingzhengboy · June 19, 2009, 2:45pm

Thank you for your answer, but will neighboring blocks be assigned to neighboring SMs?

That is to say:

1 → SM1

2 → SM2

…

Will they?

cbuchner1 · June 19, 2009, 3:08pm

Some blocks may terminate early (depending on the algorithm in the kernel), so you typically cannot expect blocks to be scheduled in sequence on the SMs.

Also, any CUDA update may change the scheduling mechanism, so relying on undocumented behavior will get you in trouble eventually.

tmurray · June 19, 2009, 8:24pm

Never try to guess where your blocks will go. You may be wrong!

(and if you ever write any code that depends on block scheduling, I will sit in the corner and be sad. or maybe yell at you a lot. actually yeah, probably the second)
