Scheduling block execution Do multiprocessors block each other?

Sarnath · June 7, 2010, 5:37am

SPWorley’s second idea does not really require atomics…

You can always code like:

for(int blk=blockIdx.x; blk<n; blk += gridDim.x)

{

	effect_blkid = blk;

/* code goes in here... */

}

All the developer needs to take care is : Spawn just enough blocks keep all the MPs busy… and this number varies from device to device… Spawning logic must take care of it…

Ashtey…

tmurray · June 7, 2010, 6:51am

if your heterogeneous runtimes are periodic, you risk very bad scheduling bubbles by statically scheduling that way.

SPWorley · June 7, 2010, 6:54am

That static scheduling technique might help too, but since it doesn’t dynamically assign work to idle SMs you’d still have a lot of inefficiency if one set of blocks happened to be a lot slower than others.

It also requires you to figure out at runtime exactly how many blocks can run simultaneously on your device, which is nontrival (though certainly possible).

tera · June 7, 2010, 10:58am

I’m surprised you state it does not work. I could understand (and might even expect) that with a little help from the hardware the new scheduling in Fermi outperforms the software implementation. But it’s hard to see why it should not work.

tmurray · June 7, 2010, 4:10pm

I meant that it’s going to always be a performance loss versus using the hardware scheduler.

tera · June 7, 2010, 5:07pm

Thanks for the clarification!