Hi Everyone,
Does anyone know if there is any way to disable some of the multiprocessors through CUDA?
Thanks
There is no way to disable multiprocessors.
Still, you can make sure they are not in use by launching a grid with a specific shape. However, what would be the actual purpose of that? Measuring some "speedup", or saving some power?
Can't you also create some "blank" kernels that some multiprocessors would be running? If you manage to figure out how warps are mapped, perhaps you can act as if some multiprocessors were disabled. But since you usually launch many more blocks than there are multiprocessors, this might become a little awkward to implement (without a dummy grid).
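A minimal sketch of the dummy-grid idea (the kernel name work_kernel and its body are hypothetical, not from this thread): query the MP count with cudaGetDeviceProperties and launch fewer blocks than there are MPs, so some of them are likely to sit idle; the driver still decides which ones actually get the blocks.
#include <cuda_runtime.h>

__global__ void work_kernel(float *data)
{
// placeholder work: one element per thread
data[blockIdx.x * blockDim.x + threadIdx.x] *= 2.0f;
}

int main()
{
cudaDeviceProp prop;
cudaGetDeviceProperties(&prop, 0); // MP count of device 0
int num_blocks = prop.multiProcessorCount / 2; // e.g. leave roughly half the MPs without a block
if (num_blocks < 1) num_blocks = 1;
float *d_data;
cudaMalloc(&d_data, num_blocks * 128 * sizeof(float));
work_kernel<<<num_blocks, 128>>>(d_data); // small grid: some MPs get no block at all
cudaDeviceSynchronize();
cudaFree(d_data);
return 0;
}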
How can I manage it so that only some multiprocessors run blank kernels? Any ideas?
You can’t. Blocks are executed in an undefined order that is different every time.
Why are you trying to do this in the first place?
So, even if I don't know which multiprocessors these blocks are assigned to, is there a way that I can have them run empty kernels? (Basically do nothing, like a wait?)
I suppose. Just run the computation with more blocks than it really needs, and use something like this:
__global__ void padded_kernel(int real_num_blocks /*, real arguments */)
{
__shared__ int a; // dummy shared variable so nvcc doesn't optimize the loop away
if (blockIdx.x >= real_num_blocks)
{
// extra "dummy" blocks: just burn some time
for (int i = 0; i < 10000; i++)
a++;
}
else
{
// real kernel.....
}
}
It’s horrible code, I know. The shared memory dummy variable is just to keep nvcc from optimizing away the loop. Adjust the 10000 to get longer/shorter delays from your dummy blocks. This really has no purpose except to make the kernel execute more slowly as you requested.
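A possible launch for the kernel sketched above (the block counts and the thread count are purely illustrative): the grid is made larger than what the real work needs, and the extra blocks only run the dummy loop.
int real_num_blocks = 64; // blocks that do real work
int padded_blocks = real_num_blocks + 32; // extra blocks that only spin
padded_kernel<<<padded_blocks, 256>>>(real_num_blocks /*, real arguments */);
cudaDeviceSynchronize();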
That way you only ensure that some blocks are doing nothing, not that a whole MP is idle. But you have the solution right there.
To really not compute on some MPs you have to have a kernel which eats all the registers or shared memory of one MP. So expand the shared memory usage to over 8000 bytes per block (more than half of the 16 KB an MP has, so only one block fits per MP) and change the 10000 to e.g. 10.
That way two blocks will execute the real part. As each block then needs a whole MP, they will execute on two MPs.
However, you cannot determine on which MPs.
Adjust the 10 the way you want.
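A minimal sketch of what that suggestion might look like (the kernel name and the array size are assumptions, chosen for hardware with 16 KB of shared memory per MP): the static shared array of over 8 KB means at most one block can be resident per MP, so every block, real or dummy, occupies a whole MP.
__global__ void one_block_per_mp_kernel(int real_num_blocks)
{
__shared__ int pad[2100]; // ~8.4 KB of shared memory: only one block fits per MP
if (blockIdx.x >= real_num_blocks)
{
// dummy blocks: the much shorter spin loop
for (int i = 0; i < 10; i++)
pad[threadIdx.x % 2100]++;
}
else
{
// real kernel.....
}
}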
That does not work: you cannot expand the registers of just one MP. A kernel is a kernel and is the same for all MPs; the fact that blocks take a different code path does not change the register usage of the kernel.
There is currently no way to do this. One of the design goals of CUDA is transparent scalability to chips with varying numbers of SMs, which is why you can't select which SM a thread block runs on.
But if you explain what you really want to do, we might be able to offer suggestions…
Mark
Well, IMO it would be nice to have the ability to dedicate some of the multiprocessors to rendering the UI while others are doing computations. Right now that is not possible, and when the card is doing some math the UI is really slow and annoying.