Run different kernel functions on different Multiprocessors simultaneously Is it possible to assign

Chin · December 24, 2009, 5:24pm

Is it possible to assign different kernels to different multiprocessors (on the same graphic card) and run different kernel functions simultaneously with CUDA 2.3? I am looking for a way to run slightly different applications accessing the same data reside on the global data at the same time. Searching through the forum, I found a post around Oct, 2007 which indicating this is not possible - [url=“The Official NVIDIA Forums | NVIDIA”]The Official NVIDIA Forums | NVIDIA

I would like to know, is this possible now with CUDA 2.3? If so, how to do it?

Chin

Gregory_Diamos · December 24, 2009, 5:27pm

This is currently not possible. However, it is possible to have several device functions that are called from the same global function. For example

__device__ void function1(void*) {...}

  __device__ void function2(void*) {...}

__global__ void dispatch(void* in)

{

  if(blockId.x > 16) function1(in);

  else function2(in);

}

The problem is that you have to manually determine how functions are assigned to blocks.

Fermi is supposed to add support for this, but I’ll bet that you will have to make sure that both kernels are launched in different streams.

SPWorley · December 24, 2009, 7:39pm

Gregory’s completely correct. This kind of micro-kernel switching works pretty well, actually, and I do it often in some of my more complex code.
The main disadvantage is compile time, actually… you can’t split the subkernels into multiple objects.

You can even break it down and do similar switches on the per-warp level, though you give up the ability to use syncthreads() to coordinate among the warps. However if your microkernels are indeed per-warp you won’t need syncthreads().

Chin · December 24, 2009, 8:25pm

This is currently not possible. However, it is possible to have several device functions that are called from the same global function. For example
__device__ void function1(void*) {...}

  __device__ void function2(void*) {...}

__global__ void dispatch(void* in)

{

  if(blockId.x > 16) function1(in);

  else function2(in);

}
The problem is that you have to manually determine how functions are assigned to blocks.

Fermi is supposed to add support for this, but I’ll bet that you will have to make sure that both kernels are launched in different streams.

Thanks!

I will design my application per your suggestion.

Have a nice holiday!

Chin

Topic		Replies	Views
CUDA processor allocation CUDA Programming and Performance	7	3434	October 5, 2007
putting multiprocessors in group CUDA Programming and Performance	6	1677	November 27, 2009
Can two kernels form two distinct applications run on two GPU cards simutaneously? CUDA Programming and Performance	1	709	April 7, 2013
Can we run diff kernels on different cores simultaneously ? CUDA Programming and Performance	3	1056	October 20, 2010
Parallel execution of multiple kernels possible? CUDA Programming and Performance	1	1633	June 4, 2008
can we use different Kernels on diffferent cores of a GPU at the same time ? CUDA Programming and Performance	5	3664	October 20, 2010
A question on concurrent kernel execution CUDA Programming and Performance	2	777	April 13, 2012
Fermi CUDA Programming and Performance	3	7724	March 25, 2010
CUDA 4.0 concurrent kernels CUDA Programming and Performance	6	1670	March 28, 2011
Multiple kernels in flight? CUDA Programming and Performance	19	26824	August 28, 2007

Run different kernel functions on different Multiprocessors simultaneously Is it possible to assign

Related topics