All function calls from CUDA device functions are inlined, so recursion is not possible. You also cannot launch a kernel from inside another kernel, because each thread executes its code serially.
Suppose I run a kernel that executes fine and produces some result data. Without copying that result back to the host, can a subsequent kernel launch access it?
Or do I have to memcpy it to the host and then back to the device?
If I understand your question correctly, the answer is yes. If both kernels share the same context, you can write data to memory from the first kernel and then read it from the second. This holds for global (device) memory, but not for shared or local memory.
You will probably need to cudaMalloc() storage on the GPU from host code and pass the pointer to the allocated device memory to both kernels.
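A minimal sketch of that pattern (the kernel and variable names are made up for illustration): the first kernel writes its results to global memory, and the second kernel reads them back without any round trip through the host.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// First kernel: writes results into global memory.
__global__ void produce(int *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] = i * i;          // result stays in device memory
}

// Second kernel: reads the first kernel's output directly.
__global__ void consume(const int *data, int *out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = data[i] + 1;
}

int main() {
    const int n = 256;
    int *d_data, *d_out;
    cudaMalloc(&d_data, n * sizeof(int));   // allocated once from host code
    cudaMalloc(&d_out,  n * sizeof(int));

    // Both launches share the same context (and the default stream,
    // so they run in order); no cudaMemcpy between them.
    produce<<<(n + 127) / 128, 128>>>(d_data, n);
    consume<<<(n + 127) / 128, 128>>>(d_data, d_out, n);

    int h_out[n];
    cudaMemcpy(h_out, d_out, n * sizeof(int), cudaMemcpyDeviceToHost);
    printf("%d %d\n", h_out[2], h_out[10]);  // prints "5 101"

    cudaFree(d_data);
    cudaFree(d_out);
    return 0;
}
```

Only the final copy of the *second* kernel's output goes back to the host; the intermediate result in `d_data` never leaves the device.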
Uh, why do you think this is recursion? As long as the call can be completely inlined (i.e. your functions are just “syntactic sugar”), you can call another function; if you couldn’t, device functions would be completely useless.
Maybe I’m missing something here, but what is the point of nesting the functions? Anything that can be written as nested functions should be expressible serially as one device function as well… or no?