The way __device__ funcs get allocated

tonhead · October 22, 2009, 7:13am

Sorry for such a stupid question, but I have never called device functions before.
I searched through the CUDA documentation but could not find out how device functions are actually allocated.
Suppose I run my program on a GPU with 27 multiprocessors, with only 16 of them allocated to kernels. I call a device function from inside every kernel. How does CUDA allocate device resources (in terms of multiprocessors) to these device function calls?

avidday · October 22, 2009, 7:23am

It doesn’t. All device functions are in-lined by the compiler.

tonhead · October 22, 2009, 7:52am

So you are saying that these device functions are simply executed as part of the kernel that called them, that is, on the same multiprocessor they were called from? Much like an #include directive tells the compiler to replace the directive with the contents of the included file. Is that right?

avidday · October 22, 2009, 8:19am

It doesn’t exactly work that way, but that is the net effect, yes.
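To make the point concrete, here is a minimal sketch of a kernel calling a device function. The names `scale_add`, `saxpy` and `run_saxpy_check` are made up for illustration and do not come from the thread. The `__forceinline__` qualifier only asks the compiler to inline the function; on newer architectures the compiler may emit a real call instead of inlining, but as far as I know the function body still runs in the calling thread, on whichever multiprocessor that thread's block was scheduled on. Only the kernel launch has an execution configuration.

```cuda
#include <vector>
#include <cuda_runtime.h>

// Hypothetical device function: it has no launch configuration of its own.
// It executes inside the thread that calls it.
__device__ __forceinline__ float scale_add(float a, float x, float y)
{
    return a * x + y;
}

// Hypothetical kernel: every thread calls the device function directly.
// Blocks of this kernel are what get distributed across multiprocessors.
__global__ void saxpy(int n, float a, const float *x, float *y)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        y[i] = scale_add(a, x[i], y[i]);
    }
}

// Host wrapper (also hypothetical): runs the kernel on n elements and
// returns true if every result equals 3 * 1 + 2 = 5.
bool run_saxpy_check(int n)
{
    std::vector<float> hx(n, 1.0f), hy(n, 2.0f);
    float *dx = nullptr, *dy = nullptr;
    size_t bytes = static_cast<size_t>(n) * sizeof(float);

    if (cudaMalloc(&dx, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&dy, bytes) != cudaSuccess) { cudaFree(dx); return false; }
    cudaMemcpy(dx, hx.data(), bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dy, hy.data(), bytes, cudaMemcpyHostToDevice);

    int threads = 256;
    int blocks = (n + threads - 1) / threads;
    saxpy<<<blocks, threads>>>(n, 3.0f, dx, dy);  // only the kernel is launched

    bool ok = (cudaDeviceSynchronize() == cudaSuccess);
    cudaMemcpy(hy.data(), dy, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dx);
    cudaFree(dy);

    for (int i = 0; i < n && ok; ++i) {
        ok = (hy[i] == 5.0f);
    }
    return ok;
}
```

Calling `run_saxpy_check(1 << 20)` from a `main` function and compiling the file with `nvcc` should be enough to try it out.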
