Function calls inlining : clarification

kaoken · April 16, 2008, 2:36am

In PTX I can have function calls. But the call graph needs to be finite and computable at compile time : is this correct? And the function calls are inlined when converting from PTX to CUBIN instructions?

seibert · April 16, 2008, 2:38am

The function calls are inlined in the compilation of C to PTX by nvcc, even before the PTX assembly stage.

kaoken · April 16, 2008, 2:46am

What happens if I specify the noinline option or if one writes PTX by hand? PTX does have call instructions as well .func labels?

seibert · April 16, 2008, 5:02am

Yup, see the PTX ISA guide. PDF pg. 84 for .func and PDF pg. 70 for call.

kaoken · April 16, 2008, 5:45am

Thanks for your response. Well I can see that I am not being very clear here.

So let me rephrase the question.

I know that PTX has call instructions as well as .func labels. I also know in PTX I can do “call fname” where fname has to be a label and cannot be recursive. My question is this : Does the call graph in PTX have to be finite? Is there for example a maximum “stack depth” or can I have an arbitrary call graph? Or is there some other restriction? What about cycles in the call graph for example? Ruling out recursion only rules out an immediate cycle.

I also know that cubin is not disclosed but I am curious about what happens when PTX is converted into cubin. Do function calls get inlined?

seibert · April 16, 2008, 11:17am

The answer to the first part is yes, there is a maximum call depth, as the call instruction says:

“In the current ptx release, parameters are passed through statically allocated ptx registers; i.e., there is no support for recursive calls.”

So the call depth is limited by register availability. As for the second part, I’m not sure if ptxas will perform additional transformations and inline functions that were not already inlined by nvcc. Maybe someone else knows this…

wumpus · April 21, 2008, 8:14am

There are native call and return instructions, but ptxas and nvcc prefer inlining. Anyhow, you have to write code as if everything is inlined in all cases. There is no way to do recursion.

Topic		Replies	Views
How functions are compiled? Are function calls expanded inline or are actually CALLED? CUDA Programming and Performance	3	1280	April 8, 2011
Call inline ptx function? CUDA Programming and Performance	5	2293	June 19, 2012
Function recursion CUDA Programming and Performance	2	13703	June 27, 2007
Problem with CUDA PTX function calls CUDA Programming and Performance	0	1358	June 29, 2008
What is __noinline__ equivalent on PTX ISA? CUDA Programming and Performance	2	515	October 12, 2021
Force inlining of PTX functions? CUDA Programming and Performance	0	362	July 12, 2018
asm inlining in CUDA code? CUDA Programming and Performance	5	6538	July 19, 2010
Recursive non-kernel functions CUDA Programming and Performance	7	12744	April 7, 2010
Inline functions not inlined in CUDA 6.5? CUDA Programming and Performance	7	6049	November 29, 2014
'ptxas' died CUDA Programming and Performance	4	4299	August 14, 2008

Function calls inlining : clarification

Related topics