What is __noinline__ equivalent on PTX ISA?

Hi,

I wrote a kernel with very much small .func definitions.
The problem is kernel’s ‘instruction fetch latency’ issue.
Final cubin file has one entry but it should contain many func.

I want to force .func definitions to be not inlined.
How can I do this?

call.uni works :-)