How to inline PTX with nvfortran

adench2 · August 20, 2021, 12:37am

Hello,

I was wondering what the syntax would be for inline ptx in CUDA Fortran kernels. I’d hope the functionality exists without having to interface to CUDA C. I initially assumed it would be the same as CUDA C, e.g. just call
asm(“prefetch.global.L1 [%0];” : : “r”(var) )
(the above taken from a CUDA C post about prefetching, where I replaced ptr with var)

However, compiling this gives me some syntax errors:
“NVFORTRAN-S-0034-Syntax error at or near ) (reduction.cuf: 254)
0 inform, 0 warnings, 1 severes, 0 fatal for device_reduce_warp_memaccesses_vec4_vectorized_prefetch
call_reduction.cuf:”

I tried finding documentation of this in the CUDA Fortran programming guide, as well as the PTX guide, but no luck.

For context, I wanted to experiment with prefetching, since the above reduction kernel is severely limited by long scoreboard stalls. However, I can see myself playing with inline PTX in other contexts, so I would like to know the syntax in CUDA Fortran.

MatColgrove · August 20, 2021, 4:48pm

Sorry, ASM statements aren’t supported in Fortran.

adench2 · August 20, 2021, 5:55pm

I see! Thanks for letting me know. It’s not the end of the world to tinker with the intermediate PTX file, so I’ll work on that.

Topic		Replies	Views
Call inline ptx function? CUDA Programming and Performance	5	2309	June 19, 2012
asm inlining in CUDA code? CUDA Programming and Performance	5	6558	July 19, 2010
Inline PTX assembly example CUDA Programming and Performance	1	14817	August 3, 2010
Issue using inline PTX functions, with address operands, in CUDA application - Any help much appreciated! CUDA Programming and Performance	8	1234	April 7, 2018
Problem about inline PTX code in CUDA program CUDA Programming and Performance	3	2274	January 10, 2013
ptxas compiles my program wrong CUDA 4.0RC2 CUDA Programming and Performance	2	4534	May 8, 2011
Inline PTX Assembly CUDA Programming and Performance	0	2568	August 10, 2010
Some problems with inline PTX CUDA Programming and Performance	6	1916	March 6, 2013
[Solved] Texture access and inline CUDA ptx assembly in VS 2010 CUDA Programming and Performance	3	1135	September 8, 2013
why CUDA 2.0 does not expose all PTX ISA 1.3 ? CUDA Programming and Performance	20	27966	November 5, 2008

How to inline PTX with nvfortran

Related topics