Embedding PTX in executables for execution

Goubermouche · January 8, 2024, 7:45am

Hi, I’m aware that this isn’t specifically a CUDA topic, but I don’t know where else I should ask, and since I’m interested in Nvidia GPU’s/drivers for this, I figured I’d ask here.

Say I have some PTX code, ie. this (note: am I to understand that the section containing .section is supposed to be a more readable example of the byte data representing the actual PTX as seen above?).

How would I embed this code in an executable (let’s just focus on x64 Windows here)? Specifically, I’m interested in which steps I’d need to take to insert the needed segments of data representing the PTX files, and how I’d go about invoking the contained kernels from withing the executable.

I’m aware that this topic is quite complicated, hence why I’d also really appreciate any pointers to useful resources on this topic.

Robert_Crovella · January 18, 2024, 3:56pm

This may be of interest. In general, the driver API provided by NVIDIA and its general usage doesn’t provide a toolchain ready-to-go that is designed or intended to embed PTX in an executable. It’s not a typical use case as far as I know. However the linked answer shows some possibilities. I’m aware it doesn’t answer all your questions.

Goubermouche · January 23, 2024, 12:39pm

Apologies for the late reply, I didn’t have time since it was finals week at my uni. As for the question; Thanks, your link looks incredibly useful. One thing I’d be really interested in, though, is the overall legality of such project. As far as I know, NVCC is a proprietary tool, and attempting to “recreate” a part of it (be it the assembler or code generator backend) could potentially cause issues. On the other hand, I’m aware of tools like LLVM or GCC, which are also capable of emitting PTX assembly.

Thanks in advance.

Robert_Crovella · February 3, 2024, 12:02am

Here is another possible method. Also there is the ptxjit CUDA sample code. And here.

Topic		Replies	Views
How are kernels embedded into the executable and how to mimic this in other languages/tools ? CUDA Programming and Performance	1	5130	July 14, 2011
PTX programming in Visual Studio CUDA Setup and Installation	1	770	January 7, 2018
How to compile hand-made PTX source? CUDA Programming and Performance	0	2020	March 6, 2009
.PTX & how to get it running. How to create a hello world type ptx program, and get it to run. CUDA Programming and Performance	1	2865	January 7, 2012
Example code using PTX CUDA Programming and Performance	6	8872	March 25, 2008
Integrate PTX code in compilation chain CUDA Programming and Performance	1	597	June 20, 2016
.loc in PTX code CUDA Programming and Performance kernel	6	670	March 16, 2023
linking hand-coded PTX CUDA Programming and Performance	4	4414	August 31, 2007
running .ptx on GPU CUDA Programming and Performance	3	4942	March 27, 2009
embeding device code in executable with cuda driver model CUDA Programming and Performance	0	1557	February 20, 2012

Embedding PTX in executables for execution

Related topics