How to compile OpenCL code into binary for a GPU I do not physically have?

AlexJoe · February 5, 2023, 1:16pm

We are a small company developing image processing software. We use OpenCL for GPU acceleration (a single code base for Nvidia, AMD and Intel GPUs), and we deploy our OpenCL kernels as compiled binaries, one for each line of GPUs that requires separate compilation. In practice this means that with every new major line of Nvidia GPUs we have to buy one model from that line in order to obtain a binary for it. I think it’s quite understandable that we don’t want to include the source code for our proprietary algorithms in our commercial product.

So my question is: does Nvidia offer any way to install the latest driver and obtain the binary for a GPU that the driver knows about, but that I don’t physically have in my system? E.g. I have an RTX 3090, but I want to compile for an RTX 4090. AMD supports this via the CL_CONTEXT_OFFLINE_DEVICES_AMD extension.
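To make the "one binary per GPU line" deployment concrete, here is a minimal sketch of the runtime lookup it implies. Keying the table by CUDA compute capability is an assumption, not something stated in the post, and the file names are hypothetical. The two entries use the known compute capabilities of the RTX 3090 (8.6) and RTX 4090 (8.9).

```python
# Hypothetical table of shipped kernel binaries, keyed by (major, minor)
# compute capability. File names are invented for illustration.
PREBUILT_BINARIES = {
    (8, 6): "kernels_sm_86.bin",  # e.g. RTX 3090
    (8, 9): "kernels_sm_89.bin",  # e.g. RTX 4090
}


def pick_binary(major, minor, table=PREBUILT_BINARIES):
    """Return the shipped binary for this compute capability, or None.

    None means no binary was built for this GPU line yet, which is
    exactly the situation that currently requires buying new hardware.
    """
    return table.get((major, minor))
```

A `None` result is the case the question is about: the driver may know the architecture, but no binary exists for it until someone compiles on a matching GPU.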

Robert_Crovella · February 5, 2023, 3:40pm

I’m not aware of any method to compile for a non-existent device using the standard OpenCL toolchain, nor of any extension provided by NVIDIA to do that.

You can always file a bug (a feature request, basically). I don’t know if the development team would tackle such a project, or not.

Skybuck · March 1, 2023, 10:31am

It depends on what you define as a “compiled binary”.

It seems to be possible to use the nvcc compiler to compile OpenCL C code.

If this is indeed so, then perhaps the following command-line parameters will compile the kernel to PTX for a given machine bitness and a given architecture.

Here is an example:
-ptx --machine=64 -arch=sm_53

Basically, each GPU belongs to a certain architecture.

So compiling for each architecture should let you do what you want.

(PTX is a virtual instruction set and could be considered the binary you seek.)

The CUDA runtime and/or driver can execute PTX by compiling it further into GPU-specific instructions (SASS, if I remember correctly).

Good luck!

Robert_Crovella · March 1, 2023, 11:40am

I don’t think that is possible with any recent version of CUDA.

Any attempt to compile code that includes OpenCL-specific syntax such as __kernel results in syntax errors.

If you have a counter-example, please provide it.

nikhilj · March 1, 2023, 12:14pm

Have you looked at the CL_PROGRAM_BINARIES query of clGetProgramInfo?
You should be able to run your app on NVIDIA OpenCL on any supported NVIDIA GPU and use the query above to get the binary for the kernel.
You can later pass this binary to clCreateProgramWithBinary.
Hope this solves your problem.
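A minimal sketch of that round trip, assuming the third-party pyopencl bindings (not mentioned in this thread) rather than the C API. In pyopencl, `program.binaries` and `program.devices` correspond to the CL_PROGRAM_BINARIES and CL_PROGRAM_DEVICES queries, and `cl.Program(context, devices, binaries)` wraps clCreateProgramWithBinary. The helper names and the file naming scheme are hypothetical.

```python
import re
from pathlib import Path


def safe_name(device_name):
    """Turn a device name into a file-system friendly token."""
    return re.sub(r"[^A-Za-z0-9]+", "_", device_name).strip("_")


def save_binaries(program, out_dir):
    """Write one binary per device the program was built for.

    `program` is expected to be a built pyopencl.Program; only its
    `devices` and `binaries` attributes are used here.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for device, blob in zip(program.devices, program.binaries):
        path = out_dir / (safe_name(device.name) + ".bin")
        path.write_bytes(blob)
        paths.append(path)
    return paths


def load_binary(context, device, path):
    """Recreate a program from a saved binary (clCreateProgramWithBinary)."""
    import pyopencl as cl  # third-party; needs an installed OpenCL runtime

    return cl.Program(context, [device], [Path(path).read_bytes()]).build()
```

Note that the query only returns binaries for the devices the program was built for, so this approach covers GPUs that are actually present in the OpenCL context.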

AlexJoe · March 2, 2023, 5:02pm

That is exactly what we’ve been doing for many years. The problem is that you get the binary for your selected GPU (the one you created the OpenCL context for), and you cannot select a GPU that’s not physically present. Or can you?

Skybuck · March 2, 2023, 7:26pm

A quick Google search shows some examples; CUDA 8.0.61 is mentioned.

AlexJoe · March 2, 2023, 7:38pm

This is different. It compiles a C/C++ program that uses OpenCL, and that program, in turn, compiles the OpenCL code at runtime via clCreateProgramWithSource. It’s not clear to me why nvcc is even needed for that instead of regular gcc; maybe it’s a Linux thing. I only need this to work on Windows, btw.

Robert_Crovella · March 3, 2023, 4:02pm

nvcc isn’t compiling OpenCL device code in that example.

That example is irrelevant to the discussion here.

Skybuck · March 4, 2023, 6:55pm

What I do know about OpenCL is that it can be compiled to PTX.

Here is a link describing it; it seems to use GCC:

https://arrayfire.com/blog/generating-ptx-files-from-opencl-code/

Perhaps GCC has compile directives to target certain architectures of NVIDIA graphics cards?
