Pre-Compiling OpenCL Kernels Tutorial

Mesa_Boogie271 · June 9, 2010, 8:16pm

I was roaming through many forums trying to find some concrete examples on how to pre-compile OpenCL kernels. I was never successful in finding all the pieces in one spot. Here are a few functions I have created, gathered, and modified in order to accomplish this. First you need to write out the binary file using the code in the first code block. Call this with clCreateProgramWithSource and after clBuildProgram.

[codebox]void writeBinaries()

{

ofstream myfile("kernel.ptx");

cl_uint program_num_devices;

clGetProgramInfo(cpProgram, CL_PROGRAM_NUM_DEVICES, sizeof(cl_uint), &program_num_devices, NULL);

if (program_num_devices == 0)

{

		std::cerr << "no valid binary was found" << std::endl;

		return;

}

size_t binaries_sizes[program_num_devices];

clGetProgramInfo(cpProgram,	CL_PROGRAM_BINARY_SIZES, program_num_devices*sizeof(size_t), binaries_sizes, NULL);

char **binaries = new char*[ciDeviceCount];

for (size_t i = 0; i < ciDeviceCount; i++)

		binaries[i] = new char[binaries_sizes[i]+1];

clGetProgramInfo(cpProgram, CL_PROGRAM_BINARIES, program_num_devices*sizeof(size_t), binaries, NULL);



if(myfile.is_open())

{

	for (size_t i = 0; i < program_num_devices; i++)

	{

			myfile << binaries[i];

	}

}

myfile.close();

for (size_t i = 0; i < program_num_devices; i++)

		delete [] binaries[i];

delete [] binaries;

}[/codebox]

Next, you will need to comment out your load program from source routine (ex. oclLoadProgSource), clCreateProgramWithSource and writeBinaries(). After this you will need to add this code.

[codebox]FILE* fp = fopen(“oclLLtoUTM.ptx”, “r”);

fseek (fp , 0 , SEEK_END);

const size_t lSize = ftell(fp);

rewind(fp);

unsigned char* buffer;

buffer = (unsigned char*) malloc (lSize);

fread(buffer, 1, lSize, fp);

fclose(fp);

cl_int status;

cpProgram = clCreateProgramWithBinary(cxGPUContext, 1, (const cl_device_id *)cdDevices, 

			&lSize, (const unsigned char**)&buffer, 

			&status, &ciErr1);



if (ciErr1 != CL_SUCCESS)

{

    cout<<"Error in clCreateProgramWithBinary, Line "<<__LINE__<<" in file "<<__FILE__<<" "<<endl;

    Cleanup(EXIT_FAILURE);

}

ciErr1 = clBuildProgram(cpProgram, 0, NULL, NULL, NULL, NULL);[/codebox]

This will now read in your ptx file and create the binary. That is all there is to it.

leiming · January 6, 2013, 9:56pm

Thanks!

RianFlo · May 8, 2013, 1:18am

Hey there. Thanks for posting this.

The CUDA docs (old ones) say that this should only work if produced and consumed by the same driver, and might be removed in future versions.

Any idea what the reality behind this is? Is the ptx format a viable option for distribution to multiple NVIDIA devices?

Thanks,
Florian

Topic		Replies	Views
How to get the kernel binary file from OpenCL Nvidia GPU toolkit CUDA Programming and Performance	3	4592	November 24, 2011
[Suggestion]Precompilation tool CUDA Programming and Performance	3	1357	January 18, 2011
Get program binaries How to get the program binaries using the STL bindings? CUDA Programming and Performance	6	7108	January 24, 2012
OpenCL implementation clCreateProgramWithBinary() on CUDA CUDA Programming and Performance	4	5453	July 2, 2010
Building Device Binarys CUDA Programming and Performance	8	6969	September 23, 2009
How to compile OpenCL code into binary for a GPU I do not physically have? CUDA Programming and Performance opencl	9	3431	March 4, 2023
How to use clCreateProgramWithBinary() on cuda device using OpenCL ? CUDA Programming and Performance	1	1644	January 18, 2013
OpenCL Binary Reuse At what level can binaries be reused? CUDA Programming and Performance	8	11579	July 30, 2011
PTX kernels in OpenCL CUDA Programming and Performance	1	2125	July 1, 2013
pre-compile opencl kernels CUDA Programming and Performance	1	7942	March 19, 2010

Pre-Compiling OpenCL Kernels Tutorial

Related topics