Any chance to create a tool to precompile our OpenCL kernels for ALL the NVIDIA’s devices, pls?
Something like the tool included in the Intel’s OpenCL SDK:
Important: it must be able to compile for ALL the NVIDIA devices(G80,GT200,Femi), so I’ll just need to save the result to a file and pass it to clBuildProgramFromBinaries() when a NVIDIA OpenCL device is detected.
This has three main advantages over using the source code:
- It hides better our IP because we won’t need to supply the kernels’s source code.
- Kernels will compile much faster than if they’re JIT-compiled.
- We could see the generated code and to search for optimizations or bugs.
I personally think every IHV supporting OpenCL precompiled kernels should provide a tool as this one.