What's the best way to compile a kernel at runtime, assuming I want to minimize what the end user must install to get it working? Is there a way to run nvcc to compile GPU-only code without requiring cl.exe?
I know that PyCUDA offers runtime compilation of kernels, but getting it installed and running is far more involved than I can reasonably expect the end user to figure out. What I need is a self-contained, user-friendly way to let user-customized kernels be compiled and executed on the fly.
My application is a flame fractal viewer, and one of my goals is to let the user enter custom formulas and explore the fractals they produce.
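
To make the use case concrete, here is a rough sketch of the kind of source text that would need to be compiled at runtime. It is only an illustration in plain C++: the helper `build_kernel_source`, the kernel name `user_variation`, and the variable names `x` and `y` are all hypothetical, and the sketch only builds the string; how to compile it on the user's machine is exactly the open question.

```cpp
#include <string>

// Hypothetical helper: wraps two user-entered expressions in a CUDA kernel
// template. The kernel name and the variables x and y that the formulas may
// refer to are assumptions made for illustration only.
std::string build_kernel_source(const std::string& formula_x,
                                const std::string& formula_y) {
    std::string src;
    src += "extern \"C\" __global__ void user_variation(float* xs, float* ys, int n) {\n";
    src += "    int i = blockIdx.x * blockDim.x + threadIdx.x;\n";
    src += "    if (i >= n) return;\n";
    src += "    float x = xs[i];\n";
    src += "    float y = ys[i];\n";
    // The user's formulas are spliced in verbatim as the new coordinates.
    src += "    xs[i] = " + formula_x + ";\n";
    src += "    ys[i] = " + formula_y + ";\n";
    src += "}\n";
    return src;
}
```

For example, `build_kernel_source("sinf(x)", "sinf(y)")` would yield a kernel that applies a sinusoidal variation to each point. The resulting string is what would have to be turned into GPU code on the end user's machine.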