Hello,
My question is:
How are the cuda kernels embedded into the executable produced by Visual Studio 2010 / Visual C/C++ 2010 / NVCC ?
I can imagine different techniques:
-
Embedding PTX as some kind of resource string or something/some other string.
(Also if this string technique is chosen, how to compile this string ?) -
Some kind of binary equivalent resource.
(How to load this, probably load image api ?)
What techniques are used storage-wise and loading/executing/api-wise ?
And which technique and compiler options are available to do the same for other languages like pascal/Delphi ?
Bye,
Skybuck.