My question is:
How are CUDA kernels embedded into the executable produced by Visual Studio 2010 / Visual C++ 2010 / NVCC?
I can imagine different techniques:
Embedding the PTX as some kind of resource string, or as some other kind of string.
(If this string technique is used, how is the string then compiled?)
Embedding some kind of equivalent binary resource.
(How would this be loaded, probably with a load-image API?)
Which techniques are used for storage, and which APIs are used for loading and executing the kernels?
And which techniques and compiler options are available to do the same from other languages, such as Pascal/Delphi?