I’d like to compile a .cu file to all platforms in a single pass compilation, generating a .cubin which could contain code for all compute capabilities.
The Fermi Compatibility Guide for CUDA Applications, on item 1.3.1 says:
Well, after many trials and errors, errors, errors, I realized that in spite nvcc 3.2 is able to generate CUBIN files for all architectures, it does not imply that nvcc is able to do it in a single run.
The other route could be a .ptx file but I don’t have any experience on it.
Could you guys advise me what route I should take?
Thanks a lot