What software needs to be installed on a non-development machine to make calls into the CUDA runtime API?
I’m developing with the CUDA Toolkit 4.0, (MAY 2011). [font=“Courier New”]cudaRuntimeGetVersion()[/font] returns [font=“Courier New”]4000[/font]. I x-copy deploy my executable, DLLs and [font=“Courier New”]cudart64_40_17.dll[/font]. It works great on my development machine and on my Amazon Cluster GPU instance. On my colleague’s computer, it fails at the first CUDA call, which happens to be [font=“Courier New”]cudaGetDeviceCount(int *count)[/font].
When my colleague first tried to run the program, he got this error:
CUDA driver version is insufficent for CUDA runtime version.
Therefore, he got the latest driver from http://www.nvidia.com/Download/index.aspx.
Now, the error is:
non-CUDA capable device is detected
He has two GTX 480s, which are certainly CUDA-capable.
I’m trying to avoid installing anything more than necessary, for when we roll this out to more users.
What do I need to install to get this running on non-development machines?
EDIT. I should have mentioned that all machines are Windows 7.