I installed Cuda 8 yesterday on a Windows 7 PC that has a Quadro K2000 GPU board installed in it. This card has a compute capability of 3.0 which supports unified memory access according to Nvidia’s website. I can get the GPU to execute my cudaMalloc/cudaMemcpy code ok, but it will not execute my cudaMallocManaged code. Whenever my code tries to initialize the shared data from the CPU, it crashes the program. The code I am trying to run is from Mark Harris’s post on the Parallel ForAll forum titled: “An Even Easier Introduction to CUDA”.
Could the installation have picked up the wrong GPU architecture during the installation process?
If so, is there a way to force it to use the 3.0 capability?