Hi All,
Having a strange issue with cudaMemcpy. I’m writing some C wrappers for F95. Before I go off and post a bunch of code, I’ll just explain for now. Essentially, cudaGetDeviceCount(…) returns 0, so I have a CUDA capable device (GTX 9500GT with 1Gb, so compute level 1.1 capable). cudaGetDevice(…) gives me 0. cudaSetDevice(…) returns cudaSuccess. So all looks well. But, cudaMalloc(…) and cudaMemcpy(…) both are returning cudaErrorNoDevice(…). Any suggestions on what I’m missing? And yes, I compiled the example Fortran to Cuda code from CUDA u and I have the same issues. The compiled test cases (pure C, C++) work perfectly. BTW, I’m using gfortran 4.3. No segfaults and no crashes.