When calling
[b] cudaError_t cuda_error;
int device;
cuda_error = cudaGetDevice(&device);
assert(cudaSuccess == cuda_error);
cudaDeviceProp prop;
cuda_error = cudaGetDeviceProperties(&prop, device);
assert(cudaSuccess == cuda_error);[/b]
on a GTX 970 (with cuda driver 6.5.19, display driver version 343.98, Win7 Professional SP1 x64),
compiled with
-gencode=arch=compute_52,code=“compute_52,sm_52”
prop.totalGlobalMem == 0 and
prop.sharedMemPerBlock == 49152 (48k).
I expected the total memory to be ~ 4GB
and shared memory per block 96kB, as the GTX 970 uses GM204.
Am I missing something?
Thanks in advance for any support,
michel