After using cudaFuncSetCacheConfig( Kernel1, cudaFuncCachePreferL1);
there is no effect on shared memory. It is still 48KB.
I understand we should use sm_20 architecture, that might give me the correct functionality.
How do we change this GPU Architecture in Visual Studio 2008?
I tried: Properties->CUDA Build Rule v 3.0-> general-> GPU Architecture
But in GPU Architecture there is no option for sm_20 . Its giving an error when I am typing “sm_20”: