I have a single GTX 295 card installed on to a Rampage II extreme board, 12 GB RAM, i7 965 CPU, and I am running Vista x64.
When I compile & run the ‘deviceQuery’ project included with the CUDA SDK samples, I get the following output:
There is 1 device supporting CUDA Device 0: "GeForce GTX 295" Major revision number: 1 Minor revision number: 3 Total amount of global memory: 939524096 bytes Number of multiprocessors: 30 Number of cores: 240 Total amount of constant memory: 65536 bytes Total amount of shared memory per block: 16384 bytes Total number of registers available per block: 16384 Warp size: 32 Maximum number of threads per block: 512 Maximum sizes of each dimension of a block: 512 x 512 x 64 Maximum sizes of each dimension of a grid: 65535 x 65535 x 1 Maximum memory pitch: 262144 bytes Texture alignment: 256 bytes Clock rate: 1.24 GHz Concurrent copy and execution: No Test PASSED Press ENTER to exit...
I was expecting that CUDA would ‘see’ 2 cards with 30 MPs each, or possible 1 card with 60 MPs (either way, a total of 480 cores).
This would suggest that the CUDA code I write cannot take full advantage of the GTX 295.
How can I ‘see’ the full resources of the GTX 295?