Hi there,
sorry, maybe this question seems very stupid, but I am new to CUDA, so I tried
the examples in the SDK. And I am a bit confused:
Here is what ./deviceQuery tells me:
//////////////////////////////////////////////////////////////////////////////////////////////////////////////
There is 1 device supporting CUDA
Device 0: “GeForce GTX 280”
Major revision number: 1
Minor revision number: 3
Total amount of global memory: 1073479680 bytes
Number of multiprocessors: 30
Number of cores: 240
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 16384 bytes
Total number of registers available per block: 16384
Warp size: 32
Maximum number of threads per block: 512
Maximum sizes of each dimension of a block: 512 x 512 x 64
Maximum sizes of each dimension of a grid: 65535 x 65535 x 1
Maximum memory pitch: 262144 bytes
Texture alignment: 256 bytes
Clock rate: 1.30 GHz
Concurrent copy and execution: Yes
Test PASSED
Press ENTER to exit…
///////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
Ok, that’s ok.
But then I use ./simpleStreams.
//////////////////////////////////////////////////////////////////////////////
./simpleStreams
memcopy: 22.09
kernel: 18.53
non-streamed: 39.16 (40.62 expected)
8 streams: 34.93 (21.29 expected with compute capability 1.1 or later)
Test PASSED
Press ENTER to exit…
According to the result (34.93), I should have compute capability < 1.1.
But ./deviceQuery tells me 1.3.
Any ideas?
Thank in advance.
A.