[*] I can run deviceQuery and get the correct result.
bin/linux/release/deviceQuery
CUDA Device Query (Runtime API) version (CUDART static linking)
There is 1 device supporting CUDA
Device 0: "Quadro FX 570M"
CUDA Driver Version: 2.30
CUDA Runtime Version: 2.30
CUDA Capability Major revision number: 1
CUDA Capability Minor revision number: 1
Total amount of global memory: 133496832 bytes
Number of multiprocessors: 4
Number of cores: 32
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 16384 bytes
Total number of registers available per block: 8192
Warp size: 32
Maximum number of threads per block: 512
Maximum sizes of each dimension of a block: 512 x 512 x 64
Maximum sizes of each dimension of a grid: 65535 x 65535 x 1
Maximum memory pitch: 262144 bytes
Texture alignment: 256 bytes
Clock rate: 0.95 GHz
Concurrent copy and execution: Yes
Run time limit on kernels: Yes
Integrated: No
Support host page-locked memory mapping: No
Compute mode: Default (multiple host threads can use this device simultaneously)
But: I cannot run any real application using cuda; neither the ones compiled against the cudart nor the ones using cuda driver API.
I honestly have no idea as to what might be the reason for this. And I haven’t given up hope, yet! ;-)
This has pointers to a consistent set of 3.0-beta driver, toolkit and sdk. Once I deleted all the 2.3 (or whatever it actually was) and loaded the 3.0 versions, it all works.
You’re absolutely right about the little memory this card has, but for developing purposes this should suffice. And futhermore the basic SDK examples do run under Windows (like bandwithTest), so there should be no reason as to why this example shouldn’t run under linux.
Well I am still kind of puzzled and I am currently trying to remote develop on another cuda-capable machine.