I’ve tried searching these forums and haven’t found an answer, so here’s the problem:
CUDA code doesn’t seem to work on the 2nd GPU. The deviceQuery program in the SDK samples gives information which conflicts with nvidia-smi. Is this a sign of a broken GPU?
I’ve attached the outputs for both queries.
The deviceQuery output seems to indicate that everything is fine. However, the nvidia-smi output gives some disturbing information.
On the 2nd GPU, both the CPU and Memory Utilization fields give Unknown Errors. There are also some ECC memory errors. The clock speeds on the 2nd GPU also don’t seem to match factory specs.
Is this due to a software misconfiguration or is there something more going on here along the lines of a hardware failure?