I have some experience using CUDA with multiple GPUs (I’m running 14 in total, distributed across several machines). Of the 14 cards I originally ordered (GTX 570, from EVGA), 4 turned out to have faulty memory when I tested them with memtestG80. The cards with bad memory were also producing erroneous results in computations. I recommend checking the memory on every card if you can.
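If you want a quick sanity check before running a full tool, here is a minimal sketch of the write-then-verify idea. It is not how memtestG80 works internally and is far less thorough; the kernel names, the `check_device` helper, the bit patterns, and the 256 MiB buffer size are all my own choices for illustration.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Write a value derived from the word index, so every word holds something different.
__global__ void write_pattern(unsigned int *buf, size_t n, unsigned int seed) {
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    size_t stride = (size_t)gridDim.x * blockDim.x;
    for (; i < n; i += stride) buf[i] = seed ^ (unsigned int)i;
}

// Read every word back and count the ones that do not match what was written.
__global__ void verify_pattern(const unsigned int *buf, size_t n,
                               unsigned int seed, unsigned int *errors) {
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    size_t stride = (size_t)gridDim.x * blockDim.x;
    for (; i < n; i += stride)
        if (buf[i] != (seed ^ (unsigned int)i)) atomicAdd(errors, 1u);
}

// Hypothetical helper: returns the number of mismatched words on the given
// device, or -1 if any CUDA call fails.
long check_device(int device, size_t n_words) {
    if (cudaSetDevice(device) != cudaSuccess) return -1;

    unsigned int *buf = nullptr, *d_errors = nullptr;
    if (cudaMalloc(&buf, n_words * sizeof(unsigned int)) != cudaSuccess) return -1;
    if (cudaMalloc(&d_errors, sizeof(unsigned int)) != cudaSuccess) {
        cudaFree(buf);
        return -1;
    }
    cudaMemset(d_errors, 0, sizeof(unsigned int));

    // Arbitrary patterns: all zeros, all ones, and two alternating-bit masks.
    const unsigned int seeds[] = {0x00000000u, 0xFFFFFFFFu, 0xAAAAAAAAu, 0x55555555u};
    for (unsigned int seed : seeds) {
        write_pattern<<<256, 256>>>(buf, n_words, seed);
        verify_pattern<<<256, 256>>>(buf, n_words, seed, d_errors);
    }

    // cudaMemcpy waits for the kernels above and also reports their errors.
    unsigned int h_errors = 0;
    cudaError_t err = cudaMemcpy(&h_errors, d_errors, sizeof(h_errors),
                                 cudaMemcpyDeviceToHost);
    cudaFree(buf);
    cudaFree(d_errors);
    return err == cudaSuccess ? (long)h_errors : -1;
}

int main() {
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess) {
        printf("No usable CUDA devices found\n");
        return 1;
    }
    for (int d = 0; d < count; ++d) {
        // 64M words of 4 bytes each = 256 MiB per device (assumed to fit).
        long errors = check_device(d, (size_t)64 << 20);
        if (errors < 0) printf("device %d: CUDA error during test\n", d);
        else            printf("device %d: %ld mismatched words\n", d, errors);
    }
    return 0;
}
```

A clean result here does not prove the card is good: it touches only part of the memory, runs each pattern once, and some reads may be served from cache rather than DRAM. A nonzero count, on the other hand, is a strong hint that the card deserves a full memtestG80 run.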