I’ve been meaning to make this thread for a while - but having just damaged by 8800 as well, I think I need to bring it up.
Using CUDA w/ the Driver API - I’ve found it’s possible to (and very easily to accidentally) damage the hardware.
To date I’ve damaged both a Geforce 8800 GTX and a Quadro NVS 140M - simply by out of bounds memory transactions, which in some cases caused the driver to crash - however due to the fact I can’t definitively say the Quadro was damaged by CUDA - I’ll disregard it from now on.
Our 8800 GTX was absolutely damaged by CUDA, sometime around CUDA 2.1-2.3 - around about the time the new drivers came out which added better support for crash recovery (eg: instead of BSODs for the watchdog timer, or a really bad crash - the driver would simply reset).
To date the ‘damage’ has been purely superficial, corrupting only 2D drawing operations (my title bars, windows login screen, adobe reader, cmd.exe, etc - all show signs of graphical corruption - but not overly major). I’ve always had graphical corruption after writing out of bounds memory before, however it’s always been cured by rebooting (cold boot required in some cases) - however I’m only reporting this case because it’s permanent damage - rebooting, cold booting, even taking the card out of the motherboard for a few days doesn’t fix this issue.
To my knowledge, the card has only experienced out of bounds smem and gmem transactions (reads and writes) - I’m not aware of any other operations I’ve done that have caused corruption/crashes/etc.
Firstly, I’d like to know if this is a known (either internally, or publicly) issue? If so, is it ‘all cards’, or only some architectures/models? Is there a driver fix (and known ‘bad’ driver versions?)
And most importantly, how far can the damage go? (Clearly I don’t care so much about the 2D graphics stuff having random pixels flying all over it - but if this can affect the results of 3D rendering via OpenGL, or worse - computational results from the SPs… this is VERY BAD)
The last thing I want is to ship out a tech demo to a strategic partner, only to have it crash on some corner case - destroying their video card in the process…
(But the fact I’ve damaged one, possibly 2 cards - in a matter of 10 months of fulltime CUDA development… tells me it’s certainly a possibility.)
Edit: Okay, time flies - I’ve been working w/ CUDA for almost a year now… not 6 months.