Does anyone know of a logging method that might capture the problem that I am having with my Tesla M2090 card. It loads jobs from the program, completes the job then when a new one is assigned within the program the card fails. The process just stops, and its always when it completes the first assigned task and movers on to the next one. this time it completed 33.033% this time but that changes on every attempt. What ever is occurring, the card is removed from device manager, Nvidia control panel will not load as it says there is no card to manage. This isn’t isolated to windows as I have multiple OS boot and can load into my ubuntu drive and run the Linux version of the software with the same results, dosent matter the length of the job it will always complete the first run and will fail after loading in the next task. I cant tell if it is failing when it is reporting the results of the task completed. I was thinking maybe a driver issue, but I cannot get any driver to recognize the card except 390.77 for Linux and 386.45 for windows. Even a restart will not resolve the issue as the system has to be powered down completely for the device to show back up on the machine.
If anyone could suggest some logging or solutions, as I am using this card for my graduate degree for my programming classes and would really like to avoid replacing it, and the troubleshooting provides valuable information. can provide any information needed if will help with feedback