Hi All!
I am working on GTX 285 on Mac. The GPU card is only used for CUDA computation (use another GPU card for displaying).
Now, my CUDA application is facing with three troubles.
T1) My application is doing some operations for a given volume data. Now, it is working well with small volume data.
But, when I increase the volume dimension (twice), it returns an error called, unspecified launch failure.
In brief, my kernel is invoked from host code as following;
for(int iter=0; iter<NITER; iter++){
cudaThreadSynchronize();
invoke_cuda_kernel();
}
The CUDA error I mentioned above occurs after random number of iterations. When I checked the result before the error, everything is correct.
Can I get any ideas or advices regarding what can cause the error and how can I resolve the error?
T2) After I got the error, the application is terminated (of course). Right after that, when I tried to run my application with small volume dimension, which is working well before I got the error, it seems I never get any responds from GPU. It just stop working; even I could not terminate the application from terminal by using clrt+z. Thus, to run my application, I need to reboot my computer. Why do I have this kinds of problems? Anyone has any suggestion for this problem? Similarly, after I terminated my application using clrt+z via terminal during the iteration, I also encountered same problem? What should I do to resolve this kind of problem?
T3) I hope I can get some valuable suggestions or advices from two questions above. But, if not, I wonder there are some ways to just reset my GPU via terminal so that I do not need to restart my computer??? If there were, please advise me.
Thanks for all suggestions, comments, advises, and any replys in advance.
Best,
ss