I’ve begun a new CUDA project recently, and I’ve run into an error that’s got me stumped. I’m building the project on top of the driver API because I don’t want to have to include the runtime API DLL in my releases (for various reasons). Unfortunately, the driver API doesn’t give much information in the way of errors (other than the error code and a relative idea of where the error is occurring).
Are there any methods for troubleshooting errors that occur when working with the driver API? It would be helpful if the driver could write information out to a log file (like when kernels were launched, a printout of the parameters that were passed (in raw hex format), why an error occurred, etc.).
In any case, the error I’m getting is a CUDA_ERROR_NOT_READY value when I make a call to cuEventElapsedTime(); prior to that, I’m simply uploading some data, executing a small kernel (which I know works fine, since I compiled it with the <<< >>> syntax in a previous project), and reading the data back. I recorded an event at the start and finish of all that, and none of it returns an error until the cuEventElapsedTime() call (and I’m not using any async functions).