as I’ve read much here about CPU burning and “hacks” to prevent it, I wanted to ask if there is now any really good technique to prevent CPU burning while waiting for the kernel?
I tried cudaEventRecord together with cudaEventSyncronize, but my CPU still seems to be burned.
Any chance that this will be fixed in a future release of CUDA?
At the moment, my application is nearly unusable as I need low-latency because I’m processing audio streams. I can introduce latency, but I don’t want that of course…