I’m working strictly in emulation mode for the time being while I wait on CUDA hardware, and I need to measure the execution time of each CUDA kernel and of the CUDA FFT routine in my code. I have tried timing with CUDA events and cudaEventElapsedTime(), but no matter what I try, the elapsed time always comes back as 0.0. Does anyone know how to get the elapsed time while in emulation mode? Also, the time needs to be the time as seen by the GPU, not just the time on the CPU that the emulation is running on.
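For context, the event-based timing pattern is usually written along the lines below. This is only a minimal sketch and not my actual code: `dummyKernel`, the array size, and the launch dimensions are placeholders made up for illustration. It also checks the return code of cudaEventElapsedTime(), since the elapsed time is written through the pointer argument and an error would leave it at its initial value.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Placeholder kernel (hypothetical), only here so there is something to time.
__global__ void dummyKernel(float *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        data[i] = data[i] * 2.0f + 1.0f;
}

int main()
{
    const int n = 1 << 20;              // assumed problem size
    const int threads = 256;            // assumed block size
    const int blocks = (n + threads - 1) / threads;

    float *d_data = NULL;
    cudaMalloc((void **)&d_data, n * sizeof(float));
    cudaMemset(d_data, 0, n * sizeof(float));

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    // Record an event before and after the kernel launch, both in stream 0.
    cudaEventRecord(start, 0);
    dummyKernel<<<blocks, threads>>>(d_data, n);
    cudaEventRecord(stop, 0);

    // Kernel launches are asynchronous, so wait until the stop event
    // has actually been reached before reading the elapsed time.
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaError_t err = cudaEventElapsedTime(&ms, start, stop);
    if (err != cudaSuccess)
        printf("cudaEventElapsedTime failed: %s\n", cudaGetErrorString(err));
    else
        printf("elapsed: %f ms\n", ms);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(d_data);
    return 0;
}
```

The same pair of cudaEventRecord() calls can be placed around the FFT call instead of the kernel launch.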
Thanks in advance.