events/markers within kernel

cfreese · June 25, 2019, 9:07pm

I have a kernel that has multiple phases to it. I’d like to stick some sort of event or marker in between the phases to see how long each phase takes. I can do this on host code using NVTX and/or the runtime API cudaEvent* calls. But, I don’t want to break up the kernel so that each phase becomes a separate kernel. The NVTX markers/ranges don’t work within device code. cudaEvent Create/Destroy/Record do seem to have device versions, but that’s all which leaves me at a loss to understand how to use them for what I’m trying to do. I’d be happy to pack up the cudaEvents and copy them to the host for analysis, but I can’t find a definition/handle for the actual underlying structure of events. (I haven’t actually tried it but I don’t expect copying the pointers from the device to host is going to do me any good).

I guess the broader question is what, if any, techniques are available to instrument code intra-kernel.

Robert_Crovella · June 25, 2019, 11:30pm

The usual suggestion is to use clock64()

Topic		Replies	Views
Timer&Event CUDA Programming and Performance	3	3587	December 1, 2009
Timing inside the kernel How to measure times inside the kernel? CUDA Programming and Performance	10	12065	December 21, 2009
Concurrent kernel timing with cudaEvents CUDA Programming and Performance	1	1929	April 27, 2017
Number of GPU clock cycles CUDA Programming and Performance	15	10433	June 16, 2017
Should we rely on events recording or nvprof values for kernel execution time ? CUDA Programming and Performance	4	734	August 20, 2019
performance measurement of full kernel inside of kernel CUDA Programming and Performance	2	803	March 16, 2015
Profiling inside a kernel CUDA Programming and Performance	1	2265	May 8, 2009
Overhead of cudaEventRecord/cudaLaunchKernelExC in multithreading CUDA Programming and Performance	10	333	August 12, 2024
Precision of events for recording time elapsed of a kernel CUDA Programming and Performance	5	1198	December 21, 2017
How to measure time IN a kernel? CUDA Programming and Performance	10	3685	August 25, 2010

events/markers within kernel

Related topics