Semantics of recording a cudaEvent | Accuracy of cudaEvents Vs nvprof

singam-sanjay · August 13, 2017, 12:44pm

I’d like to understand how cudaEvents are recorded and how that affects the way kernel launches are timed.

Consider the following code, which times a kernel call within a larger program:

cudaEventRecord(start);
kernel_1<<<...,...>>>(...);
cpu_func(); // Is this recorded?
cudaEventRecord(stop);
[...]
cudaEventSynchronize(stop);
cudaEventElapsedTime(&result, start, stop);

I’m guessing that calls to cudaEventRecord merely specify the order of events, i.e. “start” → “kernel” → “stop”, where start and stop are “virtual” events defined relative to the actual event of running the kernel_1 kernel. If so, cudaEventElapsedTime would not include the time taken by cpu_func. Please correct me if I’m wrong. (I’m currently not in a position to check this myself.)
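For anyone who can run it, here is a minimal sketch for checking this guess empirically. The kernel body, the sleep-based cpu_func, the host_ms parameter and the time_with_host_work helper are all assumptions made for illustration; none of them come from the original program. The idea is to time the same launch once with no host work and once with host work that clearly outlasts the kernel. Since kernel launches are asynchronous and the stop event is only enqueued after cpu_func returns, the second figure may well come out larger, which would mean the host time is not always excluded.

```cuda
#include <chrono>
#include <thread>
#include <cuda_runtime.h>

// Hypothetical stand-in for kernel_1: a trivial element-wise update.
__global__ void kernel_1(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] = data[i] * 2.0f + 1.0f;
}

// Hypothetical stand-in for cpu_func: blocks the host thread for host_ms milliseconds.
static void cpu_func(int host_ms) {
    std::this_thread::sleep_for(std::chrono::milliseconds(host_ms));
}

// Times one launch of kernel_1 with host_ms of host work placed between the
// launch and the recording of the stop event (host_ms == 0 skips the host work).
// Returns the elapsed time in milliseconds, or -1.0f if a CUDA call fails.
float time_with_host_work(int host_ms) {
    const int n = 1 << 20;
    float *d_data = nullptr;
    if (cudaMalloc(&d_data, n * sizeof(float)) != cudaSuccess) return -1.0f;
    cudaMemset(d_data, 0, n * sizeof(float));

    cudaEvent_t start, stop;
    if (cudaEventCreate(&start) != cudaSuccess) {
        cudaFree(d_data);
        return -1.0f;
    }
    if (cudaEventCreate(&stop) != cudaSuccess) {
        cudaEventDestroy(start);
        cudaFree(d_data);
        return -1.0f;
    }

    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;

    // Warm-up launch so one-time initialisation cost is not part of the measurement.
    kernel_1<<<blocks, threads>>>(d_data, n);
    cudaDeviceSynchronize();

    cudaEventRecord(start);                    // enqueued in the default stream
    kernel_1<<<blocks, threads>>>(d_data, n);  // asynchronous: returns to the host immediately
    if (host_ms > 0) cpu_func(host_ms);        // host work overlaps with the kernel
    cudaEventRecord(stop);                     // enqueued only after cpu_func has returned
    cudaEventSynchronize(stop);                // wait until the stop event has been reached

    float ms = -1.0f;
    if (cudaEventElapsedTime(&ms, start, stop) != cudaSuccess) ms = -1.0f;

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(d_data);
    return ms;
}
```

Calling time_with_host_work(0) and time_with_host_work(50) from main and printing both values is enough to compare the two cases; a negative return value means a CUDA call failed.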

Now consider the same snippet without the call to cpu_func:

cudaEventRecord(start);
kernel_1<<<...,...>>>(...);
cudaEventRecord(stop);
[...]
cudaEventSynchronize(stop);
cudaEventElapsedTime(&result, start, stop);

Would nvprof’s summary stats provide a more accurate value of the time elapsed (assuming I’m calling kernel_1 only once) compared to calculating it using cudaEvents and cudaEventElapsedTime?
