CUDA-Kernel time measurement

Deus · June 8, 2010, 8:44pm

Hello,

what is the best possibility for the time measurement of CUDA-Kernels?

I use C++ - Timer and cudaThreadSynchronize() , but is it the best possibility?

cudaThreadSynchronize()

timer.start();

kernel(...);

cudaThreadSynchronize()

timer.stop();

jjtapiav · June 8, 2010, 10:47pm

There are the cudaEvents

Here’s a minimal example that makes use of it…

float memsettime;

cudaEvent_t start,stop;

cudaEventCreate(&start);

cudaEventCreate(&stop);

cudaEventRecord(start,0);

\

\cuda coda here

\

cudaEventRecord(stop,0);

cudaThreadSynchronize();

cudaEventElapsedTime(&memsettime, start, stop);

cudaEventDestroy(start);

cudaEventDestroy(stop);

Ringworm · June 8, 2010, 10:52pm

There are the cudaEvents

Here’s a minimal example that makes use of it…

float memsettime;

cudaEvent_t start,stop;

cudaEventCreate(&start);

cudaEventCreate(&stop);

cudaEventRecord(start,0);

\

\cuda coda here

\

cudaEventRecord(stop,0);

cudaThreadSynchronize();

cudaEventElapsedTime(&memsettime, start, stop);

cudaEventDestroy(start);

cudaEventDestroy(stop);

I assume this is also reliable in timing non-CUDA code, right?

jjtapiav · June 8, 2010, 10:58pm

It depends I guess. If you have a CPU multi threaded program, cudaThreadSynchronize will do nothing synchronization wise so you would have to additionally include your own barrier. Other than that I guess it would work, although the phrase killing a fly with a bazooka comes to mind. (given the additional overhead of having to synchronize the cudaEvents’ timings).

Deus · June 9, 2010, 3:24pm

Thank you!!!

Topic		Replies	Views
Kernel execution is async? CUDA Programming and Performance	1	4562	May 23, 2008
GPGPU Time Measurement CUDA Programming and Performance	2	5153	August 27, 2011
Is CUDA timer trustable? CUDA Programming and Performance	1	3930	July 6, 2007
CUDA event timer or C++11 <chrono> timers, which one should I use? CUDA Programming and Performance	4	4012	May 21, 2019
how to measure the time elapsed (or no. of clock cycles) between the start and the end of a cuda thr CUDA Programming and Performance	7	2789	December 13, 2009
Concurrent kernel timing with cudaEvents CUDA Programming and Performance	1	1918	April 27, 2017
timing performance of kernels how ? cudaprof vs cudaEventRecord vs cutStartTimer CUDA Programming and Performance	3	5300	March 21, 2009
is cudaThreadSynchronize() will take 600+ms to execute? CUDA Programming and Performance	3	1539	April 21, 2009
Mesuring Kernel Performance CUDA Programming and Performance	3	1081	September 29, 2009
Compare Execution Times CPU vs GPU the proper way? CUDA Programming and Performance	5	5999	September 8, 2009

CUDA-Kernel time measurement

Related topics