Why compute a float in cudaEventElapsedTime() instead of a long integer?

allanmac · February 4, 2010, 7:11pm

Anyone care to explain why the event stream elapsed time function computes a float instead of a 64-bit integer?

[indent][font=“Courier New”]cudaError_t cudaEventElapsedTime (float* ms, cudaEvent_t start, cudaEvent_t end)[/font]
[font=“Georgia”]Computes the elapsed time between two events (in milliseconds with a resolution of around 0.5 microseconds). If
either event has not been recorded yet, this function returns cudaErrorInvalidValue. If either event has been recorded
with a non-zero stream, the result is undefined.[/font][/indent]

Uncle_Joe · February 4, 2010, 11:21pm

I’m guessing they use float for convenience. Most GPU functions take a few ms, so it gives you something human readable without further conversion.

But imo, that’s moot because the event interfaces is too cumbersome to use in the 1st place. I still use CPU clocks.

tmurray · February 4, 2010, 11:26pm

Yeah… using CPU clocks to time GPU performance is pretty useless. Resolution isn’t high enough, there’s OS scheduler and PCIe interference, etc.

Uncle_Joe · February 4, 2010, 11:38pm

It works fine for my purposes. I’m using QueryPerformanceCounter on Windows and clock_gettime(CLOCK_REALTIME, &t) on Linux, which both give have a resolution of at least 10^-5s. I still use CUDA profiler if I want to know the time without launch overhead.

Actually, I take back about the event interface being too cumbersome. That’s also moot because the SDK has a timer wrapper class, which I totally forgot about.

tmurray · February 5, 2010, 12:21am

If you ever send me code that uses cutil, I helpfully ignore it completely.

Basically if you are timing what is happening on the card you need to be using events because otherwise context switches will ruin your results.

allanmac · February 5, 2010, 2:39am

I was just curious because floats are hardly a great representation for time duration and conversion but, as you say, I bet they thought this would be more convenient.

I consider it a bug and hope someone will deprecate it (unless there is something I’m missing).

Perhaps the cudaEvent_t is not entirely opaque and there is an unofficial way to get an integral timestamp?

Topic		Replies	Views
using cudaEvent no elapsed time given CUDA Programming and Performance	2	3443	July 2, 2008
Timing Question timing of a function not clear CUDA Programming and Performance	15	10241	November 30, 2007
Timing using cudaEvent****() VS clock_gettime() CUDA Programming and Performance	6	2152	August 26, 2015
Compare GPU and CPU function time CUDA Programming and Performance	7	6306	May 30, 2011
Number of GPU clock cycles CUDA Programming and Performance	15	10317	June 16, 2017
Using cudaEvents to synchronise with cudaStreamCallback CUDA Programming and Performance cuda	5	627	May 9, 2024
CUDA OpenCL comparison CUDA Programming and Performance	9	3401	August 23, 2011
Poor half performance CUDA Programming and Performance	9	2305	February 16, 2020
cudaEvntRecord for stream gives incorrect result CUDA Programming and Performance	0	2049	October 31, 2008
Time for Splitting up Memory Transfers? CUDA Programming and Performance	8	3365	August 14, 2010

Why compute a float in cudaEventElapsedTime() instead of a long integer?

Related topics