Timing a kernel in CUDA Fortran

Does PGI Fortran implement a built-in timer like cutStartTimer() in CUDA C?
Or is there a way to measure the runtime of a kernel without using the profiler?


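Probably the most dependable timing mechanism is to use CUDA events. They are recorded on the GPU itself, so the measurement is not thrown off by kernel launches being asynchronous with respect to the host.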

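! Requires "use cudafor"; declarations for the variables used below:
type(cudaEvent) :: startEvent, stopEvent
real :: time      ! elapsed time, in milliseconds
integer :: istat

istat = cudaEventCreate(startEvent)
istat = cudaEventCreate(stopEvent)

! Record the start event on stream 0
istat = cudaEventRecord(startEvent, 0)

! Do whatever you want to time here

! Record the stop event, then block the host until it has completed
istat = cudaEventRecord(stopEvent, 0)
istat = cudaEventSynchronize(stopEvent) ! or cudaDeviceSynchronize() (cudaThreadSynchronize() in older CUDA releases)

istat = cudaEventElapsedTime(time, startEvent, stopEvent)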
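
For context, here is a minimal sketch of how those event calls might sit in a complete program. The module name kernels, the kernel scale, the array size, and the block size of 256 are all made up for illustration; only the cudaEvent* calls come from the snippet above. It assumes the file is built as CUDA Fortran (for example with a .cuf extension).

```fortran
module kernels
  use cudafor
contains
  ! Hypothetical kernel, used only to have something to time:
  ! doubles every element of a.
  attributes(global) subroutine scale(a, n)
    integer, value :: n
    real, device :: a(n)
    integer :: i
    i = (blockIdx%x - 1) * blockDim%x + threadIdx%x
    if (i <= n) a(i) = 2.0 * a(i)
  end subroutine scale
end module kernels

program time_kernel
  use cudafor
  use kernels
  implicit none
  integer, parameter :: n = 1024 * 1024   ! assumed problem size
  integer, parameter :: tpb = 256         ! assumed threads per block
  real, device, allocatable :: a_d(:)
  type(cudaEvent) :: startEvent, stopEvent
  real :: time                            ! elapsed time, in milliseconds
  integer :: istat

  allocate(a_d(n))
  a_d = 1.0                               ! initialise the device array

  istat = cudaEventCreate(startEvent)
  istat = cudaEventCreate(stopEvent)

  ! Bracket the kernel launch with the two events on stream 0
  istat = cudaEventRecord(startEvent, 0)
  call scale<<<(n + tpb - 1) / tpb, tpb>>>(a_d, n)
  istat = cudaEventRecord(stopEvent, 0)

  ! Wait until the stop event has actually been reached on the GPU
  istat = cudaEventSynchronize(stopEvent)
  istat = cudaEventElapsedTime(time, startEvent, stopEvent)

  print *, 'Kernel time (ms): ', time

  ! Release the events and device memory
  istat = cudaEventDestroy(startEvent)
  istat = cudaEventDestroy(stopEvent)
  deallocate(a_d)
end program time_kernel
```

The first launch of a kernel usually includes some one-time startup cost, so it may be worth launching once untimed and then timing a second launch, or averaging over several launches.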

Thanks, Brentl. This is exactly what I want.