Cuda Profiler Performance boost?

JHHPC · March 7, 2008, 7:28am

Hi everybody,

I already posted this here:
http://forums.nvidia.com/index.php?act=ST&f=71&t=57443

I am seeing very erratic behaviour with one of my benchmark kernels. If I execute it within the CudaVisualProfiler, its performance doubles. The times measured by the CudaVisualProfiler and by my own timing routine are the same.

Thanks!

Johannes

laffer · October 15, 2008, 1:57pm

I am seeing this today for two of my kernels. They run twice as fast under the profiler.

Did you ever resolve this issue?

JHHPC · October 16, 2008, 10:25am

Unfortunately not.

However, I have had no further occurrences with other codes.

MisterAnderson42 · October 16, 2008, 11:57am

Does your program's total run time change by an appreciable amount, too?

Enabling profiling causes an implicit cudaThreadSynchronize() after every kernel call. So, in the following situation:

cudaThreadSynchronize()

mark time on wall clock

call kernel1

mark time on wall clock

call kernel 2

cudaThreadSynchronize()

mark time on wall clock

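Without profiling, the kernel1 launch returns right away, so the first interval measures almost nothing and the second interval contains the run time of both kernels. With profiling enabled, the implicit synchronize after kernel1 moves its run time into the first interval, so the time attributed to kernel 2 would appear to drastically decrease. You could see if you get the same behavior when you enable the “sync after every kernel call” environment variable, too (sorry, don’t recall the exact env var: check the release notes).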
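The situation above can be sketched as a small test program. This is only an illustration under assumptions, not the original poster's code: busy_kernel, time_two_kernels and the launch sizes are made-up names and values, and cudaDeviceSynchronize() is used as the current replacement for the deprecated cudaThreadSynchronize(). The sync_after_each flag imitates the implicit synchronize that profiling adds after every launch.

```cuda
#include <chrono>
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel that just burns time so the effect is measurable.
__global__ void busy_kernel(float *out, int iters) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    float v = static_cast<float>(i);
    for (int k = 0; k < iters; ++k) v = v * 1.0000001f + 0.5f;
    out[i] = v;
}

struct Split { double first_ms; double second_ms; };

static double ms_since(std::chrono::steady_clock::time_point t0) {
    auto dt = std::chrono::steady_clock::now() - t0;
    return std::chrono::duration<double, std::milli>(dt).count();
}

// Times two back-to-back launches with host wall-clock marks.
// sync_after_each = true imitates the implicit sync added by profiling.
// Returns negative values if the device buffer cannot be allocated.
Split time_two_kernels(bool sync_after_each) {
    const int blocks = 256, threads = 256, iters = 1 << 16;
    float *d_out = nullptr;
    if (cudaMalloc((void **)&d_out, blocks * threads * sizeof(float)) != cudaSuccess)
        return Split{-1.0, -1.0};

    // Warm-up launch so one-time startup cost is not counted.
    busy_kernel<<<blocks, threads>>>(d_out, iters);
    cudaDeviceSynchronize();

    auto t0 = std::chrono::steady_clock::now();      // mark time
    busy_kernel<<<blocks, threads>>>(d_out, iters);  // kernel1 (asynchronous launch)
    if (sync_after_each) cudaDeviceSynchronize();
    double first = ms_since(t0);                     // mark time

    busy_kernel<<<blocks, threads>>>(d_out, iters);  // kernel2
    cudaDeviceSynchronize();
    double total = ms_since(t0);                     // mark time

    cudaFree(d_out);
    return Split{first, total - first};
}

int main() {
    for (int sync = 0; sync <= 1; ++sync) {
        Split s = time_two_kernels(sync != 0);
        if (s.first_ms < 0.0) { std::printf("cudaMalloc failed\n"); return 1; }
        std::printf("sync after each launch: %s | interval 1: %.3f ms | interval 2: %.3f ms\n",
                    sync ? "yes" : "no", s.first_ms, s.second_ms);
    }
    return 0;
}
```

If the explanation holds, the sum of the two intervals should stay roughly the same in both runs while the split between them changes. The environment variable I was thinking of is probably CUDA_LAUNCH_BLOCKING=1, which makes kernel launches synchronous, but do check the release notes for your toolkit version.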

laffer · October 16, 2008, 12:30pm

I opened up a different thread that may or may not be related to the original problem posted here:
http://forums.nvidia.com/index.php?showtop…=0&#entry452562

Anderson: Good suggestion. But as regards my problem (which may be different from the original post): yes, I do sync, and yes, the entire program speeds up.
