Why these measurements so different?

yxnabc · December 17, 2011, 8:27am

In the host code, I measured the total time of these six kernel launchs with CUDA timer. In the windows 7, the result is about 1 ms and in Linux , the result is 0.17ms. But in the output of Visual Profiler, the total consumed time of these kernel is 0.13ms! why there is such big diferences between these results? Thanks for your help!

cutCreateTimer(&timer);

    cutStartTimer(timer);

    cufftExecD2Z(Plan,cubfft,fft_result);

    cudaThreadSynchronize();

    // first kernel

tri_transpose<<<grid,threads>>>(fft_result,cu_result, argu);

    cudaThreadSynchronize();

    //second kernel

cudaFuncSetCacheConfig(tri_solver, cudaFuncCachePreferShared);

tri_solver<<<grid2,threads2>>>(cu_result,cu_act_trace,cu_tri_main);

    cudaThreadSynchronize();

    // third kernel

tri_transpose2<<<grid,threads>>>(cu_result, cubfft);

    cudaThreadSynchronize();

    // fourth kernel

cufftExecD2Z(Plan,cubfft,fft_result);

    cudaThreadSynchronize();

    // fifth kernel

post_process<<<grid2,threads2>>>(fft_result,cu_result,argu);

   // sixth kernel

cutStopTimer(timer);

tera · December 19, 2011, 12:16pm

I believe cutil timers only have millisecond precision on Windows. Use a higher precision timer or time multiple iterations before concluding that the code is slower on windows (which still might be the case though - kernel invocations are slower on windows).

DrAnderson42 · December 19, 2011, 1:00pm

The windows WDDM drivers also have substantially more latency and overhead in calling kernels than the linux drivers.

Topic		Replies	Views
Different times Ubuntu Vs Windows CUDA Programming and Performance	8	1678	October 12, 2015
First kernel execution takes longer CUDA Programming and Performance	8	2859	December 8, 2014
time measurement discrepancy timer, clock(), profiling CUDA Programming and Performance	4	6695	April 7, 2010
Execution time is different in Profiller and Console. why? CUDA Programming and Performance	4	3742	August 3, 2009
CPU vs GPU Timer Is CUDA Timer accurate ? CUDA Programming and Performance	3	6766	February 19, 2010
cutil Timer and visual profiler CUDA Programming and Performance	2	1005	April 21, 2010
Kernel execution overhead CUDA Programming and Performance	2	1159	July 6, 2009
Profiler Kernel Speeds faster than cmd? CUDA Programming and Performance	4	6780	June 24, 2008
On timing and timer CUDA Programming and Performance	7	4191	July 15, 2009
Oscilating performance, Code total times variates CUDA Programming and Performance	10	10571	June 21, 2009

Why these measurements so different?

Related topics