Varying Execution time

GiulioPU · June 9, 2010, 3:39pm

Hi,

I run the follow code and I realized that the execution time change, sometimes is 0 and sometimes is 0.001(too much for that code!). Does anyone know why this happend?

Do you thing that I can improve the performance of the code storing the scalar variable in device as well ?

//all the matrices and vectors are stored in the device.

//all the scalar are stored in the host

.

.

clock_t start = clock();

		cublasScopy(N, v, 1, v_old, 1); //v_old = v;		

		

		//v = v_hat/beta; 

		cublasScopy (N, v_hat, 1, v, 1); // v = v_hat;

		cublasSscal(N,(1/beta),v,1);

		// alpha = v'*A*v;

		cublasSsbmv('U', N, K, 1, Ab, (2*K+1), v, 1, 0, sup0, 1); // sup0= A*v;

		alpha= cublasSdot(N, sup0, 1,v, 1); // alpha = dot(sup0,sup0)=sup0'*sup0

	

				//v_hat = A*v - alpha*v - beta*v_old;

		cublasSaxpy(N, -alpha, v, 1, sup0, 1); // sup0 = -alpha*v +sup0 ----- sup0 was A*v

		cublasSaxpy(N, -beta, v_old, 1, sup0, 1); // sup0 = -beta*v_old +sup0

		cublasScopy(N, sup0, 1, v_hat, 1); //v_hat = sup0;

	

		beta_old = beta;				// beta_old = beta;

		beta = cublasSnrm2(N,v_hat,1);		 // beta = norm(v_hat);

printf("\n Time elapsed : %f \n", ((double)clock() - start) / CLOCKS_PER_SEC);

.

.

.

tera · June 9, 2010, 5:32pm

Apparently [font=“Courier New”]CLOCKS_PER_SEC[/font] is 1000 on your system. Before you try to optimize your code based on measured timings, improve the measurement itself. Run the timed code multiple times in a loop, or user a higher precision timer like CudaEvents.

MisterAnderson42 · June 10, 2010, 11:40am

You also need a cudaThreadSynchronize() prior to each measurement of wall clock time. gettimeofday has a much higher resolution than clock()

Topic		Replies	Views
Time Measurement for CUBLAS why time (clock()) for CUBLAS is always 0 ms for any array size? CUDA Programming and Performance	2	2700	March 21, 2009
10ms Block each seconds during execution CUDA Programming and Performance	1	3747	January 3, 2012
SPMT: Single Program Multiple (Exeuction) Time CUDA Programming and Performance	15	4052	July 4, 2009
Function executing time CUDA Programming and Performance	7	6519	December 17, 2007
Execution timings varying from instance to instance CUDA Programming and Performance	10	588	September 29, 2023
Issue with measuring execution time after accelerating with CUDA CUDA Programming and Performance	1	578	March 12, 2018
Timing CUDA Code To find the best way to time CUDA code CUDA Programming and Performance	5	2084	January 6, 2009
Getting Different Execution Times of Running Same Kernel Twice CUDA Programming and Performance	2	79	August 13, 2024
Execution time is not proportional to the time steps CUDA Programming and Performance	5	1154	May 6, 2012
more time taken by CUDA rather than reducing time CUDA Programming and Performance	7	4704	November 18, 2010

Varying Execution time

Related topics