I want to know how to obtain the execution time of a kernel, for example a matrix multiplication, as a number of clock cycles.
I use this code to get the result in ms, but I want it in clock cycles instead:
// create and start timer
unsigned int timer = 0;
cutCreateTimer(&timer);
cutStartTimer(timer);
// stop and destroy timer
cutStopTimer(timer);
printf("Processing time: %f (ms) \n", cutGetTimerValue(timer));
cutDeleteTimer(timer);
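
For reference, here is a minimal sketch of two ways a cycle count might be obtained. It is not a definitive method. The first reads the per-SM cycle counter with clock64() inside the kernel, which measures one thread's own work. The second times the whole launch with CUDA events and converts milliseconds to cycles using the clock rate reported by the device. The kernel matMul, the helper msToCycles, and the sizes are hypothetical names chosen for illustration. The reported clock rate is the peak rate, so the converted figure is only an estimate.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical stand-in kernel: naive n x n matrix multiply, C = A * B.
// Thread (0,0) of block (0,0) stores the cycles spent on its own element.
__global__ void matMul(const float *A, const float *B, float *C, int n,
                       long long *cycles)
{
    long long start = clock64();  // per-SM cycle counter
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row < n && col < n) {
        float sum = 0.0f;
        for (int k = 0; k < n; ++k)
            sum += A[row * n + k] * B[k * n + col];
        C[row * n + col] = sum;
    }
    long long stop = clock64();
    if (blockIdx.x == 0 && blockIdx.y == 0 && threadIdx.x == 0 && threadIdx.y == 0)
        *cycles = stop - start;
}

// A rate in kHz is a count of cycles per millisecond, so ms * kHz = cycles.
double msToCycles(float ms, int clockKHz)
{
    return (double)ms * (double)clockKHz;
}

int main()
{
    const int n = 256;  // assumed matrix size
    const size_t bytes = (size_t)n * n * sizeof(float);
    float *A, *B, *C;
    long long *dCycles;
    cudaMalloc(&A, bytes);
    cudaMalloc(&B, bytes);
    cudaMalloc(&C, bytes);
    cudaMalloc(&dCycles, sizeof(long long));
    cudaMemset(A, 0, bytes);
    cudaMemset(B, 0, bytes);

    cudaEvent_t begin, end;
    cudaEventCreate(&begin);
    cudaEventCreate(&end);

    dim3 block(16, 16);
    dim3 grid((n + block.x - 1) / block.x, (n + block.y - 1) / block.y);

    cudaEventRecord(begin);
    matMul<<<grid, block>>>(A, B, C, n, dCycles);
    cudaEventRecord(end);
    cudaEventSynchronize(end);  // wait until the kernel has finished

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, begin, end);

    int clockKHz = 0;  // peak SM clock rate of device 0, in kHz
    cudaDeviceGetAttribute(&clockKHz, cudaDevAttrClockRate, 0);

    long long hCycles = 0;
    cudaMemcpy(&hCycles, dCycles, sizeof(hCycles), cudaMemcpyDeviceToHost);

    printf("Processing time: %f (ms)\n", ms);
    printf("Estimated cycles (events x clock rate): %.0f\n", msToCycles(ms, clockKHz));
    printf("Cycles seen by one thread (clock64): %lld\n", hCycles);

    cudaEventDestroy(begin);
    cudaEventDestroy(end);
    cudaFree(A);
    cudaFree(B);
    cudaFree(C);
    cudaFree(dCycles);
    return 0;
}
```

The clock64() value covers only the thread that recorded it, while the event-based figure covers the whole launch including launch overhead, so the two numbers are not expected to match.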