Is there any method to time a section of code inside a kernel?

clock()

So the only thing I have to do is this?

```
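__global__ void mykernel(...)
{
    ...
    ...
    clock_t start = clock();        // counter value before the timed section
    for (...)
        ...
    ...
    clock_t end = clock();          // counter value after the timed section
    clock_t elapsed = end - start;  // difference of the two counter readings
}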
```

Does clock() return time or GPU cycles? If it returns cycles, how can I convert them to time?

It's core clock cycles (half of the shader clock).

deviceQuery prints Clock rate = 1.9 GHz. Do I have to divide by this rate? Also, is this the rate of one of my 128 processors, or the rate of all the processors together?

I understand that all threads in one block take the same number of clock cycles, but how can I get time from clock cycles?
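
For what it's worth, here is a rough sketch of one way to turn the cycle count into time. It is only an illustration: `timed_kernel`, `cycles_to_ms`, the dummy loop, and the launch sizes are made up for this example and are not from the thread. It reads the counter with `clock64()` and queries the clock rate with `cudaDeviceGetAttribute(..., cudaDevAttrClockRate, ...)`, which reports the peak rate in kHz, so cycles divided by kHz gives milliseconds. As far as I know this is the clock each multiprocessor runs at, not a sum over all processors. The real clock can differ from the peak value, so treat the result as an estimate.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Convert a cycle count to milliseconds given a clock rate in kHz
// (1 kHz = 1 cycle per millisecond). Hypothetical helper for this sketch.
__host__ __device__ inline double cycles_to_ms(long long cycles, int clock_khz)
{
    return static_cast<double>(cycles) / static_cast<double>(clock_khz);
}

// Hypothetical kernel: times a placeholder loop with clock64() and stores
// the cycle count seen by thread 0 of each block.
__global__ void timed_kernel(float *data, long long *cycles, int iters)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    float x = data[i];

    long long start = clock64();
    for (int k = 0; k < iters; ++k)
        x = x * 1.0001f + 0.5f;       // placeholder work being timed
    long long end = clock64();

    data[i] = x;                       // keep the loop from being optimized away
    if (threadIdx.x == 0)
        cycles[blockIdx.x] = end - start;
}

int main()
{
    const int blocks = 4, threads = 128, n = blocks * threads;  // assumed sizes
    float *d_data = nullptr;
    long long *d_cycles = nullptr;
    cudaMalloc(&d_data, n * sizeof(float));
    cudaMalloc(&d_cycles, blocks * sizeof(long long));
    cudaMemset(d_data, 0, n * sizeof(float));

    timed_kernel<<<blocks, threads>>>(d_data, d_cycles, 1000);

    // cudaMemcpy on the default stream waits for the kernel to finish.
    long long h_cycles[blocks];
    if (cudaMemcpy(h_cycles, d_cycles, sizeof(h_cycles),
                   cudaMemcpyDeviceToHost) != cudaSuccess) {
        fprintf(stderr, "kernel or copy failed\n");
        return 1;
    }

    // Peak clock frequency of device 0, in kHz.
    int clock_khz = 0;
    cudaDeviceGetAttribute(&clock_khz, cudaDevAttrClockRate, 0);
    if (clock_khz <= 0) {
        fprintf(stderr, "could not query clock rate\n");
        return 1;
    }

    for (int b = 0; b < blocks; ++b)
        printf("block %d: %lld cycles, about %.6f ms\n",
               b, h_cycles[b], cycles_to_ms(h_cycles[b], clock_khz));

    cudaFree(d_data);
    cudaFree(d_cycles);
    return 0;
}
```

The count is taken per block because the counter is per multiprocessor, so readings from blocks that ran on different multiprocessors should not be subtracted from each other. For timing a whole kernel from the host, `cudaEventRecord` with `cudaEventElapsedTime` is usually the simpler route.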

Interested as well