Timing and Profiling with OptiX

aksxay · December 5, 2016, 3:34pm

Hey

So I have been working on a Bidirectional Path Tracer with optix and having finally completed it, was moving on to analysis on the kernels. With OptiX, the only way I found to actually do timing was the following:

clock_t start_time = clock();

// some small amount of code

clock_t stop_time = clock();
	
int time = (int)(stop_time - start_time);
rtPrintf("time in func %fms\n", time / clockRate);

where clockRate is the shader clock frequency that I found out.
My question is, is there any other “standard” way to do this in other kernel functions, like cudaEvent? (cudaEvents do not work in a RT_PROGRAM since cudaEvent is a host function)
Also, profiling with NSight gives information only about the top level kernel function, am I missing something or is there another way to profile with OptiX?

Thanks.

Keith_Morley · December 5, 2016, 4:10pm

Hello,

For low level perf counting this is currently the best option. NSight can also be useful for higher level statistics (eg, total local memory accesses, etc). A variation on your above code is to use your ‘time’ variable to render a heat-map like rendering where costly pixels are white and low cost is close to black.

We are hoping to improve support for NSight such that source level profiling can be performed on optix codes, but this is not ready yet.

We are also working on some improved diagnostic reporting tools for an upcoming release. This will include reporting of time spent and invocation counts for each of the programs in your kernel as well as other helpful statistics.

Thanks

aksxay · December 5, 2016, 7:34pm

Thank you for your response. Looking forward to upcoming updates with NSight and OptiX. The heat-map was a really effective analysis and thank you for the idea.

Topic		Replies	Views
Timing rtTrace via NVAPI OptiX	3	1102	June 14, 2022
Compute rays/sec for Optix Program OptiX	2	137	November 26, 2024
OptiX Shader Kernel Profiling Nsight Compute	2	43	December 12, 2025
OptiX 7 visual profiling with timeline OptiX	2	754	June 14, 2022
Measuring Execution Time Inside a GPU Kernel Nsight Compute cuda , nsight	2	1913	January 23, 2024
Clock() function in Optix OptiX	3	699	October 12, 2021
Timing inside the kernel How to measure times inside the kernel? CUDA Programming and Performance	10	12266	December 21, 2009
OptiX profiling? Nsight Compute cuda , optix	8	1214	November 27, 2023
OptiX and Performance Counter reports in Nsight Compute OptiX	5	977	June 14, 2022
Optix profiling using Nsight OptiX	10	2338	June 14, 2022

Timing and Profiling with OptiX

Related topics