Hello,
I have a long-running kernel to profile. Collecting all the metrics with nvprof takes more than 5 minutes. Is there a way to speed up the profiling other than disabling replay?
Best,
Hi, smartvoice
I have passed the question on to our dev.
Please wait for the response.
Best Regards
VeraJ
Hi, smartvoice
The following is the advice from our dev:
The profiler requires multiple passes to collect all the metrics, because only a limited number of hardware events can be collected in a single pass. This can significantly increase the application's execution time. There are several ways to reduce the execution time of a kernel or application under profiling:
Thanks for the suggestions!