Hi folks -
I’m trying to use nvvp to profile some code I wrote for the GPU. If the kernel completes in a few seconds or less, its measurements show up as a row on the timeline once profiling finishes, as expected. However, if the kernel takes longer than a few seconds to run, no measurements are shown and there is no entry for the kernel on the timeline.
Is this expected behavior?
I’ve tested this with multiple programs on multiple machines. I’m running CUDA 7.5 on CentOS 7.