Hi,
I had a really tough time running the Visual Profiler from my laptop. When I try to run it, it keeps generating the timeline, and when the application closes it either fails with a signal 134 error or reports a display problem.
Since I needed the profiling urgently, I used nvprof and gprof and got separate profiles for the CUDA kernels and the CPU. But I am not sure whether I can rely on these two data sets, as they were generated separately. Can someone from NVIDIA comment on this and guide me on how to get a whole-system picture (CPU + GPU profiles), so that I can look at the hotspots?
It’s recommended to run nvprof on the device, save the result to a file, and open that file with the NVIDIA Visual Profiler on the host.
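A minimal sketch of that workflow, assuming the application binary is called ./my_app and the output file my_app_profile.nvvp (both names are hypothetical). It relies on nvprof's --export-profile option to write the results to a file and -f to overwrite an earlier one; the command is only executed if nvprof and the binary are actually present.

```shell
#!/bin/sh
# Hypothetical names: adjust APP and OUT for your own project.
APP=./my_app
OUT=my_app_profile.nvvp

# Build the nvprof command line; -f overwrites an existing output file.
CMD="nvprof -f --export-profile $OUT $APP"
echo "$CMD"

# Run on the Jetson only if nvprof and the application are available.
if command -v nvprof >/dev/null 2>&1 && [ -x "$APP" ]; then
    $CMD
fi

# Afterwards, copy $OUT to the host (for example with scp) and
# import it into the Visual Profiler there.
```

This keeps the display-heavy Visual Profiler off the device, which may also sidestep the display problem mentioned above.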
To profile a CUDA application, please use nvprof.
Thanks AastaLLL. I wanted a combined profile on the Jetson TX2, which I tried to get as follows:
nvprof --cpu-profiling on
But it shows the warning below:
======== Warning: CPU Profiling is not supported on the underlying platform
Hence I wanted your help on how I can profile and get both results.
Or
Is my approach correct? Approach:
1) Use nvprof for CUDA kernels
2) Use gprof for CPU functions
Steps 1 and 2 are run separately, so the two profiles come from different runs, because gprof needs a change to the makefile to add the -pg option.
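The two-step approach above can be sketched roughly as follows. The names main.cu, my_app and my_app_pg are hypothetical, and the sketch assumes the program is built directly with nvcc, which accepts -pg for gprof instrumentation; with a makefile that calls g++, -pg would go into both the compile and link flags instead. The commands are only executed if the tools and the source file are found.

```shell
#!/bin/sh
# Hypothetical names: main.cu and my_app stand in for your own project.
SRC=main.cu
APP=./my_app
APP_PG=./my_app_pg

# Step 1 (GPU): profile the normal build with nvprof and log the output.
GPU_CMD="nvprof --log-file gpu_profile.txt $APP"

# Step 2 (CPU): rebuild with -pg so the binary writes gmon.out on exit,
# run it once, then let gprof turn gmon.out into a readable report.
BUILD_CMD="nvcc -pg -o $APP_PG $SRC"
GPROF_CMD="gprof $APP_PG gmon.out"

echo "$GPU_CMD"
echo "$BUILD_CMD"
echo "$GPROF_CMD"

# Run only where the tools and the source are available (e.g. on the Jetson).
if command -v nvprof >/dev/null 2>&1 && command -v nvcc >/dev/null 2>&1 \
   && command -v gprof >/dev/null 2>&1 && [ -f "$SRC" ]; then
    nvcc -o "$APP" "$SRC" && $GPU_CMD
    $BUILD_CMD && "$APP_PG" && $GPROF_CMD > cpu_profile.txt
fi
```

The two reports still come from separate runs of differently built binaries, so they can be compared for hotspots but not lined up on a single timeline.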
Thanks AastaLLL.
I wanted to profile the CPU and GPU together, not separately.
I believe the system monitor only covers the CPU.
Please let me know if combined (CPU + GPU) profiling is not available.
Sorry for the confusion. Nsight and nvprof are different profiling tools.
Although nvprof doesn’t support CPU profiling on Jetson, Nsight can do this, as the website mentions.
I also tried a few samples such as dct8x8 and vecadd.
Neither Nsight nor nvprof gives both CPU and GPU data. They only give GPU data, so I am confused about how system-wide profiling can be done.
Did you try these samples?