My CUDA programs are annotated using nvtx markers and range. They are perfectly visible when using Nvidia Visual Profiler.
I try to automate my performance graphing and statistics, and therefore I’d like to collect the marker and range data automatically using scripts. As far as I know, this is not possible using the standard tools (nvprof does not output this data). If anyone knows how to do this using the standard tools I’d be very grateful!
Because it is not possible using the standard tools, I tried to capture the nvtx markers and ranges using a custom program that makes use of the CUPTI library. I started with the activity_trace example, activated the CUPTI_CB_DOMAIN_NVTX and changed the callback handler to output something when a event of this domain is received. BUT: There never seems to be such a callback (but there are markers inside the code!)
Any hints/pointers? I’m using CUDA 5.0 on a Ubuntu machine running Kernel 126.96.36.199 with 2 GTX 560 Ti.
Thank you very much!