I have a question about how to understand the visual profiler results given by the Nvidia Visual Profiler, and hope someone could help me to figure it out!
The question lies in the timeline rows. When I was using nvvp, I found that neither the position of a certain call in the “OpenACC” row is aligned with the corresponding kernel in the “Compute” row, nor is the duration of that call in the “OpenACC” row as same long as the relevant kernel in the “Compute” row. I have some screenshots which can help me to explain this question more clearly however I do not know how to upload the figures.
Thank you very much for your help!