Why do the two ExecutionContext::enqueue bars have different start times and execution times?

Hello,

I have a question about profiling sampleMNIST with Nsight Systems on Xavier.


In the image, why do the two ExecutionContext::enqueue bars have different start times and execution times?

Thank you.

Hi,

The top one is from the CPU side, and the one below it is from the CUDA side.
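
To illustrate why the two bars can differ, here is a rough sketch in plain Python. It is only an analogy and does not use TensorRT or CUDA: it assumes that the CPU-side call just submits work and returns, while the work itself runs later on a separate worker (standing in for the GPU). The names `run_demo`, `work_seconds`, and the `timings` keys are made up for this example.

```python
import queue
import threading
import time


def run_demo(work_seconds=0.05):
    """Simulate an asynchronous enqueue; return CPU-side and worker-side timestamps."""
    tasks = queue.Queue()
    timings = {}

    def worker():
        # Stand-in for the device: waits for queued work, then runs it.
        tasks.get()
        timings["device_start"] = time.perf_counter()
        time.sleep(work_seconds)  # pretend this is the actual execution
        timings["device_end"] = time.perf_counter()

    thread = threading.Thread(target=worker)
    thread.start()

    # CPU side: the call only submits the work and returns right away.
    timings["cpu_start"] = time.perf_counter()
    tasks.put("inference")
    timings["cpu_end"] = time.perf_counter()

    thread.join()  # roughly analogous to waiting for the queued work to finish
    return timings


if __name__ == "__main__":
    t = run_demo()
    print("CPU-side call duration   :", t["cpu_end"] - t["cpu_start"])
    print("worker-side run duration :", t["device_end"] - t["device_start"])
    print("worker start delay       :", t["device_start"] - t["cpu_start"])
```

In this sketch the CPU-side span covers only the submission, and the worker-side span covers the execution, so the two have different start times and lengths. That would be consistent with the two rows in the timeline, assuming enqueue behaves asynchronously in your run.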

Thanks.