Hello, I am trying to understand the meaning of the “Sync Copy Engine Active” and “Async Copy Engine Active” metrics shown in Nsight Systems.
In the Release Notes — nsight-systems 2024.5 documentation, I found the following description:
Copy engines are also exposed in the general metrics-set to better understand GPU activity for some architectures such as NVIDIA Ada Architecture. Synchronous copy engines are used in the graphics command sequences. Async copy engines are used in both compute and graphics to copy resources (typically) in the background.
I am using NVIDIA RTX 4000 Blackwell GPUs, and I can also see both Async Copy Engine Active and Sync Copy Engine Active in Nsight Systems.
However, I am still confused about how I should interpret these metrics.
My question is:
- Should I understand this as
“the GPU has copy engines, and Nsight Systems classifies copy-engine activity into sync and async categories”?
or
- Does this mean
“there are physically distinct types of copy engines, namely sync copy engines and async copy engines”?
If there is any official documentation or architectural explanation clarifying this, I would really appreciate it.
Thank you very much for your time and help.