/dvs/p4/build/sw/devtools/Agora/Rel/CUDA12.4/QuadD/Common/AgentAPI/Src/SessionImpl.cpp(18): rpc Start(.Agent.StartRequest) returns (.Agent.EmptyMessage);
is canceled because the timeout period is expired
Hi @cuda_new_bird, which Nsys command were you using before seeing this error? I noticed you were using Nsys 2023.4.4 from CUDA Toolkit 12.4 release, this Nsys version has been pretty outdated, could you try the latest release 2024.7 from Nsight Systems - Get Started | NVIDIA Developer?
Both DCGM and Nsight Systems use the NVIDIA CUPTI tool under the covers to get information about CUDA kernels. Unfortunately, CUPTI does not support multi-subscribers, so only one of them can successfully attach and get data.
Long term we are working on a way to get around this.