We use nsys commands to profile our application(very large), our commands like:
nsys launch ./application
nsys start
nsys stop
When nsys start begin, the application subtask collapsed and got coredump.
The application run well alone. Anyone know why? and how to debug.
So now two phenomena appear accidentally:
First, my application has two processes. After nsys start starts running, one of the processes will crash, while the other process can be collected by nsys normally.
Second, both processes can collect data, but the collected data is abnormal and there are a large number of blank areas, as shown in the figure.
@herrywangyi163 Could you try the last version and see if there’s any change? It can be downloaded from Nsight Systems - Get Started | NVIDIA Developer. Currently the latest is 2024.2, but 2024.3 is expected to be out perry soon (probably within this week)
Regarding the “Missing Data” on the timeline - how long was the profiling session? And did you change the GPU metrics sampling frequency? It might be related to the amount of sampled data, if it exceeded the buffer size before Nsys has a chance to process, some will be lost.
Is it possible to share the report file and the full Nsys commands you used?