Nsight-system failed to start profiling

364083042 · August 9, 2021, 1:37am

Hi! I am new to Nsight-System and I tried to run some sample commands to understand how it works.
The first thing I did was running on Windows and using SSH to connect the linux server remotely. This part was successful and connected to the server.
Then I tried several commands like: ls, cd ./xxx, top, ll, etc.
But only ls command works and all the rest gave similar error log like below

.
I tried to restart and redo the previous steps but nothing has changed. Could you give me any suggestion?

The local device setting is:
Windows 10
Nsight-system 2021.2.1

The server setting:
Tesla V100
Linux-18.04
CUDA-11.0
Driver Version-450.51.06

hwilper · August 9, 2021, 7:27pm

Hi, let’s make sure I understand what you were doing.

You launched the GUI on Windows, and from the GUI told it you wanted to run a profile on your linux box? And then you told it the application you wanted to launch was a standard Linux tool like top?

(this is a windows screenshot) Like this?

The error that you are getting is that the application you are running actually ended before the tool could find any profiling data

364083042 · August 10, 2021, 1:42am

Hi! thanks for your reply!
Most of the part was correct , My GUI on Windows was like below,
(Windows screenshot)

I wonder if it is the case that “the application running actually ended before the tool could find any profiling data”, would there ba any other commands that I can try to make the tool find any profiling data? Or should I actually run a training program or maybe a small demo can do?

hwilper · August 10, 2021, 6:44pm

Do you have root or paranoid set on the system as needed? See Installation Guide :: Nsight Systems Documentation

You could also grab the samples from the CUDA toolkit and try them, turning on CUDA trace.

364083042 · August 11, 2021, 6:52am

Thank you very much for your reply! Yes that is the problem!
After I changed the paranoid value from 1 to 2. All other commands have worked!
However, I want to dig more on this tool, because all my codes were actually running in docker. So, I wonder if it is possible that I can track the program that is running on docker? If possible, then that would be truely helpful!

hwilper · August 11, 2021, 3:30pm

You can run Nsys in a docker, for more information see User Guide :: Nsight Systems Documentation or my blog post at https://developer.nvidia.com/blog/nvidia-nsight-systems-containers-cloud/

364083042 · August 13, 2021, 10:41am

Hi sorry about the late reply, I did try to follow the Documentation to test it in my container, and it did worked!! Thank you very much for your help!!
Then I got the profile document back like below:

The entire program was itering a resnet50 model doing classification on an image 1000 times.
I think it works fine, but the diagram seems not quite readable. The first time I saw this software was from a TensorRT presentation given by Tencent. During the presentation they showed the entire running prcess like below:

which has blocks with labels on and telling what does each part did. I wonder if it is a hidden function that I didn’t use it correctly or it is achived by using external supports?
I would be appreciate if you can help.
And BTW, thanks for all the replies you gave!

hwilper · August 13, 2021, 2:45pm

Okay, a few things.

You’ll need to zoom in, a lot. I recommend CTRL and the scroll wheel on your mouse as the most effective way to zoom in. When you scroll in, as there is enough space on-screen, the underlying kernels will become big enough to display text. You can also mouse over kernels to get information, but I find it clearer to just scroll a long way in. You are showing 8 seconds of time in your screen shot…there isn’t a time indicator on the one from the presentation, but I bet it is less.

You’ll need to open the process and find the GPU associated with the activity. Then open that GPU. The screen shot you are showing on the bottom is looking inside a particular CPU to see where the items that were annotated with NVTX are showing up.

(and of course, you will want to have NVTX trace turned on, which I assume you do, as it is usually on by default, but I can’t actually see from your screenshot if it is).

This run did not have NVTX, but you can see that I drilled down to ~16 ms of time, that information about the GPU activity (memory transfers) is starting to show up, and that mouse-overing reveals more information:

Now the same thing, but I zoomed in again and you can see the actual working kernels and the mouseover text for same.

364083042 · August 13, 2021, 2:56pm

Hi, that is exactly what I am looking for!! Thank you so much!
I will check it back on Monday because it was running on the server of the company.
Really appreciate for the detailed explanation and screenshot. And now I think I am ready to use it on my project!

system · October 12, 2021, 2:57pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Failed to start profiling ... exited before profiling started Profiling x86 Windows Targets cuda , visual-studio	22	3212	April 7, 2022
Nsight system failed to start daemon while profiling a remote Linux server Profiling Linux Targets	7	1579	November 9, 2022
How to profile remotely (QNX)? Nsight Compute	11	1803	May 27, 2021
Nsight stopped profiling too soon Profiling x86 Windows Targets visual-studio	7	1456	July 28, 2021
Nsight Systems never finishes profiling Profiling x86 Windows Targets	2	1547	November 18, 2019
Nsight Systems fails to start profiling on Win 10 x64 with runtime target side error Profiling x86 Windows Targets	10	2837	May 16, 2023
Nsight system HPC Linux installation nvc, nvc++ and nvfortran	7	1720	August 31, 2021
Nsight Compute Fails To Profile Kernels on WSL Windows11 Nsight Compute	4	676	April 15, 2024
[SOLVED] Nsight compute unable to connect 2070 super Nsight Compute	3	2881	September 3, 2019
Nsight-Compute returns “No kernels were profiled” warning Nsight Compute	9	1338	July 27, 2023

Nsight-system failed to start profiling

Related topics