Hardware: Nvidia Jetson Xavier NX
Command (as root): nsys profile --trace-fork-before-exec=true --process-scope=process-tree python3 test.py
Nsight systems version: NVIDIA Nsight Systems version 2021.2.3.8-78c8c79
My goal: See the GPU usage of a python script including all the child processes forked within it.
Hi, I get this unrecognised option '--trace-fork-before-exec=true' message when I try to profile the GPU usage of a python script that forks some child processes that also uses the GPU. I’m confused because this option is listed in the user guide of nsight systems.
In my understanding this option is needed to profile my script since python doesn’t need to exec after it forks, however other methods around this is also appreciated.
Sorry for the confusion. That option is not available for the Tegra target. We are in the process of changing --help to be a little more user-friendly about what options are available on a platform.
And no, you do not need this for your python script. We use this for C++ forks, where trace behavior is undefined if you try to trace between fork and exec call. Because Nsys might crash in that situation, we require the explicit request.
The above is what I’m getting when running my script,
pid 23021 forks two children 23053 and 23054 to do 5 TensorRT inferences each.
I’m not getting any GPU information, also when I compare other people’s profiling output:
Hi @user107170, what was the operating system that you were using? Was it QNX?
If 23053 and 23054 were forked but not exec-ed, we won’t trace it unless --trace-fork-before-exec=true is specified. However, this feature is not supported on QNX currently. It’s only supported on Linux platforms (x64, PPC, L4T, SBSA). That’s why you do not see CUDA APIs for these processes. The reason you don’t see “CUDA (GM20B)” row is due to the same reason - CUDA is not traced after fork but before exec.
The uname -a output is Linux username-xavier 4.9.253-tegra #1 SMP PREEMPT Mon Jul 26 12:19:28 PDT 2021 aarch64 aarch64 aarch64 GNU/Linux
The OS was installed by downloading jetson-nx-jp46-sd-card-image.zip from nvidia’s website on December 4th, 2021, and flashing the SD card. However nsight cli was not in it so I downloaded nsight-systems-cli-2021.2.3_2021.2.3.8-1_arm64.deblast week using nvidia sdkmanager, copied it to the Xavier NX, and installed it.
Hi @user107170, it seems this switch will only be available for L4T platform for version >= 2021.5. Unfortunately the current public release version for L4T (bubdled with Jet Pack 4.6) is 2021.2, so you’ll have to wait for newer version to be available.
Are you an NDA customer with NVIDIA? If so I can share you an internal build for latest version.
@user107170 My personal expectation is that there will be a new Jet Pack release in about a month which contains Nsight Systems 2021.5, but since Jet Pack release schedule is decided by a different team, I’m not sure of it.
You may be able to get more information about when will there be a Jet Pack update from Jetson forum.