Running Nsight profiling on code within Docker container

Hello, I am trying to run the Nsight Systems profiler on some code I have inside a Docker container. I am not sure what the correct way to do that is.

  1. I tried to run the profiler on the Nano inside the container, but the Nano just freezes. Is the Nano supposed to be able to run it, or is it a problem in my code?
  2. I also saw someone suggest running the profiler on a remote host and running the code on the Nano remotely, but how do I do that if the code is inside a container?

If anyone can help me, I would really appreciate it.

There are a couple of resources I am going to point you at. There is a blog post that I wrote on using Nsys in containers and the cloud - Using NVIDIA Nsight Systems in Containers and the Cloud | NVIDIA Technical Blog

There is also documentation on running in docker at User Guide :: Nsight Systems Documentation (that’s a direct link to the section on running in Docker, this interface just munges the name poorly).

Hey, thanks a lot for the links. I tried reading through them; however, I didn’t find a way to connect Nsight from a remote desktop to a container that is running on the Nano (that might be because I didn’t fully understand everything — I’m pretty new to using Docker and Nsight). Is that possible? I’m asking because when I run Nsight on the Nano inside the container, the Nano freezes or takes a few hours to finish.

Thanks again for the help

I’m not too familiar with the combination of Docker and the Nano; let me loop in a Tegra expert.

@Andrey_Trachenko to answer or assign.

Hello,
I need some help with running my project using Docker. I’m trying to use the nsys profiler, but I’m having trouble combining the two commands into one. Here’s what I’ve tried so far:

nsys profile \
    -w true \
    -t cuda,nvtx,osrt,cudnn,cublas \
    -s cpu \
    --capture-range=cudaProfilerApi \
    --capture-range-end=stop-shutdown \
    --cudabacktrace=true \
    -x true \
    -o my_profile \
    docker run --shm-size 2g --gpus all --rm …

Greetings,

When you do docker run ..., you are using the docker CLI to ask the Docker engine to start a container on your behalf. Your command, as currently structured, would profile the docker call itself, not the process inside the container.

You need to have nsys available inside the container, and then run your container like:

docker run <docker options> \
    <image>:<tag> \
    nsys profile <nsys options> \
    <your application and options>

This also means that the profiling report will be generated inside the container as well, so you’ll want to map a volume into the container for the output, and then use the nsys `-o` option to make sure you are writing to that mapped volume.
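Concretely, that pattern might look like the following sketch. The image name, application command, and paths here are hypothetical placeholders, not taken from your setup:

```shell
# Hypothetical sketch: map a host directory into the container and
# point nsys at it. Image name, app command, and paths are placeholders.
docker run --gpus all --rm \
    -v "$(pwd)/reports:/reports" \
    my-image:latest \
    nsys profile -o /reports/my_profile \
    python app.py
# When the container exits, the report is available under ./reports on the host.
```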

The nsys documentation on profiling containers has more information regarding collection of CPU sampling inside a container that you should look over as well.

Additional side note: you’ve replied to an older thread that relates to embedded-systems profiling; are you also trying this on a Tegra?

Hello mhallock,

Thank you for your response. I tried this earlier, but it didn’t work. I can’t even launch the container with -it, because it starts the simulation and finishes immediately. The problem is that my Docker command is tailored to launching simulations: I have to pass the data repository name “simulation_test” and the driver I used, “local,” along with the Docker image name “compute.” I have tried several times to run nsys inside the container, but it doesn’t work. That’s why I want to know if there’s a way to combine these commands. Here are the complete commands I use, so that it might be clearer for you:

  1. I use docker build to build my Docker image.

  2. Then, I run docker run --shm-size 2g --gpus all --rm -v $(pwd)/data/simulation_test:/app/data/simulation_test my_compute simulation_test local.

  3. Additionally, I have the nsys command I wrote above, with all the options I want.

To answer your question, I’m not using Tegra. I just want to profile my PyTorch model with nsys, but the Docker setup is blocking me.

With respect to your comment about -it: you could try supplying --entrypoint=/bin/bash in your docker arguments to override the ENTRYPOINT directive in your Dockerfile, and do not supply the program arguments simulation_test local. That should let you get a shell in the container, where you can then run nsys manually.

Again, you’ll need to have nsys available within the container’s filesystem.
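Put together with the commands you listed, the interactive approach could look like this sketch. The volume path and image name come from your docker run command; I don’t know what your Dockerfile’s ENTRYPOINT actually runs, so that part stays a placeholder:

```shell
# Sketch of the interactive-debugging approach: override the entrypoint
# to get a shell, keeping your existing volume and GPU options.
docker run -it --shm-size 2g --gpus all --rm \
    --entrypoint=/bin/bash \
    -v "$(pwd)/data/simulation_test:/app/data/simulation_test" \
    my_compute
# Inside the container shell, run the original entrypoint under nsys, e.g.:
#   nsys profile -o /app/data/simulation_test/my_profile \
#       <original entrypoint command> simulation_test local
```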

It works perfectly, thank you so much!