Relation between MPS, Nsight Systems, CUDA Drivers and Singularity Containers

I am trying to profile a multi-process application running inside a container with MPS. The relationship between MPS and multi-process applications is clear to me: MPS merges the GPU contexts of multiple processes into a single one, avoiding the overhead of context switching. I had no problem profiling the application without containers. However, I am confused about what happens when Singularity containers are involved.

First of all, the "--nv" option is needed to expose host GPU resources to Singularity containers. My understanding is that only the CUDA driver is shared with the container through "--nv", while the CUDA libraries (the toolkit) are installed inside the container image. I am wondering if this understanding is accurate.
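For context, here is roughly how I launch the container and check where each component comes from (the image name "app.sif" is just a placeholder for my setup):

```shell
# Launch the container with host GPU support; "app.sif" is a placeholder image name
singularity exec --nv app.sif nvidia-smi      # reported driver version comes from the host
singularity exec --nv app.sif nvcc --version  # reported toolkit version comes from the image
```
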

Secondly, I am wondering how MPS interacts with applications inside containers. Should the MPS control daemon be started from within the container, or is it shared from the host through "--nv"? Also, I observed that root privilege is needed to write to "/var/log/nvidia-mps"; inside the container, the log cannot be written there when I start MPS. What is the correct way to use MPS with containers?
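For reference, the workaround I have been trying is to redirect the MPS pipe and log directories to user-writable locations before starting the daemon (the paths under /tmp are arbitrary choices of mine):

```shell
# Redirect MPS pipe/log directories to user-writable paths (arbitrary choices here)
export CUDA_MPS_PIPE_DIRECTORY=/tmp/nvidia-mps
export CUDA_MPS_LOG_DIRECTORY=/tmp/nvidia-mps-log
mkdir -p "$CUDA_MPS_PIPE_DIRECTORY" "$CUDA_MPS_LOG_DIRECTORY"

# Start the MPS control daemon in the background
nvidia-cuda-mps-control -d
```

My assumption is that client processes must see the same CUDA_MPS_PIPE_DIRECTORY value in their environment in order to connect to the daemon, but I am not sure how that carries over into the container.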

Thirdly, how can I profile MPS-based multi-process applications from inside a container? In my application, a server handles jobs (including kernel launches) forwarded from multiple clients, and it is the server's performance that I care about. Specifically, what would the Nsight Systems command line look like? Should Nsight Systems be started from within the container on the command line, or from the outside using the GUI?
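To be concrete, my naive attempt would look something like the following, profiling only the server process from inside the container ("app.sif", "./server", and the report name are placeholders for my setup):

```shell
# Profile the server process with Nsight Systems from inside the container;
# "app.sif", "./server", and "mps_server_report" are placeholders
singularity exec --nv app.sif \
    nsys profile --trace=cuda,osrt -o mps_server_report ./server
```

I am unsure whether this captures the kernels that MPS executes on behalf of the clients, or only activity attributed to the server process itself.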

I would really appreciate it if someone could offer general advice for these tasks. Thanks in advance.