Hello,
I am using ncu in multi-process service(MPS). The ncu command line options used are:
ncu --metrics gpc__cycles_elapsed.max application
I am getting nan for my run as follows.
The details of the GPU are as follows.
GPU card: NVIDIA A40
Driver version: 515.65.01
CUDA Version: 11.7
To start the MPS, I have used the following commands:
export CUDA_VISIBLE_DEVICES=“0”
nvidia-smi -i 0 -c EXCLUSIVE_PROCESS
nvidia-cuda-mps-control -d
However, when I run the application without MPS, it gives the correct profiling results.
Any help would be appreciated.