How to control the resource of each client in NVIDIA-MPS

In nvidia-mps, we launch the mps-server by running sudo nvidia-cuda-mps-control -d , I have two questions.

  • How to specify which GPU to run mps-server when I have multiple GPUs on the same server.
  • How to control the resources (such as computation and memory) allocated for each mps client when I have multiple concurrent processes?

cuda

Hi @Daniel_Wong
Your question might be better suited for CUDA Programming and Performance - NVIDIA Developer Forums forum branch.

I have moved your post there.