In nvidia-mps, we launch the mps-server by running sudo nvidia-cuda-mps-control -d
, I have two questions.
- How to specify which GPU to run mps-server when I have multiple GPUs on the same server.
- How to control the resources (such as computation and memory) allocated for each mps client when I have multiple concurrent processes?