Hi, I am CUDA developer working on improving performance of CUDA program using MPS.
I am using Volta MPS Execution Resource Provisioning method to improve performance of CUDA MPS - reference : https://docs.nvidia.com/deploy/mps/index.html#topic_3_3_5_2
I am working on this environment :
- Tesla V100-SXM2
- VGRAM : 16GB
- CUDA version : 10.0
- Driver Version : 410.104
I tried many ways for this method, however it didn’t work.
First, I turned on MPS server using this code :
following must be performed with root privilege
nvidia-smi -i 0 -c EXCLUSIVE_PROCESS
taskset -c 0 nvidia-cuda-mps-control -d
And then I runned 10 clients simultaneously.
For Volta MPS Execution Resource Provisioning, I set CUDA_MPS_ACTIVE_THREAD_PERCENTAGE to 10.
However I couldn’t observe improvement. The result was same although I set value to 20 using other formula.
Even I set value to 0, the output was almost same. I think the program couldn’t be run if I set that value to 0.
How can this happen?
Is there any mistake I made? Or should I change process for this method?
Tae Young Yeon.