Hi,
I know for Volta GPUs setting CUDA_MPS_ACTIVE_THREAD_PERCENTAGE in a MPS client’s environment will limit the client’s thread usage. Is there a similar way to limit the GPU memory usage for a client process?
thanks,
Xiaoning
Hi,
I know for Volta GPUs setting CUDA_MPS_ACTIVE_THREAD_PERCENTAGE in a MPS client’s environment will limit the client’s thread usage. Is there a similar way to limit the GPU memory usage for a client process?
thanks,
Xiaoning
Anybody knows? Thanks.