General question on MPS set_active_thread_percentage

shanggdlk · December 14, 2020, 7:06pm

Hi, there,

I read the MPS document and want to leverage set_active_thread_percentage to limits the GPU resource to each running model. My setup is like this: I run two inference models at the same time on the MPS server and want to see their concurrency performance. I did export set_active_thread_percentage=50 on the command line and start the MPS service. Then I run python model1.py & python model2.py on the command line. I’d like to know whether set_active_thread_percentage is applied to each inference model (meaning that each model inference takes 50% of SMs, for example) or the two models as a whole (meaning these two models together takes up 50% SMs of the GPU). If it is the latter case, can you tell me how to apply 50% to each model?

Thanks!

Topic		Replies	Views
Set_default_active_thread_percentage mps server limits memory too CUDA Programming and Performance	1	444	February 15, 2023
MPS set_default_active_thread_percentage not working as expected CUDA Programming and Performance	3	2082	November 23, 2021
Multi-Process Service Active Thread Percentage CUDA Programming and Performance	0	461	May 5, 2022
Multi-Process Service setting CUDA_MPS_ACTIVE_THREAD_PERCENTAGE variable while application is running DGX User Forum	1	639	May 8, 2025
MPS: Limiting threads to different thresholds for multi-GPU processes CUDA Programming and Performance tensorflow , kernel , ubuntu , python , linux	1	723	October 27, 2021
MPS with multiGPUs Triton Inference Server (archived)	0	981	May 9, 2020
Can I dynamically change CUDA_MPS_ACTIVE_THREAD_PERCENTAGE to a running MPS process? CUDA Programming and Performance	2	513	May 8, 2025
Improving MPS performance using Volta MPS Execution Resource Provisioning CUDA Programming and Performance	5	1362	July 4, 2019
Can CUDA MPS limit the GPU memory usage of a client process? CUDA Programming and Performance	1	719	May 7, 2020
MPS thread limit and 100% GPU usage CUDA Programming and Performance	6	92	June 10, 2025

General question on MPS set_active_thread_percentage

Related topics