Multi-Process Service Active Thread Percentage

The MPS document says changing the CUDA_MPS_ACTIVE_THREAD_PERCENTAGE variable does not affect the existing client process and/or client CUDA contexts. I am wondering why MPS is designed in this way? Why couldn’t the MPS change the active thread percentage while the process or the context is running? Is it because of the scheduling algorithm behind it?