Why is MPS not default?

myih · September 7, 2018, 6:36pm

I was experimenting with MPS, and it did improve performance when a single task doesn't saturate the GPU.
Is there a performance-related reason why MPS is not the default when running multiple processes?
Or is it just convention to give all the resources to one process so that it finishes as soon as possible?

Also, what's the difference between using MPS to share the hardware and using multiple CUDA streams, in terms of scheduling and workload balancing?

Thanks!

Robert_Crovella · September 7, 2018, 8:21pm

There are some limitations on CUDA behavior when MPS is used. These are covered in the MPS manual.
