Unable to see effect of MPS

anshulpandey0606 · September 19, 2024, 7:03am

Hi all,

I’m trying to use MPS (Multi-Process Service) on my L4 GPU server, but I’m not seeing any noticeable benefit. I have two processes, each with a single thread, and each thread uses CUDA to run inference on a model. When I profile my application with and without MPS, there is no significant difference in the start and end times of the processes. In other words, my processes appear to run in parallel or concurrently even without MPS.
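A minimal sketch of how such a with/without-MPS comparison could be quantified from recorded timestamps. The helper names and the example numbers are hypothetical and are not taken from my actual application. Note that overlapping process lifetimes alone would also be expected under time-slicing, so the sketch additionally compares the concurrent wall time against the sum of the two solo run times.

```python
from typing import Tuple

# (start, end) wall-clock seconds for one process; hypothetical representation
Interval = Tuple[float, float]


def overlap_seconds(a: Interval, b: Interval) -> float:
    """Return how long both processes were alive at the same time."""
    return max(0.0, min(a[1], b[1]) - max(a[0], b[0]))


def speedup_vs_serial(solo_a: float, solo_b: float, concurrent_wall: float) -> float:
    """Return (sum of solo run times) / (wall time when run together).

    A value near 1.0 means running together was no faster than running
    back to back; a value above 1.0 means the GPU work actually overlapped.
    """
    return (solo_a + solo_b) / concurrent_wall


if __name__ == "__main__":
    # Made-up timings, for illustration only.
    proc_a = (0.0, 10.0)
    proc_b = (0.5, 10.5)
    print("lifetime overlap (s):", overlap_seconds(proc_a, proc_b))
    print("speedup vs serial:", speedup_vs_serial(5.2, 5.1, 10.5))
```

Computing the same ratio once with the MPS daemon running and once without would show whether MPS changes the total time, which start and end timestamps alone do not show.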

Could this be due to the time-sliced scheduler of the GPU? If so, how can I verify whether time-sliced scheduling is occurring in the absence of MPS, and whether the GPU is context switching between the threads of the two processes?

Additionally, does anyone have MPS profiling statistics for GPU utilization during multiple concurrent inferences of the ResNet50 model? I am currently running inference with a ResNet-based model, and in my case GPU utilization (as reported by nvidia-smi) reaches 100% shortly after I start the two threads.
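A minimal sketch of how the utilization figure could be sampled programmatically instead of read off the terminal, assuming `nvidia-smi` is on the PATH. The function names are hypothetical. As far as I understand, `utilization.gpu` reports the percentage of the sample period during which at least one kernel was executing, so a reading of 100% does not by itself mean that all SMs are occupied.

```python
import subprocess
from typing import List

# Query only the GPU utilization column, one bare integer per GPU.
QUERY = ["nvidia-smi", "--query-gpu=utilization.gpu", "--format=csv,noheader,nounits"]


def parse_utilization(text: str) -> List[int]:
    """Parse one integer percentage per GPU; skip blank or non-numeric lines."""
    values = []
    for line in text.splitlines():
        field = line.strip()
        if field.isdigit():  # ignores entries such as "[N/A]"
            values.append(int(field))
    return values


def sample_utilization() -> List[int]:
    """Run nvidia-smi once and return utilization per GPU (needs an NVIDIA driver)."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_utilization(result.stdout)
```

Calling `sample_utilization()` in a loop while the two processes run would give a utilization trace that can be compared between the MPS and non-MPS runs.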

Could you kindly share any benchmark results pertaining to an L4 GPU for any ResNet-based model, if available?
