Hi there,
I have an NVIDIA Tesla T4 GPU deployed on GCP with 2 vCPUs and 32 GB of system memory. The GPU is in DEFAULT compute mode (which I assumed enables MPS-style sharing). The GPU has 16 GB of memory, of which a gunicorn process is currently holding about 4500 MB.
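For reference, these are the nvidia-smi queries I am using to check the compute mode and the per-process memory (standard query fields, as far as I know):

```
# Show the current compute mode (DEFAULT / EXCLUSIVE_PROCESS / PROHIBITED)
nvidia-smi --query-gpu=compute_mode --format=csv

# List processes holding GPU memory (this is where the ~4500 MB gunicorn process shows up)
nvidia-smi --query-compute-apps=pid,process_name,used_memory --format=csv
```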
I activated two separate conda environments and started a Django server in each: one with “python manage.py runserver 0.0.0.0:8000” and the other on port 8001.
Since each image-processing task takes almost 2 GB of GPU memory, and 4500 MB + 2 × 2 GB is well under the 16 GB total, I expected the GPU to handle requests from both servers in parallel. Instead, the GPU is processing them serially.
Is there anything I can do about this (for example, via an nvidia-smi option or a conda setting)?
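In case it matters: after reading a bit more, my understanding from the NVIDIA MPS documentation is that MPS is actually a separate control daemon that has to be started explicitly, and that DEFAULT compute mode alone does not start it. A rough sketch of what I believe the standard setup looks like (I have not run this yet, so please correct me if it is wrong):

```
# Assumption: single-GPU machine, GPU index 0
export CUDA_VISIBLE_DEVICES=0

# EXCLUSIVE_PROCESS mode is recommended so every client connects through MPS
sudo nvidia-smi -i 0 -c EXCLUSIVE_PROCESS

# Start the MPS control daemon; CUDA processes launched afterwards attach to it
sudo nvidia-cuda-mps-control -d

# Later, to shut the daemon down:
# echo quit | sudo nvidia-cuda-mps-control
```

Would starting MPS this way and then restarting the two Django servers allow the two image-processing workloads to overlap on the T4?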