How to use CUDA Green Context with MPS

raywan · December 10, 2024, 9:46am

Hello,
I’m conducting an experiment where I run identical MMUL workloads on separate SMs using the Green Context API. Without enabling MPS, the two processes do not execute in parallel as expected. However, when I enable MPS, I encounter a limitation: it seems that the ability to specify the number of SMs via Green Context is no longer available. Instead, MPS takes over, and the CUDA_MPS_ACTIVE_THREAD_PERCENTAGE environment variable appears to override the SM allocation.

Could you clarify if this behavior is expected, and is there a recommended way to maintain explicit SM allocation when using MPS alongside the Green Context API?

kperelygin · December 20, 2024, 8:49pm

Yes this is expected behavior - you have to specify the SM affinity through either MPS’s dynamic active thread percentage or through the static partitioning of green contexts. If you want to maintain the static SM allocation while still taking advantage of not having the context switch between multiple processes, you can still use MPS + Green Contexts but you need to either unset or set to 100 the CUDA_MPS_ACTIVE_THREAD_PERCENTAGE variable.

Let me know if you have any further questions!

Topic		Replies	Views
Interaction between Green Contexts, MPS, and GPU resource allocation for parallel kernel execution CUDA Programming and Performance cuda	0	150	June 3, 2025
The Usage Scenarios of Green Context CUDA Programming and Performance cuda	5	226	October 17, 2025
Questions about Resource Isolation and Execution Control using CUDA Green Contexts + MPS CUDA Programming and Performance cuda	8	278	October 14, 2025
Using Green Context in CUDA on Jetson Devices with Ampere Architecture CUDA Programming and Performance cuda , jetson	3	291	April 28, 2025
Green-context-sm-allocation-not-affecting-kernel-runtime in Jetson Orina Jetson Orin Nano cuda	8	209	May 16, 2025
Question about scheduler CUDA Programming and Performance	14	191	July 7, 2025
What is the best way to partition the SM of a GPU? CUDA Programming and Performance hw , cuda , kernel	2	1381	August 17, 2023
Question about CUDA MPS CUDA Programming and Performance	15	3226	August 22, 2022
How to Enforce Per-Client Memory and SM Limits in CUDA MPS? CUDA Programming and Performance cuda , kernel , inception	1	148	August 13, 2025
Green Context SM Allocation Not Affecting Kernel Runtime CUDA Programming and Performance cuda , jetson	9	371	May 6, 2025

How to use CUDA Green Context with MPS

Related topics