I have a question about monitoring the number of SMs used by a CUDA kernel in a Multi-Process Service (MPS) environment.
If we run a single CUDA stream, the maximum number of SMs is static for that stream, so we can estimate the number of SMs used by each CUDA kernel.
Under an oversubscribed MPS environment, however, the maximum number of SMs per CUDA stream is dynamic.
This is because each CUDA stream sets its own SM count, and the total can exceed the device, as in the following inequality.
My case is, for example:
CUDA stream A (10) + B (20) + C (30) > GPU device (40)
i.e. A = 10, B = 20, C = 30, and the device has 40 SMs (the unit set via CU_EXEC_AFFINITY_TYPE_SM_COUNT).
Each CUDA stream runs multiple CUDA kernels, and the SM allocation differs per kernel.
For example, assume the SM allocations of stream C's kernels are as follows:
C-1: 30
C-2: 20
C-3: 10
When we try to run stream B in parallel, is there any method to monitor the number of available SMs?
For example, while C-1 is running, can we know that only 10 SMs remain?
I want to monitor the number of SMs allocated to each running CUDA kernel, because the processing resource consumed is the number of SMs multiplied by the time duration.
For reference, I set the SM limits with cuCtxCreate_v3/cuCtxCreate_v4 using CU_EXEC_AFFINITY_TYPE_SM_COUNT:
https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__CTX.html#group__CUDA__CTX_1gd84cbb0ad9470d66dc55e0830d56ef4d