MPS: Best practise for SM partitioning

manospavlidakis · January 11, 2022, 8:24am

In MPS document 2021 there is a new section 5.2. BEST PRACTICE FOR SM PARITITIONING. In that section, it is mentioned that “Creating a context is a costly operation in terms of time, memory, and the hardware resources”. However, with MPS only one CUDA Context is created in the GPU, to allow kernels to execute in parallel. Consequently, I could not understand why one should care about contexts? The only explanation could be that you refer to Client CUDA Contexts. Can you please explain?

Additionally, in the example that you create a pool of contexts why do you use cudaLaunchCooperativeKernel? According to the documentation cudaLaunchCooperativeKernel " Launches a device function where thread blocks can cooperate and synchronize as they execute". As a result, simple cudaLaunchKernel will have the same behavior, since we do not have kernels that cooperate. Is that correct?

Thank you in advance. Manos

Topic		Replies	Views
What is the best way to partition the SM of a GPU? CUDA Programming and Performance hw , cuda , kernel	2	967	August 17, 2023
Why kernels from different cuda contexts could not run concurrently CUDA Programming and Performance	0	303	March 4, 2021
Question about GPU sharing of Multi-process service CUDA Programming and Performance	9	6412	April 30, 2018
GPU sharing among different application with different CUDA context CUDA Programming and Performance	23	18139	December 17, 2020
Question about CUDA MPS CUDA Programming and Performance	15	2657	August 22, 2022
concurrent execution of cuda kernels from different contexts CUDA Programming and Performance	1	618	April 18, 2019
Single process multi context CUDA Programming and Performance	1	500	November 10, 2023
Concurrency in MPS and multi-stream GPU-Accelerated Libraries	2	1628	October 12, 2021
Concurrent execution of kernels from different contexts CUDA Programming and Performance	1	5995	July 9, 2010
MPS: can pre-volta devices have multiple kernels execute on the same SM? CUDA Programming and Performance	0	426	May 24, 2019

MPS: Best practise for SM partitioning

Related topics