cuDevicePrimaryCtxRetain vs cuCtxCreate

excubiteur · August 17, 2019, 3:08am

I see this in official Nvidia documentation:

From CUDA Runtime API :: CUDA Toolkit Documentation

Note that the use of multiple CUcontext s per device within a single process will substantially degrade performance and is strongly discouraged. Instead, it is highly recommended that the implicit one-to-one device-to-context mapping for the process provided by the CUDA Runtime API be used.

From CUDA Driver API :: CUDA Toolkit Documentation

Note:
In most cases it is recommended to use cuDevicePrimaryCtxRetain.

Under what circumstances would cuCtxCreate have to be used instead of cuDevicePrimaryCtxRetain?

Robert_Crovella · August 17, 2019, 4:18am

If you wanted to create multiple contexts per process (per device).

excubiteur · August 17, 2019, 4:40am

To be clear, within a process, I can still use cuDevicePrimaryCtxRetain to create a context each per device.

In a single process, under what circumstances would more than one context per device be needed?

Robert_Crovella · August 17, 2019, 5:38am

I’m not sure I can give an exhaustive answer. One of the aspects of separate contexts is isolation. The address space of one context is isolated from the address space of another context. There might be some situations where that is desirable.

Another aspect of multiple contexts might be called resilience. If I have 2 contexts, and one of them becomes corrupted, the other can still function normally, without requiring a device reset or any other behavior that you would need with the CUDA runtime API to restore behavior.

It might also be useful to have a separate context for a dynamically linked library. In fact, a library might create its own context.

I’m sure I can’t imagine all the cases where multiple contexts might be useful.

excubiteur · August 17, 2019, 5:53am

To help me better understand the level of isolation involved,
compare the following two scenarios (single process)

Two devices and a context each
Single device, two contexts.

What are isolated in 1) that are not isolated in 2)

I guess what I am asking is whether this isolation is also
happening in the device itself rather than just at the host.

Topic		Replies	Views
Video Codec SDK 9.0.20 Samples using cuCtxCreate GPU-Accelerated Libraries	4	617	August 17, 2019
When is it necessary to create Multiple CUcontext s per device within a single process? GPU-Accelerated Libraries	0	509	August 17, 2019
Reccomended way of managing contexts in the driver API CUDA Programming and Performance	2	1003	December 25, 2021
Multiple CUDA contexts per device in a single process CUDA Programming and Performance	2	4978	April 22, 2016
Want examples of using multiple Contexts. CUDA Programming and Performance	1	493	August 18, 2019
CUDA,Context and Threading CUDA Programming and Performance	6	19642	May 29, 2012
Host memory use of retained primary context CUDA Programming and Performance	2	667	June 12, 2021
Confusion about context management by CUDA runtime CUDA Programming and Performance	3	701	December 25, 2023
CUDA contexts CUDA Programming and Performance	2	4928	July 17, 2007
Do we need to create new CUDA contex to use with CUPTI or can the default context be used? CUPTI – CUDA Profiler Tools Interface	2	624	June 19, 2020

cuDevicePrimaryCtxRetain vs cuCtxCreate

Related topics