Confusion about context management by CUDA runtime

huangjk5 · December 24, 2023, 7:17am

After reading two sections of the documentation, 6.2.1. Initialization and 6.31. Interactions with the CUDA Driver API, I have three questions about the automatic context management offered by the CUDA runtime API.
Q1. Suppose I have the code below.

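#include <cuda_runtime.h>

int main() {
    cudaSetDevice(0);  // first use of device 0
    cudaSetDevice(1);  // first use of device 1
    cudaSetDevice(0);  // switch back to device 0
    return 0;
}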

I suppose the CUDA runtime would behave as follows.

1. The runtime sets up something and creates a primary context (context#0) for device 0.
2. The runtime sets up something and creates a primary context (context#1) for device 1.
3. The runtime sets up something and sets context#0 as the calling host thread’s current context.

Is my understanding correct?

Q2. I have two host threads. Suppose the first host thread calls cudaSetDevice(0) explicitly, and then the second one starts and also calls cudaSetDevice(0) explicitly.
For the second call, the runtime will not initialize any new context; it will set the context that was already created as the second host thread’s current context.
Is that correct?

Q3. I learned from the documentation that "there exists a one to one relationship between CUDA devices in the CUDA Runtime API and CUcontexts in the CUDA Driver API within a process." So no matter how many host threads call cudaSetDevice(), if the primary context for the desired device doesn’t exist, the runtime creates it; otherwise, the runtime just sets the existing context as the calling host thread’s current context. Is that correct?

Any help will be appreciated.

Jack

Robert_Crovella · December 25, 2023, 1:55am

The CUDA runtime can be fully utilized without trying to understand the detailed behavior of context management. Furthermore, since this level of detail is unpublished (beyond what is available at that link), it is subject to change. Therefore, depending on a specific interpretation of the detailed behavior may be risky.

In my experience, the CUDA runtime will create one, and only one context per device, per process. The runtime may create contexts on devices that you don’t explicitly use. You can restrict this behavior with CUDA_VISIBLE_DEVICES.

If a thread needs to use a particular device, it will use the context that the runtime has created on that device.

huangjk5 · December 25, 2023, 8:03am

Thanks, Robert. I will keep these two things in mind:

The CUDA runtime will create one, and only one context per device, per process.
If a thread needs to use a particular device, it will use the context that the runtime has created on that device.

Jack
