CUDA Driver API: OpenGL Contexts

Gregory_Diamos · April 30, 2010, 4:03pm

So I am trying to implement a new version of the CUDA Runtime API in Ocelot using the CUDA Driver API 3.0 and am having problems with opengl contexts and cuGLCtxCreate. My first idea was to try to create an opengl context for every application, and then fall back on cuCtxCreate if cuGLCtxCreate failed. However, cuGLCtxCreate segfaults (rather than returns an error) if it is called before glInit() or glutInit() in the host application.

My first question is whether or not this is how cuGLCtxCreate is supposed to work? It seems kind of fishy for an api call to segfault like this.

To get around this, I tried lazily allocating an opengl context on the first open-gl related cuda call. This works for some simple applications (all of the cuda sdk except for volumerender), but it fails in cases where some resources were allocated on a regular context, an opengl buffer was allocated on another context, and a kernel accesses both. This is because both contexts cannot be active at the same time, and resources cannot be shared between contexts.

My only recourse at this point I think is to find some way of having multiple contexts active at the same time (probably not possible), or manually migrating state from one context to the new one when an opengl call is made (difficult),

Any suggestions?

indy2718 · April 30, 2010, 5:19pm

My experience with X11, threading, gl and cuda contexts:

Same thread:
Create X11 Display connection. glX and gl functions use this X11 connection.
create glcontext
make glcontext current
cuGLCtxCreate

It is correct to seg fault (although it could have better error handling) because there is no current gl rendering context that cuGLCtxCreate requires. That context was created by glutInit.
To get around this, one would check for a current gl rendering context first. If it is are valid, then call cuGLCtxCreate.

For me I serialize all gl and cuda calls in the same application. Multiple threads seem to be supported, but the underlying Display must be lock guarded. It may work with multiple threads if there are multiple gpus and Display connections, each having their own XLockDisplay and current context. But you cannot put those contexts in the context shared list because that is a X11/XGL call and each X11 Display is somewhat invisible to each other.

edit: the function call for getting the current context is glXGetCurrentContext. Also glxgetprocaddress may be useful.

Gregory_Diamos · April 30, 2010, 6:47pm

Much appreciated. glXGetCurrentContext seems like a reliable way to determine if cuGLCtxCreate will succeed.

Topic		Replies	Views
Order of cuInit, cuGLInit, cuGLCtxCreate CUDA_ERROR_INVALID_CONTEXT CUDA Programming and Performance	2	9259	April 13, 2009
CUDA / OpenGL interop (2 OpenGL context) CUDA Programming and Performance cuda , opengl	8	2136	March 17, 2023
cudaGraphicsMapResources() and cuCtxCreate() incompatible? CUDA Programming and Performance	9	2092	April 7, 2018
Using same device pointer for two CPU threads CUDA Programming and Performance	4	2960	September 27, 2010
GL interop in multithreaded host app CUDA Programming and Performance	5	12215	December 19, 2011
Segfault when transferring OpenGL context to another thread Jetson TX1	1	1220	December 21, 2015
CUDA GL Interoperability - existing opengl context CUDA Programming and Performance	1	2903	June 25, 2008
cudaGL + multithreading? CUDA Programming and Performance	1	4392	April 29, 2010
cudaGLSetGLDevice bug? CUDA 4.0 CUDA Programming and Performance	4	2906	December 5, 2011
Seperate Cuda thread and Opengl context thread CUDA Programming and Performance	2	10973	April 8, 2009

CUDA Driver API: OpenGL Contexts

Related topics