OpenGL interop performance

There seems to be a lot of unclear information regarding how to efficiently do CUDA/OpenGL interop for textures/surfaces. There are unanswered questions on this forum, and unclear or outright wrong answers elsewhere on this issue.

Right now I have something like:

cudaArray* arrayPtr;

// Map the registered GL texture and get the backing array for mip 0 / layer 0
CHECK_CUDA(cudaGraphicsMapResources(1, &cudaRegisteredTexture, 0));
CHECK_CUDA(cudaGraphicsSubResourceGetMappedArray(&arrayPtr, cudaRegisteredTexture, 0, 0));

// Wrap the array in a surface object so the kernel can write to it
cudaSurfaceObject_t surface;
cudaResourceDesc surfaceDetails{};

surfaceDetails.resType = cudaResourceType::cudaResourceTypeArray;
surfaceDetails.res.array.array = arrayPtr;

CHECK_CUDA(cudaCreateSurfaceObject(&surface, &surfaceDetails));

RenderOptixRaytracingAndCopyToSurface(surface);

// Tear everything down again before handing the texture back to GL
CHECK_CUDA(cudaDestroySurfaceObject(surface));
CHECK_CUDA(cudaGraphicsUnmapResources(1, &cudaRegisteredTexture, 0));

This has noticeable performance hiccups when run every frame - more than you would expect. Per the documentation, cudaGraphicsSubResourceGetMappedArray() is not guaranteed to return the same cudaArray* on every map, and even if it did, it's not clear whether it would be safe to cache the surface object (recreating it only when the array pointer changes).
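For what it's worth, the caching I have in mind looks roughly like this. This is only a sketch (the RenderFrame wrapper and the cached globals are my own names, and whether a surface object stays valid across map/unmap cycles is exactly the open question):

```cpp
// Hypothetical caching scheme: rebuild the surface object only when the
// mapped array pointer actually changes between frames. NOT verified safe.
static cudaArray* cachedArray = nullptr;
static cudaSurfaceObject_t cachedSurface = 0;

void RenderFrame()
{
    CHECK_CUDA(cudaGraphicsMapResources(1, &cudaRegisteredTexture, 0));

    cudaArray* arrayPtr = nullptr;
    CHECK_CUDA(cudaGraphicsSubResourceGetMappedArray(&arrayPtr, cudaRegisteredTexture, 0, 0));

    // Recreate the surface object only if the backing array moved.
    if (arrayPtr != cachedArray)
    {
        if (cachedSurface)
            CHECK_CUDA(cudaDestroySurfaceObject(cachedSurface));

        cudaResourceDesc desc{};
        desc.resType = cudaResourceTypeArray;
        desc.res.array.array = arrayPtr;
        CHECK_CUDA(cudaCreateSurfaceObject(&cachedSurface, &desc));
        cachedArray = arrayPtr;
    }

    RenderOptixRaytracingAndCopyToSurface(cachedSurface);

    CHECK_CUDA(cudaGraphicsUnmapResources(1, &cudaRegisteredTexture, 0));
}
```

If the array pointer is stable in practice, this drops the per-frame create/destroy pair, but I can't find anything in the docs that promises the cached surface object remains valid while the resource is unmapped.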

If the map/unmap itself is what stalls (e.g. CUDA waiting on GL to finish with the texture), it might make sense to cycle between multiple registered textures, but this isn't mentioned anywhere.
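Something along these lines is what I mean - a sketch only, assuming kNumBuffers GL textures were created and registered up front with cudaGraphicsGLRegisterImage(), and with all names other than the CUDA API calls being hypothetical:

```cpp
// Round-robin over several registered GL textures so CUDA writes one
// while GL can still read another, hopefully avoiding a sync stall.
constexpr int kNumBuffers = 3;
static cudaGraphicsResource_t registeredTextures[kNumBuffers]; // registered at init
static int frameIndex = 0;

void RenderFrame()
{
    cudaGraphicsResource_t& res = registeredTextures[frameIndex];
    frameIndex = (frameIndex + 1) % kNumBuffers;

    CHECK_CUDA(cudaGraphicsMapResources(1, &res, 0));

    cudaArray* arrayPtr = nullptr;
    CHECK_CUDA(cudaGraphicsSubResourceGetMappedArray(&arrayPtr, res, 0, 0));

    cudaResourceDesc desc{};
    desc.resType = cudaResourceTypeArray;
    desc.res.array.array = arrayPtr;

    cudaSurfaceObject_t surface;
    CHECK_CUDA(cudaCreateSurfaceObject(&surface, &desc));

    RenderOptixRaytracingAndCopyToSurface(surface);

    CHECK_CUDA(cudaDestroySurfaceObject(surface));
    CHECK_CUDA(cudaGraphicsUnmapResources(1, &res, 0));

    // The GL side would then display the most recently unmapped texture
    // rather than the one CUDA is about to write next frame.
}
```

Whether this actually helps presumably depends on how the driver synchronises at map time, which is the part the documentation doesn't spell out.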

I’d respectfully suggest the CUDA documentation needs more information on best practices here.

After much digging I came across this old document:

I think some of that information should be summarised in the main CUDA documentation.

You’ve perhaps already seen this; it’s a little more recent: