OpenGL Interoperability: Copying from OpenGL to CUDA

I have found that parts of my code perform much better with classic GPGPU techniques in OpenGL (mainly because I make heavy use of the blend stage).

So I use the regular OpenGL render-to-texture mechanism (FBOs) to perform the OpenGL part of the computation. Afterwards I use buffer objects to copy these OpenGL textures into CUDA's global memory (i.e. glBindBuffer, glReadPixels). However, this hurts performance a lot, and I suspect the glReadPixels call is the culprit.
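For reference, here is a minimal sketch of that texture-to-CUDA path, assuming the CUDA 1.x GL interop API (cudaGLRegisterBufferObject / cudaGLMapBufferObject); pbo, width, and height are placeholder names:

```cuda
#include <GL/gl.h>
#include <cuda_gl_interop.h>

GLuint pbo;
glGenBuffers(1, &pbo);
glBindBuffer(GL_PIXEL_PACK_BUFFER, pbo);
glBufferData(GL_PIXEL_PACK_BUFFER, width * height * 4 * sizeof(float),
             NULL, GL_STREAM_COPY);
cudaGLRegisterBufferObject(pbo);   // one-time registration with CUDA

// Per frame: read back the FBO attachment into the bound PBO.
// With a PBO bound to GL_PIXEL_PACK_BUFFER, the last argument of
// glReadPixels is a byte offset into the buffer, not a CPU pointer.
glReadPixels(0, 0, width, height, GL_RGBA, GL_FLOAT, 0);
glBindBuffer(GL_PIXEL_PACK_BUFFER, 0);

// Map the PBO into CUDA's address space and run kernels on it.
float4 *d_ptr;
cudaGLMapBufferObject((void **)&d_ptr, pbo);
// ... launch kernels on d_ptr ...
cudaGLUnmapBufferObject(pbo);
```

Note that glReadPixels here is still a GPU-side copy from the texture into the buffer object; only the glBindBuffer/map step is "free".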

So I was wondering: is there a way to avoid the glReadPixels step and use OpenGL textures within a CUDA context directly?

Not currently. You can’t read directly from OpenGL textures in CUDA.

See the FAQ, question 8.

Thanks for the quick response.

Do you have any idea if it will be possible soon? (“Not currently” makes me think positively)

Or is there any possibility to leverage the blending stage directly from CUDA?

My problem is that I have a very large array that I read from and another that I write to. The reads and writes are completely random, though, and several writes may target the same memory location, so race conditions can occur unless you build a proper schedule ahead of time (on the CPU). The blend stage, however, handles these conditions for free (well, if it weren't for the memory copy further down my pipeline…)

Not anytime soon. It’s not in CUDA 1.1, certainly.

Note that in CUDA you could implement the equivalent of blending by doing a read/modify/write to global memory (as long as each thread is only writing to its own location).
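To illustrate the read/modify/write suggestion, here is a hypothetical sketch of additive blending (the GL_ONE, GL_ONE case) done in a CUDA kernel; it is race-free only because thread i touches element i and nothing else:

```cuda
// Sketch: emulate additive blending with a per-element read/modify/write.
// dst and src are device arrays of n pixels; each thread owns one element.
__global__ void blendAdd(float4 *dst, const float4 *src, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        float4 d = dst[i];              // read
        float4 s = src[i];
        d.x += s.x;  d.y += s.y;        // modify: dst = dst + src
        d.z += s.z;  d.w += s.w;
        dst[i] = d;                     // write
    }
}
```

This only works for gather-style access; it does not help with the scattered, possibly colliding writes described above, which is exactly where hardware blending wins.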

The thing is… blending is atomic. However, CUDA on the 8800 doesn't support atomic operations yet. For things like multiplying a dynamic sparse matrix by a static dense vector, it's really handy.
Also, it's sometimes easier to do list operations in geometry shaders, especially for one-to-many maps.
I'm longing for the day GL and CUDA can be used interleaved without a memcpy…

I have some list operations too, which I will probably implement using the GS as well, to see how that performs…

In our experience list operations are best implemented in data-parallel fashion using CUDA (using scan etc.), rather than using the GS.
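As an example of the data-parallel approach, stream compaction (keeping only the valid list elements) can be done with a flag/scan/scatter pattern. This is a hypothetical sketch; it assumes the flags array has already been run through an exclusive scan (e.g. with the "scan" sample from the CUDA SDK) to produce scanned:

```cuda
// Sketch: scatter step of scan-based stream compaction.
// flags[i]   = 1 if element i should be kept, 0 otherwise
// scanned[i] = exclusive prefix sum of flags = output index of element i
__global__ void scatterValid(const int *in, const int *flags,
                             const int *scanned, int *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n && flags[i])
        out[scanned[i]] = in[i];   // compacted, order-preserving write
}
```

One-to-many expansion works the same way, with the scan run over per-element output counts instead of 0/1 flags.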

That’s good to hear. Can I cite this later in my paper?