OpenGL interop very slow!

rivierakid · April 15, 2010, 6:57am

Hi,

I’m using CUDA v3.0, because of better OpenGL interop. support, but it is very slow. I’m writing simple voxelizer using OpenGL. The result (3d linear memory) could be then used by some CUDA kernels. The problem is the transfer between OpenGL and CUDA.

The main algorithm:

For every slice (z):

rasterize slice into 2d texture
copy 2d texture to CUDA 3d linear memory

The cudaGraphicsMapResources function seems to be very slow (for texture, renderbuffer and pbo too), the speed is almost equal to copying the texture to CPU pinned memory!

BTW I tried cudaGraphicsMapResources without any CUDA and OpenGL calls:

[codebox]

while(1)

{

cudaGraphicsMapResources

cudaGraphicsUnMapResources

}

[/codebox]

No kernel, no OpenGL commands. And it is still very slow.

Any idea?

cbuchner1 · April 15, 2010, 9:11am

Is your compute card also the display card?

Did you, by chance, enable a second monitor output on the display card? In many cases it makes interop slower.

cbuchner1 · April 15, 2010, 9:11am

Is your compute card also the display card?

Did you, by chance, enable a second monitor output on the display card? In many cases it makes interop slower.

rivierakid · April 15, 2010, 10:23am

Yes, there is only one GPU.

I think I found walkaround:

create 3d texture (opengl)
attach to fbo
render to 3d texture (using z-slice)
map 3d texture to CUDA (cudaArray)
use tex3D for lookups

Seems to be pretty fast External Media

rivierakid · April 15, 2010, 10:23am

Yes, there is only one GPU.

I think I found walkaround:

create 3d texture (opengl)
attach to fbo
render to 3d texture (using z-slice)
map 3d texture to CUDA (cudaArray)
use tex3D for lookups

Seems to be pretty fast External Media

mcleary · July 27, 2011, 8:04pm

How fast is “pretty”?

I’m doing just like you, but I’m rendering to a 2D texture.

I noticed that my FPS drops significantly when I attempt to map the texture for cuda reading.

Simon_Green · July 28, 2011, 8:17am

This isn’t really addressing your problem, but another possible solution would be to just do the voxelization in CUDA too :)

Topic		Replies	Views
CUDA-OpenGL interop performance CUDA Programming and Performance	2	2424	May 30, 2014
DX11 <> CUDA interop is slow compared to GL <> CUDA CUDA Programming and Performance	3	3015	January 5, 2020
OpenGL interop performance problems CUDA Programming and Performance	2	1321	February 2, 2010
OpenGL & CUDA interop with surfaces slow... CUDA Programming and Performance	2	949	July 6, 2018
OpenGL interoperability Performance issue concern CUDA Programming and Performance	8	6672	December 3, 2008
OpenGL interop performance ... yes, STILL CUDA Programming and Performance	6	6458	March 29, 2010
CUDA / OpenGL Interoperability : Questions about speed CUDA Programming and Performance	0	855	April 18, 2013
D3D interop RELOADED isn't supposed to be better than OpenGL...? CUDA Programming and Performance	2	3701	April 16, 2009
Question on image procesing performance CUDA Programming and Performance	4	619	August 23, 2018
Cuda-OpenGL Interop. sometimes pop out very low performance CUDA Programming and Performance cuda	0	68	July 4, 2024

OpenGL interop very slow!

Related topics