Argmax with CUDA using OpenGL textures

benoit · April 13, 2007, 11:12pm

Hi,

I am doing a 3D convolution using OpenGL and get my result back in 32 textures (each texture holds 4 z components, one per r, g, b, a channel). The third dimension is hence 32*4 = 128. I would like to take this volume and do an argmax over the third dimension in CUDA.

I would like to compare them all in CUDA and compute a 2D matrix holding the max value over all the textures and their z components.

What is the most efficient way to do that, given that I have a limited number of buffer objects in OpenGL (8 or 16 depending on the driver)?

My idea was the following:

copy all the textures into cudaArrays
bind them to CUDA texture references
fetch the textures inside the kernel

But the problem is that you can’t have a dynamic number of textures.

I need to hardcode each texture reference as a global, like this:

texture<float4, 2, cudaReadModeElementType> tex;

So I’m stuck if I have 64 textures instead of 32…

I don’t know if that’s clear.

Ben

benoit · April 16, 2007, 9:42pm

Does anybody have an idea how to create a dynamic array of textures?

Simon_Green · April 17, 2007, 9:41am

Couldn’t you just declare a large array of texture references and then only use the ones you need? Presumably there is some maximum depth to your volume?

The maximum number of texture samplers is 16 on DirectX 10 class hardware, and CUDA doesn’t get around this. You can dynamically re-bind texture references (samplers) to arrays (i.e. texture images) using cudaBindTexture().

There are several other ways you could do this:

One method would be to create a single OpenGL buffer object, and then read all the textures (the whole volume) into it using glGetTexImage. Then you could map this buffer object in CUDA and calculate the maximums for each of the 128 values in parallel, doing the correct addressing in the CUDA kernel.
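
The kernel side of this single-buffer approach could look roughly like the sketch below. It is only an illustration under stated assumptions: the slices are assumed to be stored back to back in the mapped buffer as RGBA float texels, and the names argmaxOverZ and launchArgmaxOverZ are hypothetical, not from any SDK. Each thread handles one (x, y) position and walks through all slices and channels, so no texture references are needed and the number of slices can be chosen at run time.

```cuda
#include <cuda_runtime.h>

// Assumed layout: numSlices RGBA float images stored back to back, so the
// texel (x, y) of slice s lives at volume[s * width * height + y * width + x].
// Depth index z = 4 * s + c, where c = 0..3 stands for r, g, b, a.
__global__ void argmaxOverZ(const float4* volume, int width, int height,
                            int numSlices, float* maxVal, int* argMax)
{
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    if (x >= width || y >= height) return;

    int pixel = y * width + x;
    int sliceStride = width * height;

    // Start from z = 0; on ties the smallest z is kept.
    float best = volume[pixel].x;
    int bestZ = 0;

    for (int s = 0; s < numSlices; ++s) {
        float4 t = volume[s * sliceStride + pixel];
        float c[4] = { t.x, t.y, t.z, t.w };
        for (int k = 0; k < 4; ++k) {
            if (c[k] > best) {
                best = c[k];
                bestZ = 4 * s + k;
            }
        }
    }

    maxVal[pixel] = best;   // 2D matrix of maxima
    argMax[pixel] = bestZ;  // 2D matrix of the z index where the maximum occurs
}

// Hypothetical host-side wrapper: launches one thread per (x, y) pixel.
// All pointers are device pointers (dVolume would be the mapped buffer object).
void launchArgmaxOverZ(const float4* dVolume, int width, int height,
                       int numSlices, float* dMaxVal, int* dArgMax)
{
    dim3 block(16, 16);
    dim3 grid((width + block.x - 1) / block.x,
              (height + block.y - 1) / block.y);
    argmaxOverZ<<<grid, block>>>(dVolume, width, height, numSlices,
                                 dMaxVal, dArgMax);
}
```

Neighbouring threads read neighbouring float4 values within each slice, so the reads from the buffer should coalesce reasonably well. If the slices end up in the buffer in a different order or layout, only the index computation needs to change.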

Another method would be to not use CUDA at all, and just render all of the slices to the framebuffer using a max blend function.

benoit · April 18, 2007, 1:49am

Does CUDA allow using a framebuffer object?

Is there a way to map it with CUDA afterward?

Something like:

cudaGLMapBufferObject( (void**)&in_data, myFrameBuffer)

and then use in_data like any linear memory?

Simon_Green · April 18, 2007, 4:12pm

No, you can’t map framebuffer objects directly (in CUDA or OpenGL). The names are confusing, but FBOs are not buffer objects in the same way vertex buffer objects and pixel buffer objects are.

The only way to do this is to read from the FBO to a PBO using glReadPixels, and then map the PBO in CUDA.

benoit · May 11, 2007, 9:48pm

How can I do that?
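
The FBO-to-PBO read-back described above might be sketched as follows. This is a minimal sketch, not a tested recipe. It assumes a current OpenGL context with GLEW initialised, an FBO with a float RGBA color attachment already bound for reading, and a PBO that was created with at least width * height * 4 * sizeof(float) bytes and registered once with cudaGLRegisterBufferObject(). The helper name mapFramebufferToCuda is hypothetical. It uses the same legacy interop calls as the rest of this thread; later CUDA releases deprecate them in favor of cudaGraphicsGLRegisterBuffer() and cudaGraphicsMapResources().

```cuda
#include <GL/glew.h>
#include <cuda_runtime.h>
#include <cuda_gl_interop.h>

// Hypothetical helper: copies the currently bound framebuffer into a PBO
// and returns a device pointer to the PBO contents.
// Assumes: the FBO is bound for reading, and pbo is large enough and
// already registered with cudaGLRegisterBufferObject(pbo).
float4* mapFramebufferToCuda(GLuint pbo, int width, int height)
{
    // With a buffer bound to GL_PIXEL_PACK_BUFFER, glReadPixels writes into
    // that buffer, and its last argument is a byte offset, not a pointer.
    glBindBuffer(GL_PIXEL_PACK_BUFFER, pbo);
    glReadPixels(0, 0, width, height, GL_RGBA, GL_FLOAT, 0);
    glBindBuffer(GL_PIXEL_PACK_BUFFER, 0);

    // Map the PBO into the CUDA address space; the pointer can then be
    // passed to a kernel like any other linear device memory.
    float4* devPtr = 0;
    if (cudaGLMapBufferObject((void**)&devPtr, pbo) != cudaSuccess)
        return 0;
    return devPtr;  // valid until cudaGLUnmapBufferObject(pbo) is called
}
```

After the kernel has finished with the returned pointer, cudaGLUnmapBufferObject(pbo) has to be called before OpenGL touches the buffer again. To hold a whole volume, the same PBO could be made larger and each slice read at its own byte offset.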
