Grid is only 2d?

Willaim · March 13, 2007, 2:06pm

I tried to set up a grid:

dim3 dimGrid(nx,ny,nz);

When it went to execute the kernel, I get this message:
Cuda error: GPU Kernel execution failed in file ‘xxxx’ in line nn : invalid configuration argument.

If I use only 2 dimensions (dimGrid(nx,ny)) it works.

The docs seems to indicate that the grid can be up to 2 dimensions but doesn’t explicitly indicate that fact. The fact that it’s a dim3 seems to imply that it can handle 3 dimensions.

Is this a limitation in the beta 0.8 version or is this going to be a hard limit in the future?

Thanks

tachyon_john · March 13, 2007, 3:48pm

I’m curious about this as well, since I do a lot of processing of volumetric data…

Cheers,
John

prkipfer · March 13, 2007, 4:22pm

The grid is 2D only. But the thread block is 3D. So you can process volumes as a grid of rectangular blocks. This does get a bit tricky for addressing however if your volume has depth > 512… :huh:

Peter

Mark_Harris · March 13, 2007, 4:28pm

It’s a hardware limitation of G80.

If your depth is greater than one thread block can handle, you can either

a) tile the depth dimension across the other two dimensions – this is analagous to the “flat 3D textures” approach of “traditional” GPGPU.

B) Loop over depth in the kernel.

Mark

prkipfer · March 13, 2007, 4:37pm

[quote name=‘Mark Harris’ date=‘Mar 13 2007, 05:28 PM’]

a) tile the depth dimension across the other two dimensions – this is analagous to the “flat 3D textures” approach of “traditional” GPGPU.

B) Loop over depth in the kernel.

/quote]

My 2c:

Both options can be quite a pain wrt addressing when you need to push a say 3x3x3 convolution kernel through the volume. You should consider reformulating what one kernel invocation means, ie. (kernel = volume element to process) but rather (kernel = volume slice). That way you aggregate more work per kernel invocation which might give you the opportunity to share intermediate results. Always a good idea: check if the convolution kernel is separable.

Peter

tachyon_john · March 14, 2007, 12:06am

Yeah, I have’t started on it yet, but we’ve got various codes that require 3-D convolutions, thus the reason for some of my interest there. For the simpler code I’ve been working on to date, I’ve been processing one slice of the volume at a time, and that has worked well so far.

John

[quote name=‘prkipfer’ date=‘Mar 13 2007, 11:37 AM’]

Topic		Replies	Views
Whats wrong with this simple kernel call? Invalid Configuration Argument (with empty Kernel) CUDA Programming and Performance	16	9185	November 23, 2009
grid dimensionality kernels CUDA Programming and Performance	11	10599	May 29, 2008
Wrong gridDim ... causes: invalid configuration argument CUDA Programming and Performance	2	6903	November 26, 2009
Dimensions of a Block and a Grid CUDA Programming and Performance	7	13137	May 1, 2008
The 3rd dimension can't be greater than 1? CUDA Programming and Performance	8	7439	September 23, 2011
problem with 3 dimensional thread block with a three dimensional grid, the kernel was not executed CUDA Programming and Performance	6	6038	February 10, 2011
Why only 2D Grid and Why only 3D Thread Block ? Why only upto 2-D Grids and why only Upto 3-D Thread CUDA Programming and Performance	5	1755	August 3, 2010
hitting the grid size limitation CUDA Programming and Performance	5	1553	November 13, 2009
Grid dimensions CUDA Programming and Performance	6	5720	September 18, 2009
A question about 3D grid CUDA Programming and Performance	6	2430	December 16, 2011

Grid is only 2d?

Related topics