Cuda driver props cudaGetDeviceProperties

Veltz · June 27, 2010, 4:15pm

I run cudaGetDeviceProperties on the device I initialize, but it gives me really bogus limits on the maximum thread and block dimensions: something like 512, 512, 64 for maxThreadsDim and 65536, 65536, 1 for maxGridSize. Of course, kernel invocations stop working much sooner than that, with an invalidConfiguration error returned.

What am I doing wrong?

chippies · June 27, 2010, 5:06pm

Those values look right. These are the maximums for each dimension of a block and each dimension of a grid.

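Each block may contain at most maxThreadsPerBlock threads in total, that is, the product of the three block dimensions must not exceed it even when each dimension is within maxThreadsDim. Each thread also uses a certain number of registers, and there are only regsPerBlock registers available per block. These might be the sources of your error.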
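To make the checks above concrete, here is a minimal host-side sketch in plain C++. It is an illustration, not the runtime's actual validation logic: `Limits`, `Dim3` and `checkLaunch` are hypothetical names, the struct only mirrors the cudaDeviceProp fields mentioned in this thread, and the register test ignores the allocation granularity real hardware applies.

```cpp
#include <string>

// Hypothetical stand-in for the cudaDeviceProp fields discussed in this thread.
// In real code these values would be copied from cudaGetDeviceProperties.
struct Limits {
    int maxThreadsPerBlock;
    int maxThreadsDim[3];
    int maxGridSize[3];
    int regsPerBlock;
};

// Hypothetical plain-C++ replacement for dim3, so the sketch needs no CUDA headers.
struct Dim3 { unsigned x, y, z; };

// Returns "ok" or a short description of the first limit that is violated.
std::string checkLaunch(const Limits& lim, Dim3 grid, Dim3 block, int regsPerThread) {
    const unsigned b[3] = {block.x, block.y, block.z};
    const unsigned g[3] = {grid.x, grid.y, grid.z};

    // Per-dimension limits: these are what maxThreadsDim and maxGridSize report.
    for (int i = 0; i < 3; ++i) {
        if (b[i] == 0 || b[i] > static_cast<unsigned>(lim.maxThreadsDim[i]))
            return "block dimension out of range";
        if (g[i] == 0 || g[i] > static_cast<unsigned>(lim.maxGridSize[i]))
            return "grid dimension out of range";
    }

    // Total threads per block: the product of the dimensions has its own cap.
    const unsigned long long threads =
        static_cast<unsigned long long>(b[0]) * b[1] * b[2];
    if (threads > static_cast<unsigned long long>(lim.maxThreadsPerBlock))
        return "too many threads per block";

    // Rough register estimate (assumption: no allocation granularity).
    if (regsPerThread > 0 &&
        threads * static_cast<unsigned long long>(regsPerThread) >
            static_cast<unsigned long long>(lim.regsPerBlock))
        return "too many registers per block";

    return "ok";
}
```

With illustrative limits of 512 threads per block, dimensions of 512, 512, 64 and 8192 registers per block (assumed values, not read from any particular card), a 512 x 512 x 64 block passes every per-dimension check but fails the total-thread check, which matches the behaviour described in the question.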

I don’t know of any additional limits on the number of blocks in a grid, other than the dimension limits.

Veltz · June 28, 2010, 9:02am

So should I instead take the register count and divide it by the number of registers used by my kernel? If so, how can I tell how many registers a given kernel uses?

chippies · June 28, 2010, 11:56am

The CUDA Occupancy Calculator can help in this regard, although I see the spreadsheet hasn’t been updated for Fermi cards yet. That is easy to work around, though: you just need to enter the physical limits for Fermi yourself.

You can also pass --ptxas-options=-v to nvcc, which makes the PTX assembler print the resource usage of each kernel, including the final register count and shared memory.
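Once the register count per thread is known, the division asked about earlier in the thread can be sketched as below. This is a rough estimate under stated assumptions: `maxBlockSizeForRegisters` is a hypothetical helper, the warp size of 32 is assumed, and register allocation granularity is ignored, so the real limit may be somewhat lower.

```cpp
#include <algorithm>

// Hypothetical helper: estimates the largest block size (in threads) that fits
// within regsPerBlock, capped at maxThreadsPerBlock and rounded down to a whole
// number of warps. regsPerThread would be the figure reported by ptxas.
int maxBlockSizeForRegisters(int regsPerBlock, int regsPerThread,
                             int maxThreadsPerBlock, int warpSize = 32) {
    if (regsPerThread <= 0 || warpSize <= 0) return 0;  // no sensible answer

    // Threads that fit if registers were the only constraint.
    const int byRegisters = regsPerBlock / regsPerThread;

    // The hard cap on threads per block still applies.
    const int limit = std::min(byRegisters, maxThreadsPerBlock);

    // Round down to a multiple of the warp size.
    return (limit / warpSize) * warpSize;
}
```

For example, with an assumed 8192 registers per block and a kernel using 20 registers per thread, 8192 / 20 gives 409 threads, which rounds down to 384. A kernel using 10 registers per thread would be bounded by maxThreadsPerBlock instead.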
