kernel argument limitations?

sundog314 · June 2, 2009, 7:43pm

Hi all-

I believe (although i cannot confirm) somewhere, someone mentioned that the arguments to a kernel are limited to 256 bytes. Can someone confirm and/or explain this? Does this mean that the total size of the data in my argument list cannot exceed 256 bytes? If so, this seems a little (ok, a lot) restrictive for an API that is focused on massively data-parallel applications.

Oh, another question: Does anyone have any strategies for ensuring/confirming that a kernel is launched within device constraints, since there’s no segfault or fpe-trap support? Thanks for any and all comments!

tmurray · June 2, 2009, 7:46pm

Why not just pass a pointer to a structure that contains all of your data…?

sundog314 · June 2, 2009, 7:52pm

ahhhhhhh…a single pointer that points to only the first element of the entire data array? So even if my data array is > 256 bytes, the passed pointer is nowhere near that limit?

tmurray · June 2, 2009, 7:58pm

sure, it’s just that the actual arguments passed to a kernel (the size of every piece of data within your kernel<<<x,y>>>(data, data2…) ) must be less than 256 bytes. It’s not really a big deal at all.

seibert · June 2, 2009, 8:01pm

Correct. You allocate your arrays on the device with cudaMalloc (up to the free memory on your card), load your data into those memory blocks with cudaMemcpy, and pass just the pointers (4 or 8 bytes depending on whether your OS is 32 or 64 bit) as parameters to your kernel.

sundog314 · June 2, 2009, 8:07pm

Nice. Thank you so much tmurray and seibert for clarifying this for me. I was afraid I wouldn’t be able to kernel-ize some code of mine because my data is so godawful huge.

So, just to beat a dead horse: if i passed 65 float* pointers to a kernel, that would most definitely crap out, right?

tmurray · June 2, 2009, 8:41pm

on a 32-bit machine, yes, and on a 64-bit machine, 33 would crap out.

Topic		Replies	Views
is there any limit on # of arguments in cuda kernel? CUDA Programming and Performance	12	17996	March 19, 2010
Max size of CUDA arguments CUDA Programming and Performance	2	3052	May 23, 2017
Problem on psaaing memory from host to device CUDA Programming and Performance	3	894	April 23, 2012
Formal parameter space overflowed kernel launch error CUDA Programming and Performance	18	18643	May 2, 2010
max number of arguments for kernel function Legacy PGI Compilers	2	8053	April 20, 2013
bydefault function arguments resides in shared memory? CUDA Programming and Performance	9	4314	May 11, 2009
Passing variables into kernel over 256 bytes CUDA Programming and Performance	5	9683	July 12, 2011
Parameters passed to a CUDA kernel exceed 256 bytes. CUDA Programming and Performance	13	7096	September 21, 2009
Multiple array inputs to Kernel Formal parameter space overflowed in function Error CUDA Programming and Performance	1	1328	November 3, 2008
how many arguments are available in one kernel? CUDA Programming and Performance	8	1484	May 23, 2010

kernel argument limitations?

Related topics