How does the CUDA driver set the stack size on kernel invocation?

kaigai · May 21, 2019, 7:41am

The CUDA Driver API documentation describes cuCtxSetLimit as follows.

https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__CTX.html#group__CUDA__CTX_1g0651954dfb9788173e60a9af7201e65a

CU_LIMIT_STACK_SIZE controls the stack size in bytes of each GPU thread. Note that the CUDA driver will set the limit to the maximum of value and what the kernel function requires.

As I read it, the CUDA driver automatically provides the stack size the kernel function requires, regardless of the configured per-thread stack size limit.

On the other hand, I got CUDA_ERROR_LAUNCH_FAILED due to insufficient stack space for a GPU kernel that allocates at least char buf[2048] on the stack. The problem went away after calling cuCtxSetLimit(CU_LIMIT_STACK_SIZE, 6000).
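For reference, a minimal sketch of how the limit can be read and raised through the driver API. It assumes device 0 and the device's primary context; the `check` helper is hypothetical and only there to keep the error handling short. It is an illustration of the calls involved, not the code that produced the error above.

```cuda
// Host-only sketch using the CUDA Driver API.
// One possible build line (an assumption about the toolchain): nvcc limit.cu -lcuda
#include <cuda.h>
#include <cstdio>
#include <cstdlib>

// Hypothetical helper (not part of CUDA): abort with the driver's error name.
static void check(CUresult rc, const char *what)
{
    if (rc != CUDA_SUCCESS) {
        const char *name = nullptr;
        cuGetErrorName(rc, &name);
        std::fprintf(stderr, "%s failed: %s\n", what, name ? name : "unknown");
        std::exit(EXIT_FAILURE);
    }
}

int main()
{
    CUdevice dev;
    CUcontext ctx;
    size_t stack_bytes = 0;

    check(cuInit(0), "cuInit");
    check(cuDeviceGet(&dev, 0), "cuDeviceGet");              // assumes device 0
    check(cuDevicePrimaryCtxRetain(&ctx, dev), "cuDevicePrimaryCtxRetain");
    check(cuCtxSetCurrent(ctx), "cuCtxSetCurrent");

    // Per-thread stack size currently configured for this context.
    check(cuCtxGetLimit(&stack_bytes, CU_LIMIT_STACK_SIZE), "cuCtxGetLimit");
    std::printf("stack size before: %zu bytes per thread\n", stack_bytes);

    // Request 6000 bytes per thread, the value used in this post.
    check(cuCtxSetLimit(CU_LIMIT_STACK_SIZE, 6000), "cuCtxSetLimit");

    // Read the limit back to see what the driver actually applied.
    check(cuCtxGetLimit(&stack_bytes, CU_LIMIT_STACK_SIZE), "cuCtxGetLimit");
    std::printf("stack size after:  %zu bytes per thread\n", stack_bytes);

    check(cuDevicePrimaryCtxRelease(dev), "cuDevicePrimaryCtxRelease");
    return 0;
}
```

Reading the value back with cuCtxGetLimit shows the limit that is in effect for the context, which need not be exactly the value requested.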

I have two questions.

How should I understand the description above? It looks to me as though the configured stack limit is enforced even when the kernel function requires more than that limit.

How can I find out from the driver what the kernel function requires? I expected cuFuncGetAttribute() with CU_FUNC_ATTRIBUTE_LOCAL_SIZE_BYTES to report it, but it returns 0 for the kernel that places a 2048-byte buffer on the stack.
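The query in the second question can be sketched roughly as below. The module path and kernel name are hypothetical and taken from the command line, and the `check` helper is again only an illustration. The sketch shows how the attribute is read; it does not explain why 0 is returned.

```cuda
// Host-only sketch using the CUDA Driver API.
// Usage (hypothetical file and kernel names): ./query kernel.cubin my_kernel
#include <cuda.h>
#include <cstdio>
#include <cstdlib>

// Hypothetical helper (not part of CUDA): abort with the driver's error name.
static void check(CUresult rc, const char *what)
{
    if (rc != CUDA_SUCCESS) {
        const char *name = nullptr;
        cuGetErrorName(rc, &name);
        std::fprintf(stderr, "%s failed: %s\n", what, name ? name : "unknown");
        std::exit(EXIT_FAILURE);
    }
}

int main(int argc, char **argv)
{
    if (argc != 3) {
        std::fprintf(stderr, "usage: %s <module file> <kernel name>\n", argv[0]);
        return EXIT_FAILURE;
    }

    CUdevice dev;
    CUcontext ctx;
    CUmodule mod;
    CUfunction fn;
    int local_bytes = 0;

    check(cuInit(0), "cuInit");
    check(cuDeviceGet(&dev, 0), "cuDeviceGet");              // assumes device 0
    check(cuDevicePrimaryCtxRetain(&ctx, dev), "cuDevicePrimaryCtxRetain");
    check(cuCtxSetCurrent(ctx), "cuCtxSetCurrent");

    // Load a compiled module (cubin or PTX) and look up the kernel by name.
    // The name must match the symbol in the module, so a kernel that is not
    // declared extern "C" has to be given by its mangled name.
    check(cuModuleLoad(&mod, argv[1]), "cuModuleLoad");
    check(cuModuleGetFunction(&fn, mod, argv[2]), "cuModuleGetFunction");

    // Local memory used by each thread of this function, as reported by the driver.
    check(cuFuncGetAttribute(&local_bytes, CU_FUNC_ATTRIBUTE_LOCAL_SIZE_BYTES, fn),
          "cuFuncGetAttribute");
    std::printf("%s: local size = %d bytes per thread\n", argv[2], local_bytes);

    check(cuModuleUnload(mod), "cuModuleUnload");
    check(cuDevicePrimaryCtxRelease(dev), "cuDevicePrimaryCtxRelease");
    return 0;
}
```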

Best regards,
