cudaFuncSetAttribute and dynamic parallelism

garyfpga · January 9, 2023, 2:05pm

Hi,

I am trying to use more than 64KB of shared memory. I can do so with cudaFuncSetAttribute() and set the cudaFuncAttributeMaxDynamicSharedMemorySize. It works well when I launch my kernel from host. However, when I am trying to launch the kernel using dynamic parallelism, the cudaFuncSetAttribute() doesn’t seem to work. Is anybody here had experience similar problem?

Robert_Crovella · January 9, 2023, 11:58pm

A similar request was made here. The OP there filed a bug with NVIDIA (3503453), and that bug is in the late stages of being finalized. The development work is done, and based on my read of the bug, it should become available in CUDA 12.1 (if it is not already available in 12.0. I haven’t tested 12.0, but based on what I see in the bug it doesn’t appear to be in 12.0). There is no guarantee that it will be in 12.1, that is just my best guess based on what I see so far.

garyfpga · January 10, 2023, 1:44am

Thanks Robert, good to know that it is actually not working at the moment.

Topic		Replies	Views
Dynamic SM with Dynamic Parallelism CUDA Programming and Performance	12	1522	September 8, 2025
Is cudaFuncAttributeMaxDynamicSharedMemorySize a supported attriburw? Legacy PGI Compilers	8	2926	June 16, 2020
Default value of max dynamic shared memory CUDA Programming and Performance cuda	8	176	December 23, 2024
NCU dynamic shared memory display question Nsight Compute	2	503	April 24, 2024
Template function set cudaFuncAttributeMaxDynamicSharedMemorySize error CUDA Programming and Performance	4	449	February 19, 2024
Can't launch 1 block with 1024 threads when maximizing shared memory using cudaFuncSetAttribute CUDA Programming and Performance	2	371	August 11, 2023
Dynamic shared memory can not allocate problem(on 3050PC) CUDA Programming and Performance	2	463	July 20, 2022
cuFuncSetAttribute locks until H2D/D2H async memcpy finishes CUDA Programming and Performance cuda , performance	4	90	February 25, 2025
Max shared memory CUDA Programming and Performance	0	1291	July 28, 2020
C1060 and Shared Memory size CUDA Programming and Performance	2	1148	April 10, 2011

cudaFuncSetAttribute and dynamic parallelism

Related topics