Hi there,
Nsight compute will report “Launch Statistics” for a specified kernel, which contains a “Shared Memory Configuration Size” field (launch__shared_mem_config_size).
e.g. (see link)
$ ncu python t1.py
...
Section: Launch Statistics
---------------------------------------------------------------------- --------------- ------------------------------
Block Size 64
Grid Size 64
Registers Per Thread register/thread 48
Shared Memory Configuration Size byte 0
...
What’s the meaning of this field? Does it calculate from cudaFuncAttributes::preferredShmemCarveout and cudaDevAttrMaxSharedMemoryPerMultiprocessor?