Hello, I am trying to understand the kernel resource usage for my application. I use -Xptxas=-v
to output kernel resource usage during compilation, but I am not sure I understand the numbers entirely. For instance, for output along the lines of the sample below, is smem
reported per block or per SM? What about cmem? Are the spills per thread?
Is there a guide online that describes these fields?
64 bytes stack frame, 128 bytes spill stores, 164 bytes spill loads
ptxas info : Used 64 registers, 7136 bytes smem, 432 bytes cmem[0], 64 bytes cmem[2]