I’m trying to identify the source of some bank conflicts in my code, and for that, I’m relying on NVVP. It accuses a low shared memory efficiency (about 20% for this kernel).
When I go to the unguided analysis option, in “Shared Memory Access Pattern” I can see many warnings with the ratio of Load/Store transactions per access much higher than the ideal but I can’t see the point in the source code where it happens. Above that, I see a box that says “No Source File Mapping”, which suggests that I should recompile with -lineinfo to enable source-file mappings. I’m already doing that.
Does anyone know how to fix this source-mapping issue?
I have a Tesla V100 and tested CUDA 10.2 and 11.3.