Problems about Profiling Shared Memory Bank Conflicts using nsight-compute

tom.hx · January 24, 2022, 5:59am

I have Write a Cuda Kernel, with carefully shared memory arrangement，and it should be bank conflict free if my understanding is correct.

When using the Nsight Compute：

In the source code page， I checked the ‘L1 Wavefronts Shared’ and ‘L1 Wavefronts Shared Ideal’ ， values are all same in these 2 columns. Does this mean the source code achieved bank conflict free?

1879×991 317 KB
However, in the details page, It shows there exist bank conflicts when loading and storing shared memory.

2845×166 53.9 KB
And something more interesting， If I emit the Kernel with only one Block, that is, set GridDim.x, GridDim.y, GridDim.z to 1, it shows there are NO Shared load/store bank conflicts.

So, Is there REALLY exist bank conflicts or not? And Why there exists the gap in nsight compute ? How to narrow down the issue?

felix_dt · January 24, 2022, 3:12pm

If Nsight Compute is showing bank conflicts in the Memory Workload Analysis tables, there are truly conflicts in your kernel. The Source page metrics you referred can help in identifying the source of such conflicts, but they are not guaranteed to show all of them (i.e. there is no strict correlation in both directions).

You can find more info related to bank conflict analysis in Nsight Compute in this thread: Shared memory bank conflicts and nsight metric. Note that the “Memory L1 (Ideal) Transactions Shared” have since been renamed to “L1 Wavefronts Shared (Ideal)” in newer versions of the tool. Also, as hinted to in this reply, we are looking to make it easier to determine the source of any bank conflicts in future versions of the tool.

tom.hx · January 25, 2022, 7:19am

I see，Thank you very much.

Topic		Replies	Views
Is there any way to find out the location in cuda code that cause shared memory bank conflicts? CUDA Programming and Performance	6	1239	January 21, 2022
Shared memory bank conflicts and nsight metric CUDA Programming and Performance	15	5892	October 19, 2024
Analyzing bank conflicts with Nsight compute CUDA Programming and Performance	1	2349	August 14, 2020
Why shared memory bank conflict number is not equal? Nsight Compute	2	648	July 15, 2020
Analyzing the bank conflicts in my kernel Nsight Compute	2	60	September 30, 2025
About bank conflict of shared_mem CUDA Programming and Performance	2	497	July 25, 2023
Shared memory bank conflict CUDA Programming and Performance	4	4207	March 27, 2008
Shared memory bank conflicts CUDA Programming and Performance	1	2420	August 24, 2009
The increase of the shared memory size leads to the bankconflict (from 9 KB shared memory) Nsight Compute	5	573	July 14, 2023
Read n-way bank conflict in Nsight Compute Nsight Compute	5	1060	January 12, 2023

Problems about Profiling Shared Memory Bank Conflicts using nsight-compute

Related topics