Questions about “L1 Conflicts Shared N-way” & metrics related to “Excessive”
|
|
0
|
252
|
March 29, 2024
|
Ncu-ui is not working
|
|
5
|
311
|
April 2, 2024
|
Profiling CUDA Graphs With Conditional Nodes
|
|
2
|
303
|
April 9, 2024
|
Roofline model's different chart's understanding
|
|
0
|
283
|
March 24, 2024
|
Nsight compute system trace issue
|
|
4
|
242
|
March 22, 2024
|
Total FLOPS for the HGMMA instruction
|
|
4
|
368
|
March 21, 2024
|
Contraditory register count report when calling a non-inlined function
|
|
5
|
1121
|
March 20, 2024
|
How to setup "function cache configuration"?
|
|
1
|
253
|
March 19, 2024
|
How may I dump the full/relative path of the source files corresponding to the source page?
|
|
6
|
252
|
March 13, 2024
|
How to get Nsight Compute timeline of tensor cores and cuda cores?
|
|
5
|
390
|
April 16, 2024
|
Nsight Compute not reporting/profiling all kernels profiled by Nsight Systems
|
|
9
|
319
|
March 27, 2024
|
"Failed to prepare kernel for profiling" in Nsight Compute2024.1.0 on Windows11
|
|
3
|
260
|
March 27, 2024
|
How can I profile both kernel and cuda APIs hardware usage and application total duration
|
|
5
|
329
|
March 27, 2024
|
Fail to open Nsight Compute 2024.1.0(Ubuntu)
|
|
8
|
551
|
March 12, 2024
|
Failed to access the following 9 metrics
|
|
2
|
315
|
March 27, 2024
|
The profiler returned an error code: 3221226505 (0xc0000409)
|
|
5
|
467
|
March 12, 2024
|
How many FLOPs does one tensor_op_hmma instruction do?
|
|
0
|
302
|
March 7, 2024
|
When using Nsight Compute, are more than two kernels profiled separately or concurrently?
|
|
2
|
235
|
March 5, 2024
|
Why there are two ridge points in roofline model?
|
|
3
|
306
|
March 19, 2024
|
==ERROR== Could not deploy stock section files toc
|
|
6
|
2025
|
March 12, 2024
|
Question about Nsight Compute's application range replay support data collection for multi-node, multi-GPU setups under NCCL
|
|
3
|
381
|
March 14, 2024
|
Nsight Compute unable to launch trtexec with plugins lib
|
|
3
|
276
|
February 27, 2024
|
Nsight compute hanging issue
|
|
7
|
365
|
March 11, 2024
|
Gpu__cycles_active vs. sm__cycles_active.max
|
|
3
|
316
|
February 26, 2024
|
Mismatch in L2 load miss and Device Memory loads
|
|
2
|
304
|
March 20, 2024
|
Metric references and description
|
|
7
|
3127
|
March 2, 2024
|
Can you use nsight to see tensor core occupancy?
|
|
4
|
575
|
March 23, 2024
|
NSight Compute vs. NSight Systems vs. PyTorch Profiler
|
|
2
|
648
|
March 23, 2024
|
Could nsight compute really trace the OpenAI-triton code?
|
|
5
|
700
|
March 7, 2024
|
How to get the bytes read/write sum about Memory access between GPUs?
|
|
7
|
708
|
March 20, 2024
|