| Which foundational libraries do the high-frequency GPU metrics in Nsight Systems come from? | 5 | 508 | March 14, 2024 |
| Understanding CPU and GPU Behavior with NVIDIA Visual Profiler | 2 | 563 | March 8, 2024 |
| Nsight Profiler Hangs on OpenMP Initialization | 9 | 1338 | February 29, 2024 |
| Nsight Compute unable to launch trtexec with plugins lib | 3 | 641 | February 27, 2024 |
| What is the best tool for general C/C++ sample profiling? | 10 | 1316 | February 1, 2024 |
| How to get speed of light with ncu-cli | 8 | 1102 | March 23, 2024 |
| Global memory usage profiling and tracking | 9 | 1963 | February 1, 2024 |
| Confusion about the (d/f/h)(mul/add/fma) count in the nsight compute | 6 | 1707 | January 16, 2024 |
| Ncu does not detect kernels, ==ERROR== The application returned an error code (11) | 6 | 2166 | December 13, 2023 |
| What is the meaning of Latency: ←2.006 ms here? | 1 | 605 | December 12, 2023 |
| FLOPS Profiler | 2 | 757 | November 17, 2023 |
| NVTX Error | 1 | 617 | November 7, 2023 |
| How to profile how many times an instruction is executed or how much duration it takes? | 2 | 739 | January 12, 2024 |
| Nsight systems failed to load report from .nsys-rep | 2 | 1685 | November 1, 2023 |
| How does nsys tool profile cuda libraries, like cublas, cudnn, etc.? | 12 | 1309 | October 25, 2023 |
| Cuda stream stalls due to memcpyAsync --- even when memory copy performing is idle? | 2 | 663 | October 7, 2023 |
| Profiling elapsed time and energy around extreme clock frequencies | 0 | 724 | September 19, 2023 |
| Applying Timeline View Filter to Stats System View | 3 | 886 | September 25, 2023 |
| Nsight compute failed to profile with nvtx ranges in pytorch | 4 | 1525 | September 19, 2023 |
| Jetson orin performance profiling | 2 | 1123 | August 30, 2023 |
| Higher L2 cache hit rate but larger device memory transfer size | 1 | 834 | August 13, 2023 |
| How to make an executable file run on NVIDIA gpus | 0 | 668 | July 27, 2023 |
| How can I get the analysis like nvprof in nsys? | 3 | 621 | July 21, 2023 |
| How to make an executable file run on NVIDIA gpus | 0 | 543 | July 19, 2023 |
| Is there any way to solely collect the total duration of the CUDA kernels within each nvtx range | 1 | 620 | July 11, 2023 |
| Weird behavior of cuda event | 3 | 762 | June 23, 2023 |
| [Question] NSys CUDA Profiler - Page Migration and Number of CPU/GPU page faults | 1 | 1071 | June 23, 2023 |
| Max block size limiting factor | 3 | 747 | July 5, 2023 |
| Segmented memory copy to/from device | 4 | 1038 | June 20, 2023 |
| Cannot get tensor core metrics with latest NSight system | 4 | 1511 | June 20, 2023 |