NCU too slow and incomplete

hys4qm · April 4, 2025, 9:50pm

I need to measure DRAM utilization, GPU utilization per kernel, and other stats. I'm using this command:

sudo -E CUDA_VISIBLE_DEVICES=0 ncu --set basic --launch-count 100 --force-overwrite -o ncu_8b_Q2_k --section-folder="/usr/local/cuda-12.8/nsight-compute-2025.1.1/sections/" ./llama-cli -m <model_path> -ngl 99 --prompt <my_prompt> -no-cnv -c 512 -n 50

If I don't set the launch count, it takes forever to run. Previously I set --metrics sm__throughput.avg.pct_of_peak_sustained_elapsed,dram__throughput.avg.pct_of_peak_sustained_elapsed, but in both cases NVIDIA Nsight Compute doesn't show any useful info. Where am I supposed to get the metric values?

veraj · April 7, 2025, 7:02am

Hi, @hys4qm

Did you get a report generated in the end, after setting the launch count?
If not, did you get any error? What's the output of the command?

hys4qm · April 10, 2025, 2:39am

If I forcefully set the launch count, the generated report is as shown; otherwise it keeps running for hours.

veraj · April 10, 2025, 2:51am

You can go to the “Raw” page and filter by the metric name. If the metric has been collected, it should be found there.
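The same per-kernel lookup can also be scripted outside the UI. The following is a minimal sketch, not an official workflow: it assumes the report was first exported to CSV with something like `ncu --import ncu_8b_Q2_k.ncu-rep --page raw --csv > raw.csv`, and that the exported table has a "Kernel Name" column, one column per collected metric, and a units row directly under the header. The helper name `per_kernel_metrics` is made up for illustration.

```python
import csv
import io

# The two metrics asked about in this thread.
METRICS = [
    "sm__throughput.avg.pct_of_peak_sustained_elapsed",
    "dram__throughput.avg.pct_of_peak_sustained_elapsed",
]


def per_kernel_metrics(csv_text, metrics=METRICS):
    """Return a list of (kernel_name, {metric_name: value_string}) tuples.

    Hypothetical helper. Assumed layout of the raw-page CSV export: a header
    row containing "Kernel Name" plus one column per metric, followed by a
    units row whose "Kernel Name" cell is empty, then one row per kernel launch.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    # Skip any application or profiler log lines that precede the table.
    header_idx = next(i for i, r in enumerate(rows) if "Kernel Name" in r)
    header = rows[header_idx]
    name_col = header.index("Kernel Name")
    # Only keep metrics that were actually collected into the report.
    cols = {m: header.index(m) for m in metrics if m in header}

    results = []
    for row in rows[header_idx + 1:]:
        # Drops the units row (empty kernel name) and malformed lines.
        if len(row) != len(header) or not row[name_col]:
            continue
        results.append((row[name_col], {m: row[c] for m, c in cols.items()}))
    return results


if __name__ == "__main__":
    with open("raw.csv", newline="") as f:
        for kernel, values in per_kernel_metrics(f.read()):
            print(kernel, values)
```

If a metric is missing from the output dictionaries, it was not collected into the report, which matches the advice above about checking the "Raw" page.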

