Instruction-Level Profiling via nvprof?

tbenson · January 21, 2016, 5:48pm

Has anyone had any luck with instruction-level profiling in CUDA 7.5? An example is outlined here:

http://devblogs.nvidia.com/parallelforall/cuda-7-5-pinpoint-performance-problems-instruction-level-profiling/

but I have yet to get it to work. I get the “Kernel Profile - PC Sampling” report in nvvp with a kernel-level sample count and the sample distribution pie chart, but there is no section below that listing source files or functions. There is an icon next to the minimize/maximize buttons for the results window that presumably allows you to add source file mappings, but it does not work: clicking it pops up a modal “Source Files Mapping” window, and nothing happens when you click “Add Mapping”. I use out-of-tree builds via CMake, but I have also tried copying the executable into the source directory and running nvvp directly from there, with no luck. I’m using Linux, where nvvp has never seemed to work well for me, so perhaps it works better on Windows.

I typically use nvprof, but I cannot find any associated flags to generate instruction-level profiles. Does anyone know if this functionality is somehow exposed in nvprof? I looked through the available metrics via --query-metrics, but I don’t see anything related to program counter sampling.

I am using a GTX 980 Ti (CC 5.2) and built the code for sm_52.
