Hi all!
After reading this very interesting post https://devblogs.nvidia.com/parallelforall/cuda-7-5-pinpoint-performance-problems-instruction-level-profiling/ I thought I’d give the visual profiler nvvp a try.
I have no issue running nvvp, but when I try instruction-level profiling, I only get a view of the disassembled kernel. I’d like to see the source code alongside it.
A box in the Results section says:
"The source-assembly viewer could not be shown because source-file mappings are missing from the kernel. You can enable source-file mappings by using the -lineinfo flag when compiling the kernels"
I’ve tried adding -lineinfo to the compile and link commands, i.e. my makefile executes:
nvcc -x cu -O3 -std=c++11 -lineinfo -dc main.cpp
nvcc -x cu -O3 -std=c++11 -lineinfo -dc DG.cpp
nvcc -x cu -O3 -std=c++11 -lineinfo -dc Problem_functions.cpp
nvcc main.o DG.o Problem_functions.o -lineinfo -o DG1D
but it doesn’t help.
I have CUDA 9.0. I’m on Linux, and I launch the profiler with “nvvp ./DG1D”.
Any ideas?