Hello,
I’m trying to profile a CUDA kernel running on a remote machine using Nsight Compute.
I installed CUDA 10 on both local (where I run Nsight Compute) and target machine (where I run the kernel on a Tesla V100 GPU).
I’m able to profile the kernel using nvprof
but if I try to use nv-nsight-cu
I get the error: kernel profiling is not supported on this device
despite the command nv-nsight-cu-cli --list-chips
returns the architecture gv100
which is the one of the Tesla V100.
I tried to build the program with and without the --arch
flag but nothing changes.
What can I do? Thanks :-)