==PROF== Disconnected from process 89889
==WARNING== No kernels were profiled.
at the end of my application. I’m on Ubuntu 20.04 using Cuda version 11.1. I am certain that my GPU is being used (nvidia-smi reports activity correctly). I see lots of similar topics on these forums, but none of the fixes within help.
Additional points:
nsys profile ./application args seems to work fine.
I am using a V100
I cannot use Visual Studio but I’d like to be able to get specific GPU performance metrics for my application
nvprof similarly tells me that “No kernels were profiled”
Can you share some more information about how your application executes? For example, does your application fork child applications with CUDA kernels in them? Is that the reason you’re using --target-processes all? Is there hand written CUDA in there, or some 3rd party library or a higher level framework like PyTorch?
Do you have access to the GUI? If so, you could launch an interactive profile to step through the APIs and see if you encounter a cuda kernel.
My application does not fork anything. I just added --target-processes all to make sure that I capture everything. There is a mix of hand-written CUDA, cublas, and even cusolver code in there. I have (crappy x11) access to the nvprof GUI, and I have seen that the kernels are executing as expected.
You mentioned x11, what’s the environment you’re using? Is this all happening locally on an Ubuntu box or are you ssh into the Ubuntu machine where the app and profile are running? Are there multiple devices (GPUs) on the target machine? Can you share the output of ‘nvidia-smi’?