Hi everyone,
I am puzzled as to why I cannot get Nsight Systems to work properly. It’s my first time using the profiler and posting here, so excuse me if the question turns out to be banal. I would be very glad if I could get some help.
I am trying to profile a Julia application I wrote using CUDA. I get the following error:
julia> CUDA.@profile #'some expression here using CUDA.jl'
[ Info: Running under Nsight Systems, CUDA.@profile will automatically start the profiler
WARNING: CUDA tracing is required for cudaProfilerStart/Stop API support. Turning it on by default.
There are no active sessions.
ERROR: failed process: Process(/usr/local/bin/nsys stop, ProcessExited(1)) [1]
Stacktrace:...
caused by: Failed to compile PTX code (ptxas received signal 11)
If you think this is a bug, please file an issue and attach /tmp/jl_DLp64D.ptx
Stacktrace: ...
I’ve left out the stack traces as these are specific to Julia. Can post them if needed.
Upon launching using profile
command:
~$ nsys profile julia
End of file
I can get the profile session to start using the UI, but no CUDA events are recorded: “No CUDA events collected. Does the process use CUDA?”
I have a GeForce GTX 1050 Ti GPU.
This is the output of uname -a
~$ uname -a
Linux copenhagen 5.13.0-7620-generic #20~1634827117~21.04~874b071-Ubuntu SMP Fri Oct 29 15:06:55 UTC x86_64 x86_64 x86_64 GNU/Linux
Output of cat /proc/sys/kernel/perf_event_paranoid
~$ cat /proc/sys/kernel/perf_event_paranoid
1
This is the output of nvidia-smi
~$ nvidia-smi
Mon Nov 22 08:51:19 2021
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.86 Driver Version: 470.86 CUDA Version: 11.4 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... Off | 00000000:01:00.0 On | N/A |
| 30% 39C P0 N/A / 75W | 965MiB / 4034MiB | 8% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
Output of /usr/local/bin/nsys --version
~$ /usr/local/bin/nsys --version
NVIDIA Nsight Systems version 2021.5.1.77-4a17e7d