Hi,
I am having problems with profiling any application on Ubuntu 20.04 with WSL2 host. Here is an example:
nvprof ./vectorAdd
[Vector addition of 50000 elements]
==22918== NVPROF is profiling process 22918, command: ./vectorAdd
==22918== Warning: Unified Memory Profiling is not supported on the current configuration because a pair of devices without peer-to-peer support is detected on this multi-GPU setup. When peer mappings are not available, system falls back to using zero-copy memory. It can cause kernels, which access unified memory, to run slower. More details can be found at: http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#um-managed-memory
Copy input data from the host memory to the CUDA device
CUDA kernel launch with 196 blocks of 256 threads
==22918== Error: Internal profiling error 4075:999.
Copy output data from the CUDA device to the host memory
Test PASSED
Done
======== Error: CUDA profiling error.
Is there something I can do to fix the issue?
Thank you.