Thanks for provoking this discussion. I installed 2024.3 version of ncu
also with in the docker image. Now I am able to profile the kernels with ncu
inside the docker.
The purpose of this task to get to know the input matrix sizes passed to the kernel. I have posted a question here.
Can you help me if we can get input matrix size info with ncu
?