I want to measure CUDA occupancy on a Quadro RTX 6000 GPU.
However, the RTX 6000 has 576 Tensor Cores in addition to its 4608 CUDA cores (spread across 72 streaming multiprocessors). How can I get a CUDA occupancy figure that takes the Tensor Cores into account?
Hi @ho126jin
Please note that this forum branch is dedicated to CUDA GDB support. Your question might be better suited to a different forum branch: CUDA Programming and Performance - NVIDIA Developer Forums
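As a rough starting point while you repost: occupancy is normally defined per SM as the ratio of resident warps to the maximum number of warps the SM can hold, so Tensor Cores do not appear in the formula; they are execution units that resident warps issue instructions to. The sketch below only does that arithmetic. The helper name `theoretical_occupancy` is made up for illustration, and the value 1024 for maximum threads per SM is an assumption for this Turing-class GPU that you should check against `cudaDeviceProp::maxThreadsPerMultiProcessor`. The blocks-per-SM input would typically come from `cudaOccupancyMaxActiveBlocksPerMultiprocessor` in the CUDA runtime.

```cpp
// Theoretical occupancy = resident warps per SM / maximum warps per SM.
// Hypothetical helper for illustration; it is not part of the CUDA toolkit.
double theoretical_occupancy(int blocks_per_sm,      // resident blocks per SM for the kernel
                             int threads_per_block,  // launch block size
                             int max_threads_per_sm, // device limit (assumed 1024 here)
                             int warp_size = 32) {
    // A partially filled warp still occupies a full warp slot, so round up.
    int warps_per_block = (threads_per_block + warp_size - 1) / warp_size;
    int active_warps = blocks_per_sm * warps_per_block;
    int max_warps = max_threads_per_sm / warp_size;
    return static_cast<double>(active_warps) / max_warps;
}
```

For example, `theoretical_occupancy(2, 256, 1024)` corresponds to 16 resident warps out of 32 warp slots. Achieved occupancy at run time can be lower than this theoretical figure, so it is usually measured with a profiler rather than computed.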