How can I get GPU SM active and SM occupancy on nvidia A10?

I wanna profile GPU SM active and SM occupancy to view acutal load of GPU. As we know, nsight or nvprofil can get this metrics, but this tools are bind with CUDA application. How can I get this metrics from device aspect like dcgm or nvml api? NVML API had already support SM active and SM occupancy on Hopper or newer devices(NVML API Reference Guide :: GPU Deployment and Management Documentation). However, Ampere architecture is not support. what should I do to get real GPU util for my nvidia A10?