Hi NVIDIA folks!
I successfully run the official sample cupti_metric_properties
on v100 platform, which reveals all available metrics on this GPU.
Yet I failed to run this sample on h100 platform. This root cause is:
CUPTI need to call cuptiProfilerGetCounterAvailability
twice to obtain the counter availability image. The first call is to obtain the size of the image, then the user sets the buffer as corresponding size, finally the second call returns the actual image.
The sample failed at the second call to cuptiProfilerGetCounterAvailability
, the return value is 999 (give nothing valuable).
My environment is :
- CUDA 12.1
- Ubuntu 20.04
- Driver 555.42.06
- NVIDIA H100
Could you please help to solve this issue? Thanks so much