When using peer memory, how can I measure the L1 or L2 cache values on the operating GPU?

It seems you have already found the answer to why no peer traffic is shown for NVLink data here. Peer traffic covers PCIe-connected GPUs; it does not count NVLink traffic. NVLink traffic is shown in the NVLink section instead.

To collect the NVLink section, use --set nvlink or --set full --section Nvlink.
