Hi,
I am trying to use the documented public SCF PMU events on Jetson AGX Orin as the main memory-related signal for CPU-driven workloads.
From the public TRM, I understand that CPU memory reads/writes go through:
CPU cluster → ACI → SCF → SNOC → MCF
My question is specifically about the public SCF PMU events such as:
- scf_pmu/bus_access_rd/
- scf_pmu/bus_access_wr/
- scf_pmu/scf_cache_refill/
- scf_pmu/scf_cache_wb/
Could NVIDIA clarify, at a high level, what boundary these events correspond to?
In particular:
- Do bus_access_rd / bus_access_wr count accesses entering SCF from the CPU/cluster side, or traffic leaving SCF toward SNOC / memory-controller side?
- Do these events include accesses that are satisfied by the SCF system cache (L4), or only accesses that miss in SCF and continue toward memory?
- Are MMIO or other non-DRAM paths excluded from these events?
Even a brief clarification of the intended counting boundary would be very helpful.
Thank you.
Hi,
Publicly we have tegrastats showing loading of CPU cores:
Tegrastats Utility — NVIDIA Jetson Linux Developer Guide
Seems like you need further information about CPU processing tasks. Is there a reason to get the operations in detail?
Hi, thanks.
Yes, there is a reason for asking at this level of detail.
This is part of my Bachelor’s thesis, where I am studying memory-bandwidth measurement for CPU-driven workloads on Jetson AGX Orin. I am not trying to monitor CPU processing load in general. My goal is to use the public SCF PMU events as the main memory-related signal, and for that I need to understand what boundary they represent at a high level.
For example, I am trying to understand whether these public events are closer to:
- traffic entering SCF from the CPU / cluster side
- traffic leaving SCF toward SNOC / MCF / DRAM
- or accesses internal to SCF, including requests satisfied by the SCF system cache
tegrastats is useful for system monitoring, but it does not answer that semantic question. I am specifically asking about the intended counting boundary of the documented public SCF PMU events:
scf_pmu/bus_access_rd/
scf_pmu/bus_access_wr/
scf_pmu/scf_cache_refill/
scf_pmu/scf_cache_wb/
Even a high-level clarification such as “CPU-side of SCF”, “memory-side of SCF”, or “includes accesses satisfied by SCF cache” would already be very helpful for my thesis.
Thank you.