I’m working on an xavier agx / 32gb. The nvpmodel is set to MAXN (running jetson_clocks does not change the outcome).
cat /sys/kernel/debug/bpmp/debug/clk/emc/rate
returns 2133000000 (Hz) which is expected, and which together with the 256 bit bus width translates to the advertised 137GB/s memory bandwidth.
However, if I print the memoryClockRate obtained through cudaGetDeviceProperties the result is 1377000 (KHz) independently of the nvpmodel and/or jetson_clocks. So the implied memory bandwidth is not 137GB/s but only 88GB/s.
Given that CPU and GPU use the same physical memory, I would assume that the bus width and frequencies are the same.
Is there something wrong with the way I set the memory frequency + what is reported by /sys/kernel/debug/bpmp/debug/clk/emc/rate or is cudaGetDeviceProperties giving wrong results?