How to improve Orin module memory bandwidth

Hi,
we do compile mbw with CFLAGS=-O2 -Wall -g and see ~9400 MiB/s. If you run with prebuilt and self-built mbw, and see identical result, it is throughput of production module.