We are looking for benchmarks that can give the peak FLOP/s and memory bandwidth on the Jetson AGX Orin.
https://github.com/NVIDIA-AI-IOT/jetson_benchmarks: We looked at this, and it seems to focus on deep learning workloads. We are interested in measuring the peak performance/bandwidth. Please recommend any standard benchmarks.
If these suggestions don’t help and you want to report an issue to us, please attach the model, command/step, and the customized app (if any) with us to reproduce locally.