Hi there,
I am benchmarking the performance of a 16GB Orin NX versus my previous 16GB Xavier AGX.
I’m running this using trtexec and have noticed the following output. On the Xavier AGX, I achieved a 99th percentile latency of 1.57ms with a compute time of 1.45ms. On the Orin NX, I achieved a 99th percentile latency of 1.78ms with a compute time of 1.66ms.
The commands used to set this up was trtexec —onnx=model.onnx —int8 —duration=10 —avgRuns=100 on both devices. To troubleshoot, I’ve ensured that the both devices have run sudo nvpmodel -m 0 and sudo jetson_clocks. Both GPUs are above 90% utilization, what are the potential reasons for this performance decrease. I would expect the Orin NX to be faster.
Model can be found here: Odin-CV/CV/models/model.onnx at main · SumedhP/Odin-CV · GitHub