I tested the performance of Xavier NX in connection with Tensorflow, TF-TRT, OpenCV and the SSD-MobilenetV2 pretrained on the COCO dataset and was quite disappointed. I only get 10fps with the sample video attached. The GPU does not seem to be heavily loaded.
Installed Tensorflow 1.15 according to Official TensorFlow for Jetson AGX XavierNX
Installed OpenCV with CUDA support
Installed everything else according to How to configure your NVIDIA Jetson Nano for Computer Vision and Deep Learning - PyImageSearch
Created an optimized TensorRT graph
Attached: Used Scripts and the according terminal output, the Sample video and the jtop-Info Screenshot
detect_realtime_nano.py (7.4 KB)
Output_detect_realtime_nano.txt (6.0 KB)
Output_prepare_trt_graph.txt (28.3 KB)
prepare_trt_graph.py (2.2 KB)
Here is demo where you can see the jetson jtop stats during the inference: https://share.icloud.com/photos/0O0SXTp9PBkj8SikiV3y-nvuw
What am I doing wrong? Or can someone confirm that this is the maximum performance of the XavierNX with this framework?