Optimize TF-TRT models on Jetson Nano to improve inference time and efficiency

Hi,
I have run a TF-TRT model (FP16) for image classification on a Jetson Nano.
For more information, see tf_trt_models/classification.ipynb at master · NVIDIA-AI-IOT/tf_trt_models · GitHub.
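For reference, the FP16 conversion in that notebook uses the TF 1.x TF-TRT contrib API. A minimal sketch (the graph path and output node name below are placeholders, not values from this thread):

import tensorflow as tf
import tensorflow.contrib.tensorrt as trt  # TF-TRT contrib API (TF 1.x)

# Placeholder path/name: adjust to your frozen graph.
FROZEN_GRAPH = 'inception_resnet_v2_frozen.pb'
OUTPUT_NAMES = ['InceptionResnetV2/Logits/Predictions']

# Load the frozen TensorFlow graph.
with tf.gfile.GFile(FROZEN_GRAPH, 'rb') as f:
    frozen_graph = tf.GraphDef()
    frozen_graph.ParseFromString(f.read())

# Replace TensorRT-compatible subgraphs with FP16 TRT engines.
trt_graph = trt.create_inference_graph(
    input_graph_def=frozen_graph,
    outputs=OUTPUT_NAMES,
    max_batch_size=1,
    max_workspace_size_bytes=1 << 25,
    precision_mode='FP16')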

I used an Inception-ResNet-v2 model for prediction. Inference takes about 140 ms, and the process uses around 3.5 GiB of memory plus 5.2 GiB of swap.

Can I get support to optimize the timing and efficiency of my system? The details are:

Framework: TensorFlow-TensorRT (TF-TRT)
Architecture: Inception-ResNet-v2
Inference time: ~140 ms
Memory: ~3.5 GiB
Swap: ~5.2 GiB

I would like to know whether I can get support to improve these numbers, or pointers to literature and examples on how to take full advantage of the hardware.

Thanks.

Hi,

Swap may have a negative impact on performance.
Can your model run inference without swap enabled (e.g., after running sudo swapoff -a)?

Also, it’s recommended to use pure TensorRT for better performance.
Here is a tutorial for your reference: [url]https://github.com/NVIDIA-AI-IOT/tf_to_trt_image_classification[/url]
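The tutorial exports the TensorFlow model to UFF and then builds a TensorRT engine from it. For illustration, the equivalent engine-building step with the TensorRT 5.x Python API looks roughly like this (a sketch only; the file and tensor names below are assumptions, not taken from the tutorial):

import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

# Assumed file/tensor names: adjust to your exported UFF model.
UFF_FILE = 'inception_resnet_v2.uff'
INPUT_NAME = 'input'
OUTPUT_NAME = 'InceptionResnetV2/Logits/Predictions'

with trt.Builder(TRT_LOGGER) as builder, \
        builder.create_network() as network, \
        trt.UffParser() as parser:
    parser.register_input(INPUT_NAME, (3, 299, 299))
    parser.register_output(OUTPUT_NAME)
    parser.parse(UFF_FILE, network)

    builder.max_batch_size = 1
    builder.max_workspace_size = 1 << 25
    builder.fp16_mode = True  # Nano's GPU supports fast FP16

    # Build and serialize the engine so inference can load the
    # plan file directly, without TensorFlow in the process.
    engine = builder.build_cuda_engine(network)
    with open('inception_resnet_v2.plan', 'wb') as f:
        f.write(engine.serialize())

Running a serialized plan directly avoids loading the TensorFlow runtime at inference time, which accounts for much of the memory pressure on Nano.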

Thanks.

Hi AastaLLL,

Thanks for your recommendations.

I tried the tutorial GitHub - NVIDIA-AI-IOT/tf_to_trt_image_classification (image classification with NVIDIA TensorRT from TensorFlow models) on my Jetson Nano.

I got this error when building the project:

git clone --recursive https://github.com/NVIDIA-Jetson/tf_to_trt_image_classification.git
cd tf_to_trt_image_classification
mkdir build
cd build
cmake ..
make

-- Configuring done
-- Generating done
-- Build files have been written to: /home/nano/prj/tf_to_trt_image_classification/build
[ 25%] Building CXX object src/CMakeFiles/uff_to_plan.dir/uff_to_plan.cpp.o
/home/nano/prj/tf_to_trt_image_classification/src/uff_to_plan.cpp: In function ‘int main(int, char**)’:
/home/nano/prj/tf_to_trt_image_classification/src/uff_to_plan.cpp:71:79: error: no matching function for call to ‘nvuffparser::IUffParser::registerInput(const char*, nvinfer1::DimsCHW)’
parser->registerInput(inputName.c_str(), DimsCHW(3, inputHeight, inputWidth));
^
How can I fix this?

Thanks.

Hi,

Please check the patch for TensorRT 5.x here:
[url]https://github.com/NVIDIA-AI-IOT/tf_to_trt_image_classification/pull/40[/url]
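For reference, TensorRT 5 changed the UFF parser API so that registerInput() takes an explicit input-order argument. Assuming the patch follows that API change, the failing call in uff_to_plan.cpp becomes something like:

// TensorRT 5.x: registerInput() requires a third argument
// specifying the input tensor order.
parser->registerInput(inputName.c_str(),
                      DimsCHW(3, inputHeight, inputWidth),
                      nvuffparser::UffInputOrder::kNCHW);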

Thanks.