Recap on tensorflow object detection API on TX2

laupl0082 · January 12, 2018, 4:41pm

Hi users,
I want to summarize developers' experience and share some tips about the TensorFlow Object Detection API on TX2.
For now I am only covering what is and is not doable, with a focus on inference rather than training.
From browsing the forum, my own experience, and other resources, this is what I have understood so far.
Let’s consider only the available pretrained frozen graphs.

The TF Object Detection API can be used for inference with the ssd_mobilenet_v1 network architecture at approximately 5-8 fps. The pretrained Faster R-CNN ResNet models seem to cause OOM errors (in my experience, all of them). Has anybody been able to run one of the Faster R-CNN ResNet models? If so, could you share some tips?

Second question.
Does the conversion to TensorRT also have an impact on memory usage?
What I mean is the following: even if I cannot run a specific network architecture on TX2 through the TF Object Detection API because of OOM errors, I could potentially train a model on a different, more powerful machine, export the trained graph to UFF format (through the Python API), then transfer it to TX2, where it can be imported using the C++ API for inference. Does that sound correct? Performance would certainly benefit from a TF->TensorRT conversion, but I am not sure about memory usage.
I am considering this option because I noticed a FasterRcnnResnet50 network in jetson-inference DetectNet.

Thanks for your contribution!

AastaLLL · January 15, 2018, 3:29am

Hi,

Thanks for sharing.

We are also looking into the TensorFlow Object Detection API.
We appreciate you sharing your experience with us.

Although TensorFlow can run ssd_mobilenet_v1 correctly in GPU mode, we find that GPU utilization is quite low.
Do you see this issue as well?
Could you share the tegrastats output while running inference with ssd_mobilenet_v1?

sudo ./tegrastats

For your second question:
1. The workflow is correct. The only concern is that we do not yet support the custom API for UFF users.
If there is an unsupported layer in your model, there is no workaround (WAR) to run that layer with TensorRT.

2. TensorRT supports fp16 mode, which can cut memory usage in half and should be very helpful for your use case.
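
As a rough illustration of the "half the memory" point, the sketch below compares the storage needed for a model's weights in fp32 and fp16. The parameter count is a hypothetical placeholder, not a measurement of any model in this thread, and the estimate covers weights only: activations, workspace, and framework overhead are not included, so the real saving on TX2 may differ.

```python
def weight_memory_mib(num_params, bytes_per_value):
    """Return the memory needed to store num_params values, in MiB."""
    return num_params * bytes_per_value / (1024 ** 2)


# Hypothetical parameter count, chosen only for illustration.
num_params = 25_000_000

fp32_mib = weight_memory_mib(num_params, 4)  # fp32 uses 4 bytes per value
fp16_mib = weight_memory_mib(num_params, 2)  # fp16 uses 2 bytes per value

print("fp32 weights: {:.1f} MiB".format(fp32_mib))
print("fp16 weights: {:.1f} MiB".format(fp16_mib))
```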

Thanks.

vtaranti · February 19, 2018, 8:19pm

Hi AastaLLL,

I will soon be looking into the TensorFlow Object Detection API with TensorRT (for TX2).

Some models of interest are:

ssd_mobilenet_v1
ssd_inception_v2
faster_rcnn_inception_v2

Do you have any links specific to using the TensorFlow Object Detection API with TensorRT to get me started?

Thanks
vtaranti

AastaLLL · February 21, 2018, 9:10am

Hi,

We recommend first checking whether the layers of your model are supported by TensorRT.

The supported layers for the UFF parser and the TensorRT engine are listed in detail here:
UFF parser: Developer Guide :: NVIDIA Deep Learning TensorRT Documentation
TensorRT engine: Developer Guide :: NVIDIA Deep Learning TensorRT Documentation
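
A minimal sketch of such a check is shown below. It compares the op types found in a model against a set of supported ops and reports the ones that are missing. Both lists here are hypothetical placeholders: the real supported set has to be taken from the documentation above, and the op types of a frozen TensorFlow graph would typically be collected from the `op` field of each node in its GraphDef.

```python
from collections import Counter


def find_unsupported_ops(op_types, supported_ops):
    """Return {op_type: count} for every op type not in supported_ops."""
    counts = Counter(op_types)
    return {op: n for op, n in counts.items() if op not in supported_ops}


# Hypothetical supported set, for illustration only (NOT the real TensorRT list).
example_supported = {"Conv2D", "Relu", "BiasAdd", "MaxPool"}

# Hypothetical op types, as they might be read from a frozen graph.
example_graph_ops = [
    "Conv2D", "BiasAdd", "Relu", "Conv2D", "NonMaxSuppressionV2", "Where",
]

missing = find_unsupported_ops(example_graph_ops, example_supported)
for op, n in sorted(missing.items()):
    print("unsupported op: {} (used {} time(s))".format(op, n))
```

An empty result means every op in the graph appears in the supported set you provided; it does not by itself guarantee that the conversion will succeed.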

Thanks.

gustavvz · April 6, 2018, 8:28am

Hi guys,

Are there any new projects like (https://github.com/NVIDIA-Jetson/tf_to_trt_image_classification) but for object detection models using TensorRT?

Regarding the performance of the Object Detection API:
I have been working on this topic for the last 4 months and also started at around 4 fps for MobileNet SSD.
Now I am able to achieve up to 30 fps with the same model on the Jetson. You can have a look at my GitHub and try it out (https://github.com/GustavZ/realtime_object_detection)
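
The frame rates quoted in this thread are easier to compare when they are measured the same way. Below is a small sketch of an fps measurement that skips a few warm-up frames before timing. `fake_infer` is a hypothetical stand-in for a real inference call and is not taken from the repository above.

```python
import time


def measure_fps(infer_fn, frames, warmup=2):
    """Run infer_fn on every frame and return frames per second,
    excluding the first `warmup` frames from the timing."""
    frames = list(frames)
    for frame in frames[:warmup]:
        infer_fn(frame)  # warm-up runs, not timed
    timed = frames[warmup:]
    start = time.perf_counter()
    for frame in timed:
        infer_fn(frame)
    elapsed = time.perf_counter() - start
    return len(timed) / elapsed if elapsed > 0 else float("inf")


def fake_infer(frame):
    # Hypothetical placeholder: pretend each inference takes about 10 ms.
    time.sleep(0.01)


fps = measure_fps(fake_infer, range(12))
print("measured: {:.1f} fps".format(fps))
```

Excluding warm-up frames matters because the first few inference calls usually include one-time initialization cost and would otherwise pull the average down.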

What I am interested in now is making Mask R-CNN run on the Jetson. Has anybody gotten it working on the Jetson successfully? Maybe through compression/binarization techniques? Is it documented which Mask R-CNN layers TensorRT does not support? Probably a lot…

Anyway, it would be nice to hear about your experience.

AastaLLL · April 10, 2018, 6:58am

Hi,

Sorry, we are not familiar with Mask R-CNN.
But you can find the detailed list of layers supported by TensorRT 4 here:
Developer Guide :: NVIDIA Deep Learning TensorRT Documentation

Thanks.

kithminr1995 · August 28, 2018, 3:10am

Hi,

Is there a Mask R-CNN sample available for TensorRT 4? I need to know how to create my config.py file to be used as a preprocessor. I am using the matterport Mask R-CNN model as well.

Thank You!

AastaLLL · August 31, 2018, 3:50am

Hi,

Mask R-CNN is not among our official samples.
You will probably need some plugin implementations to make it work.

You can check this sample for details:
https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html#uffssd_sample

Thanks.

raki.dedigama · November 5, 2018, 7:56am

This repo (GitHub - NVIDIA-AI-IOT/tf_trt_models: TensorFlow models accelerated with NVIDIA TensorRT) is a good resource for optimizing TensorFlow classification/detection models with TensorRT. You can achieve up to 10-15 FPS on the Jetson TX2. However, I was not able to get Mask R-CNN working properly in a similar manner. It seems that the mask layer is not yet supported in TensorRT 4.
