Unable to convert Keras model to TensorRT runtime

I’m trying to convert a Keras YoloV3 model (i.e. [url]https://github.com/qqwweee/keras-yolo3[/url]) to a format that can be loaded with DeepStream’s nvinfer element.

I’ve seen the samples at [url]https://github.com/NVIDIA-AI-IOT/deepstream_reference_apps[/url] as well as [url]https://docs.nvidia.com/deeplearning/sdk/tensorrt-sample-support-guide/index.html#yolov3_onnx[/url], so I know that it is technically possible, but I haven’t yet been successful with the conversion.

I’ve tried permutations of freezing the graph, converting to UFF/ONNX, and using [url]https://github.com/onnx/tensorflow-onnx[/url], but each attempt failed with a different error.

What I’d like to know is: what is the best approach to go from a Keras model to inference in DeepStream?

Hi,

It’s recommended to use the UFF interface rather than ONNX, since UFF is our format for TensorFlow models.
Once you have a UFF model, there is a sample in the DeepStream package for UFF-based models:
{DeepStream_Release}/sources/objectDetector_SSD
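
For orientation, the piece that ties a UFF model into DeepStream is the nvinfer config file in that sample. A trimmed sketch of the relevant keys (key names follow the objectDetector_SSD sample config; the file names and values below are placeholders for a YOLO-sized model, not tested settings):

```
[property]
# 1/255 – typical YOLO pixel scaling (placeholder value)
net-scale-factor=0.0039215686
uff-file=model.uff
uff-input-dims=3;416;416;0
uff-input-blob-name=Input
output-blob-names=MarkOutput_0
num-detected-classes=80
# placeholder names for the custom bbox parser and its library
parse-bbox-func-name=NvDsInferParseCustomYolo
custom-lib-path=libnvdsinfer_custom_impl.so
```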

To enable YOLO, some plugin implementation is required, since the model contains layers that TensorRT does not support natively.
We have a plugin implementation for YOLO in deepstream_reference_apps, and you can integrate it directly into your application.

Thanks.

Solution 1:
You can refer to https://github.com/NVIDIA-AI-IOT/deepstream_reference_apps/tree/master/yolo to deploy YOLOv3 detection. It does not use the UFF/ONNX/Caffe parsers, but instead uses the TensorRT API directly to build the network.
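
For a sense of what that app does, here is a minimal sketch of building a network with the TensorRT Python API directly, rather than through a parser. The layers and zero-filled weights are placeholders, not the real YOLOv3 topology (the reference app builds the full network in C++ and loads the darknet weights):

```python
import numpy as np
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_engine():
    builder = trt.Builder(TRT_LOGGER)
    network = builder.create_network()

    # Input tensor in CHW; 416x416 is the usual YOLOv3 input size.
    inp = network.add_input(name="data", dtype=trt.float32, shape=(3, 416, 416))

    # Layers are added one by one through the API. Real weights would come
    # from the darknet .weights file; zeros keep the sketch self-contained.
    w = np.zeros((32, 3, 3, 3), dtype=np.float32)
    b = np.zeros(32, dtype=np.float32)
    conv = network.add_convolution(inp, num_output_maps=32,
                                   kernel_shape=(3, 3), kernel=w, bias=b)
    conv.stride = (1, 1)
    conv.padding = (1, 1)

    act = network.add_activation(conv.get_output(0),
                                 type=trt.ActivationType.LEAKY_RELU)
    act.alpha = 0.1  # leaky slope used by YOLO

    network.mark_output(act.get_output(0))
    builder.max_batch_size = 1
    builder.max_workspace_size = 1 << 28
    return builder.build_cuda_engine(network)

# Serialize so DeepStream's nvinfer can load it via model-engine-file.
with open("model.engine", "wb") as f:
    f.write(build_engine().serialize())
```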

Solution 2:

  1. Use TensorFlow’s graph_util and graph_io APIs to convert the Keras model to a frozen .pb graph.
    If the model was trained in NHWC, make sure the NCHW network can consume the pretrained weights. Most layers work directly after an NHWC → NCHW conversion, except Reshape, Flatten, Dense, and Softmax applied to a feature map.
    See [url]https://github.com/amir-abdi/keras_to_tensorflow[/url] for general code to convert a trained Keras model into an inference TensorFlow model.

  2. Convert the .pb to .uff using the UFF converter.
    Then map every TF node that TensorRT does not support directly to a plugin node.
    Refer to sources/objectDetector_FasterRCNN. A combined sketch of both steps follows this list.
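
To make the two steps concrete, here is a minimal sketch combining them, using TF 1.x-era APIs in the same pattern as the linked keras_to_tensorflow script. The file names and paths are placeholders, so adjust them to your model:

```python
import tensorflow as tf
import uff
from keras import backend as K
from keras.models import load_model
from tensorflow.python.framework import graph_io

# Step 1: freeze the Keras model into a .pb graph
# (same pattern as amir-abdi/keras_to_tensorflow).
K.set_learning_phase(0)                # inference mode, no dropout/BN updates
model = load_model("yolo.h5")          # placeholder path to the Keras model
output_names = [out.op.name for out in model.outputs]

sess = K.get_session()
frozen = tf.graph_util.convert_variables_to_constants(
    sess, sess.graph.as_graph_def(), output_names)
graph_io.write_graph(frozen, ".", "yolo.pb", as_text=False)

# Step 2: convert the frozen graph to UFF. Any op the converter cannot
# handle is reported here; those are the ones to map to plugin nodes.
uff.from_tensorflow(frozen, output_nodes=output_names,
                    output_filename="yolo.uff")
```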


Thanks for the quick answers! Is there a process to figure out which layers in a model are supported by UFF and which ones need custom plugins?

TensorRT provides the convert-to-uff tool, which will report an error for each unsupported layer and thereby tell you which custom plugins you need to implement.

https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html#samplecode3

You can also refer to the TensorRT sample sampleUffSSD, in particular its README.md and config.py.
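
The mapping itself is done in config.py with the graphsurgeon package: collapse each unsupported subgraph into a single plugin node before conversion. A minimal sketch, assuming hypothetical node and plugin names (the real ones come from your graph and from the errors convert-to-uff prints):

```python
import graphsurgeon as gs

# Hypothetical plugin node standing in for an unsupported subgraph.
# "op" must match the name registered by your TensorRT plugin implementation.
yolo_plugin = gs.create_plugin_node(name="yolo_plugin", op="YoloLayer_TRT",
                                    numClasses=80)

# Map the TF namespaces reported as unsupported to the plugin node.
namespace_plugin_map = {
    "yolo_head": yolo_plugin,  # hypothetical namespace in the frozen graph
}

def preprocess(dynamic_graph):
    # convert-to-uff calls this hook when run with "-p config.py", e.g.:
    #   convert-to-uff yolo.pb -o yolo.uff -O <output-node> -p config.py
    dynamic_graph.collapse_namespaces(namespace_plugin_map)
```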

Thank you both for your help!