set input to tensorflow frozen model before converting to TRT fails

alex73 · January 8, 2020, 5:08am

The goal:

convert a retrained ssd-inception-v2 tensorflow model to a TensorRT model.
conversion and inference done on TX2
training - laptop

I used the “ssd-inception-v2” model from tensorflow zoo, retrained it and now I wanted to convert to TRT.
The problem was that the converted model was not faster than the origin since the input dim is undefined (?, ?, ?, 3).

I tried to set the input size to a fixed size using the code below, but I get an error, as shown below.

Help is appreciated in what and how to change.

The error:

ValueError: node 'image_tensor' in input_map does not exist in graph (input_map entry: image_tensor:0->image_tensor:0)

The code:

with tf.gfile.GFile(frozen_graph_filename, "rb") as file_handle:
    graph_def = tf.GraphDef()
    graph_def.ParseFromString(file_handle.read())

new_input = tf.placeholder(dtype=tf.uint8, shape=[1, 320, 320, 3], name='image_tensor')

with tf.Graph().as_default() as frozen_graph:
    # tf.import_graph_def(graph_def, name='')  # <-- this works as expected
    tf.import_graph_def(graph_def, input_map={'image_tensor:0': new_input})
    
# convert to TRT:
model_out = ['detection_classes', 'num_detections', 'detection_boxes', 'detection_scores']
trt_graph = trt.create_inference_graph(
    input_graph_def=graph_def,  
    outputs=model_out,	
    max_batch_size=1,  
    max_workspace_size_bytes=max_work_space,  
    precision_mode=tensorRT_precision,
    is_dynamic_op=False)

SunilJB · January 8, 2020, 9:22am

Hi,

Could you please share your script and model file so we can better help?

Also, provide details on the platforms you are using:
o Linux distro and version
o GPU type
o Nvidia driver version
o CUDA version
o CUDNN version
o Python version [if using python]
o Tensorflow and PyTorch version
o TensorRT version

Thanks

alex73 · January 8, 2020, 10:02am

SuniJB,

Details:
– Laptop:

Ubuntu 18.04, GTX 1060
Cuda 10.0.130
CuDnn 7.5.0
tensorflow for retraining the model, built tag v1.13.0-821-g1a40ebdd
GitHub - tensorflow/models: Models and examples built with TensorFlow

– TX2:

jetpack 4.2.2

Both use:

python 3.6.8
tensorflow-gpu 1.14.0
tensorRT 5.1.5.0

The model is, as mentioned, ssd-mobilenetv2 (for object detection), as taken from the tensorflow zoo.
Retraining follows the tensorflow examples, and the conversion to tensorRT is as above.

Thanks

alex73 · January 8, 2020, 11:18am

if I run this:

with tf.gfile.GFile(frozen_graph_filename, "rb") as file_handle:
     graph_def = tf.GraphDef()
     graph_def.ParseFromString(file_handle.read())

new_input = tf.placeholder(dtype=tf.uint8, shape=[None, 320, 320, 3], name='image_tensor')
tf.import_graph_def(graph_def, input_map={'image_tensor:0': new_input})

I get:

ValueError: NodeDef mentions attr 'shape' not in Op<name=Cast; signature=x:SrcT -> y:DstT; attr=SrcT:type; attr=DstT:type; attr=Truncate:bool,default=false>; NodeDef: {{node import/Cast}}. (Check whether your GraphDef-interpreting binary is up to date with your GraphDef-generating binary.).

This is running on the laptop, same tf version (1.14.0)

SunilJB · January 9, 2020, 2:50pm

Could you please try below code, it seems to be working:

frozen_graph_filename = "frozen_inference_graph.pb"
with tf.gfile.GFile(frozen_graph_filename, "rb") as file_handle:
    graph_def = tf.GraphDef()
    graph_def.ParseFromString(file_handle.read())

new_input = tf.placeholder(dtype=tf.uint8, shape=[1, 320, 320, 3], name='image_tensor')

#with tf.Graph().as_default() as frozen_graph: <--- commented this code
    # tf.import_graph_def(graph_def, name='')  # <-- this works as expected
tf.import_graph_def(graph_def, input_map={'image_tensor:0': new_input})

Updated Input Layer:

name: "image_tensor"
op: "Placeholder"
attr {
  key: "dtype"
  value {
    type: DT_UINT8
  }
}
attr {
  key: "shape"
  value {
    shape {
      dim {
        size: 1
      }
      dim {
        size: 320
      }
      dim {
        size: 320
      }
      dim {
        size: 3
      }
    }
  }
}

Was able to successfully generate TRT engine with default max_workspace_size_bytes and precision_mode:
TensorRT model is successfully stored!
numb. of trt_engine_nodes in TensorRT graph: 4
numb. of all_nodes in TensorRT graph: 6023

alex73 · January 9, 2020, 9:19pm

SunilJB,

Thanks for the support and for the effort but that didn’t work either (comment #4), but I was able to export the model to a frozen graph with the input dim set in another way. Inference seems to be as expected.

When I convert to TRT I get a new error:

convert to trt:

# convert to TRT:
model_out = ['detection_classes', 'num_detections', 'detection_boxes', 'detection_scores']
trt_graph = trt.create_inference_graph(
    input_graph_def=graph_def,  
    outputs=model_out,	
    max_batch_size=1,  
    max_workspace_size_bytes=max_work_space,  
    precision_mode=tensorRT_precision,
    is_dynamic_op=False)

New error:

tensorflow.python.framework.errors_impl.InvalidArgumentError: NodeDef mentions attr 'shape' not in Op<name=Cast; signature=x:SrcT -> y:DstT; attr=SrcT:type; attr=DstT:type; attr=Truncate:bool,default=false>; NodeDef: {{node Cast}}. (Check whether your GraphDef-interpreting binary is up to date with your GraphDef-generating binary.)

Does this indicate a mismatch between tensorflow versions on the laptop vs. TX2 or could it be something else too?

On the laptop I get:
Version: 1.14.0

On the TX2 I get:
Version: 1.14.0+nv19.10

After looking at the options through:
https://developer.download.nvidia.com/compute/redist/jp/v42/tensorflow-gpu/
I tried 1.14.0+nv19.7, 1.14.0+nv19.9 but I still get the same error.

To install on the TX2 I run:
sudo pip3 install --no-cache-dir --pre --extra-index-url https://developer.download.nvidia.com/compute/redist/jp/v42 tensorflow-gpu==1.14.0+nv19.10

Thanks

alex73 · January 9, 2020, 9:53pm

Issue has been resolved!

I installed v1.14.0+nv19.10 on the TX2 and model was converted successfully and I could apply it too.

Not sure why this time the installation worked but the code above hasn’t changed.

Thanks for the effort SunilJB.

Topic		Replies	Views
Using TF-TRT to convert MobileNet / SSDLite model gives errors Jetson TX2	3	1960	October 18, 2021
TF-TRT TrtGraphConverterV2 converter build failure TensorRT	3	822	April 25, 2022
Tensorflow 1.7 with TensorRT fails Jetson TX2	13	3821	October 18, 2021
No SpeedUp after TensorRT INT8 (PointNet ++ tensorflow model) TensorRT	6	1253	February 25, 2020
use tensorflow tensorrt API convert failed TensorRT	7	2949	May 2, 2018
create_inference_graph() produced output model with too big size TensorRT	13	2200	August 9, 2019
PLEASE HELP ME! TF-TRT DepthwiseConv2dNative mismatching NCHW/NHWC format ERROR! TensorRT	0	552	February 23, 2021
Problem testing TensorRT optimized model Jetson TX2	5	1108	February 23, 2020
TF-TRT, why have to create TensorRT engine every time of inference ? TensorRT	7	2642	December 23, 2019
Dont see any speedups using TensorRT TensorRT	14	2951	October 12, 2021

set input to tensorflow frozen model before converting to TRT fails

Related topics