Unable to convert Mask-RCNN (.pb or .h5 format) model to UFF

pradan · May 8, 2021, 7:17am

I use the current setup:

Hardware:

i7 based Acer system
NVIDIA GEFORCE RTX
NVIDIA Jetson Xavier (target)

Software:

Ubuntu 18.04
TensorRT v7.2.2
CUDA Version 10.2.89
cuDNN v8.1.1
GNU make >= v4.1
cmake >= v3.13
Python 3.6.5
Uff 0.6.9
graphsurgeon

Attempt 1

I followed the steps given in sampleUffMaskRCNN README and went on modifying the conv2d_transpose function in /usr/lib/python3.6/dist-packages/uff/converters/tensorflow/converter_functions.py. I also applied the 0001-Update-the-Mask_RCNN-model-from-NHWC-to-NCHW.patch patch.

For test, I used the same sample model provided in the instructions. In short, I replicated all the steps.

The config.py and model.py I used, have been attached below config.py (9.2 KB) model.py (124.6 KB)

I get the error ending with:

Converting to UFF graph
Traceback (most recent call last):
  File "mrcnn_to_trt_single.py", line 164, in <module>
    main()
  File "mrcnn_to_trt_single.py", line 123, in main
    text=True, list_nodes=list_nodes)
  File "mrcnn_to_trt_single.py", line 157, in convert_model
    debug_mode = False
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/uff/converters/tensorflow/conversion_helpers.py", line 276, in from_tensorflow_frozen_model
    return from_tensorflow(graphdef, output_nodes, preprocessor, **kwargs)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/uff/converters/tensorflow/conversion_helpers.py", line 225, in from_tensorflow
    debug_mode=debug_mode)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/uff/converters/tensorflow/converter.py", line 141, in convert_tf2uff_graph
    uff_graph, input_replacements, debug_mode=debug_mode)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/uff/converters/tensorflow/converter.py", line 126, in convert_tf2uff_node
    op, name, tf_node, inputs, uff_graph, tf_nodes=tf_nodes, debug_mode=debug_mode)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/uff/converters/tensorflow/converter.py", line 94, in convert_layer
    return cls.registry_[op](name, tf_node, inputs, uff_graph, **kwargs)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/uff/converters/tensorflow/converter_functions.py", line 87, in convert_add
    uff_graph.binary(inputs[0], inputs[1], 'add', name)
IndexError: list index out of range

The entire logs are here:log1.txt (92.7 KB)

On running the same script but with my model (also based on ResNet101 like the sample model), I get the error ending with:


Traceback (most recent call last):
  File "mrcnn_to_trt_single.py", line 164, in <module>
    main()
  File "mrcnn_to_trt_single.py", line 115, in main
    model.load_weights(model_weights_path, by_name=True)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/keras/engine/topology.py", line 2643, in load_weights
    f, self.layers, skip_mismatch=skip_mismatch)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/keras/engine/topology.py", line 3248, in load_weights_from_hdf5_group_by_name
    K.batch_set_value(weight_value_tuples)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/keras/backend/tensorflow_backend.py", line 2365, in batch_set_value
    assign_op = x.assign(assign_placeholder)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/tensorflow_core/python/ops/variables.py", line 2067, in assign
    self._variable, value, use_locking=use_locking, name=name)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/tensorflow_core/python/ops/state_ops.py", line 227, in assign
    validate_shape=validate_shape)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/tensorflow_core/python/ops/gen_state_ops.py", line 66, in assign
    use_locking=use_locking, name=name)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/tensorflow_core/python/framework/op_def_library.py", line 794, in _apply_op_helper
    op_def=op_def)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/tensorflow_core/python/util/deprecation.py", line 507, in new_func
    return func(*args, **kwargs)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/tensorflow_core/python/framework/ops.py", line 3357, in create_op
    attrs, op_def, compute_device)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/tensorflow_core/python/framework/ops.py", line 3426, in _create_op_internal
    op_def=op_def)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/tensorflow_core/python/framework/ops.py", line 1770, in __init__
    control_input_ops)
  File "/home/hotify/trt_sample_try/trt_sample/lib/python3.6/site-packages/tensorflow_core/python/framework/ops.py", line 1610, in _create_c_op
    raise ValueError(str(e))
ValueError: Dimension 1 in both shapes must be equal, but are 324 and 16. Shapes are [1024,324] and [1024,16]. for 'Assign_682' (op: 'Assign') with input shapes: [1024,324], [1024,16].

The entire logs are: log3.txt (10.1 KB) .

Both models are ResNet101 and still leading to errors. I tried following the post: converting mask rcnn to tensor rt - #31 by ChrisDing but their configuration file is way different from mine. On using theirs, I get the same error.

Any help is appreciated 🙂

mchi · May 12, 2021, 1:26am

HI @pradan
Will check and get back to you

Thanks!

pradan · May 14, 2021, 12:47pm

Ok. I got past this error by using CUDA 10.2 + cudnn 8.1 and TensorRT 7.0.0.11.

Can you please guide me how should I visualise the results of MaskRCNN UFF predictions ? I mean, is there a way to save the mask ?

mchi · May 15, 2021, 8:21am

you can refer to GitHub - NVIDIA-AI-IOT/deepstream_4.x_apps: deepstream 4.x samples to deploy TLT training models to save the mask, but we don’t have existing OSD plugin that can draw the mask rcnn mask on frame

pradan · May 15, 2021, 9:46am

The deepstream repo recommends TensorRT version to be 5 or 6 but I have converted the h5 to UFF using version 7.0. How is that justified ? Also, I feel that the documentation on deepstream is poor.

What methods do I have that do not need deepstream ?

mchi · May 15, 2021, 10:32am

Sorry!

We have mask-rcnn samples
/opt/nvidia/deepstream/deepstream/sources/apps/sample_apps/deepstream-mrcnn-test

or

you can refer to the PeopleSegNet sample in GitHub - NVIDIA-AI-IOT/deepstream_tao_apps at release/tlt3.0

// PeopleSegNet

pradan · May 17, 2021, 7:17am

Still no answer to this…

Does it make sense, to now use TensorRT 5 or 6 after converting using TensorRT7 ?

mchi · May 19, 2021, 4:16pm

We have uff converter in each TRT release, so there is not guarantee that the converted uff works on a different TRT version.
And, uff is deprecating, we recommned to use ONNX.

[We have deprecated the Caffe Parser and UFF Parser in TensorRT 7.0. They are still tested and functional in TensorRT 8.0, however, we plan to remove the support in the future. Ensure you migrate your workflow to use tf2onnx, keras2onnx or TensorFlow-TensorRT (TF-TRT) for deployment](Release Notes :: NVIDIA Deep Learning TensorRT Documentation)

Topic		Replies	Views
Assertion `inputs[0].nbDims == 4 && inputs[0].d[1] == mNbClasses * 4' failed.' TensorRT tensorrt	7	658	May 31, 2021
Deepstream doesn't give expected Mask-RCNN output DeepStream SDK	26	3015	February 8, 2022
Accuracy decreases after integrating mask rcnn in deepstream DeepStream SDK	32	1909	October 12, 2021
Sample for Mask R-CNN crashes TensorRT	8	913	April 6, 2020
Deepstream Mask RCNN bad performance DeepStream SDK	8	966	December 5, 2022
Help needed while using Tensor RT 3 to create inference engine for facenet model. Jetson TX2	13	3552	October 18, 2021
converting mask rcnn to tensor rt DeepStream SDK	41	12369	August 10, 2021
Cannot convert a tensorflow model to UFF Jetson TX2	13	2339	October 18, 2021
Assertion `inputs[0].nbDims == 4 && inputs[0].d[1] == mNbClasses * 4' failed DeepStream SDK tensorrt	8	1407	August 26, 2021
Error converting Maskrcnn keras model to TRT TensorRT tensorrt	3	514	January 20, 2022

Unable to convert Mask-RCNN (.pb or .h5 format) model to UFF

Hardware:

Software:

Attempt 1

Related topics