UFF models occurs error after changing the input shape

zhhuang · March 27, 2019, 9:35am

I have trained a 512x512 ssd-inception-v2 model from tensorflow and also convert its pb file into uff format. But the thing is it occurs error when running this uff model.

Begin parsing model...
End parsing model...
Begin building engine...
sample_uff_ssd: nmsPlugin.cpp:135: virtual void nvinfer1::plugin::DetectionOutput::configureWithFormat(const nvinfer1::Dims*, int, const nvinfer1::Dims*, int, nvinfer1::DataType, nvinfer1::PluginFormat, int): Assertion `numPriors * numLocClasses * 4 == inputDims[param.inputOrder[0]].d[0]' failed.
Aborted (core dumped)

I know that the offical sample in sampleUffSSD is a 300x300 model, so I change property in config.py. Here is my config file:

import graphsurgeon as gs
import tensorflow as tf

Input = gs.create_node("Input",
    op="Placeholder",
    dtype=tf.float32,
    shape=[1, 3, 512, 512])
    PriorBox = gs.create_plugin_node(name="GridAnchor", op="GridAnchor_TRT",
    numLayers=6,
    minSize=0.2,
    maxSize=0.95,
    aspectRatios=[1.0, 2.0, 0.5, 3.0, 0.33],
    variance=[0.1,0.1,0.2,0.2],
    #featureMapShapes=[19, 10, 5, 3, 2, 1])
     featureMapShapes=[33, 18, 9, 6, 4, 2])
    NMS = gs.create_plugin_node(name="NMS", op="NMS_TRT",
    shareLocation=1,
    varianceEncodedInTarget=0,
    backgroundLabelId=0,
    confidenceThreshold=1e-8,
    nmsThreshold=0.6,
    topK=100,
    keepTopK=100,
    numClasses=9,
    inputOrder=[0, 2, 1],
    #inputOrder=[0, 1,2],
    confSigmoid=1,
    isNormalized=1,
    scoreConverter="SIGMOID")
    concat_priorbox = gs.create_node(name="concat_priorbox", op="ConcatV2", dtype=tf.float32, axis=2)
    concat_box_loc = gs.create_plugin_node("concat_box_loc", op="FlattenConcat_TRT", dtype=tf.float32, axis=1, 
    ignoreBatch=0)
    concat_box_conf = gs.create_plugin_node("concat_box_conf", op="FlattenConcat_TRT", dtype=tf.float32, axis=1, 
    ignoreBatch=0)

namespace_plugin_map = {
    "MultipleGridAnchorGenerator": PriorBox,
    "Postprocessor": NMS,
    "Preprocessor": Input,
    # "ToFloat": Input,
    # "image_tensor": Input,
    "MultipleGridAnchorGenerator/Concatenate": concat_priorbox,
    #"Concatenate/concat": concat_priorbox,
    "concat": concat_box_loc,
    "concat_1": concat_box_conf,
}

namespace_remove = {
    "ToFloat",
    "image_tensor",
    "Preprocessor/map/TensorArrayStack_1/TensorArrayGatherV3",
}

def preprocess(dynamic_graph):
    # remove the unrelated or error layers
    dynamic_graph.remove(dynamic_graph.find_nodes_by_path(namespace_remove), remove_exclusive_dependencies=False)

    # Now create a new graph by collapsing namespaces
    dynamic_graph.collapse_namespaces(namespace_plugin_map)
    # Remove the outputs, so we just have a single output node (NMS).
    dynamic_graph.remove(dynamic_graph.graph_outputs, remove_exclusive_dependencies=False)

    # Remove the Squeeze to avoid "Assertion `isPlugin(layerName)' failed"
    Squeeze = dynamic_graph.find_node_inputs_by_name(dynamic_graph.graph_outputs[0], 'Squeeze')
    dynamic_graph.forward_inputs(Squeeze)

And also this is my training pipeline

# SSD with Inception v2 configuration for MSCOCO Dataset.
# Users should configure the fine_tune_checkpoint field in the train config as
# well as the label_map_path and input_path fields in the train_input_reader and
# eval_input_reader. Search for "PATH_TO_BE_CONFIGURED" to find the fields that
# should be configured.

model {
  ssd {
    num_classes: 8
    box_coder {
      faster_rcnn_box_coder {
        y_scale: 10.0
        x_scale: 10.0
        height_scale: 5.0
        width_scale: 5.0
      }
    }
    matcher {
      argmax_matcher {
        matched_threshold: 0.5
        unmatched_threshold: 0.5
        ignore_thresholds: false
        negatives_lower_than_unmatched: true
        force_match_for_each_row: true
      }
    }
    similarity_calculator {
      iou_similarity {
      }
    }
    anchor_generator {
      ssd_anchor_generator {
        num_layers: 6
        min_scale: 0.2
        max_scale: 0.95
        aspect_ratios: 1.0
        aspect_ratios: 2.0
        aspect_ratios: 0.5
        aspect_ratios: 3.0
        aspect_ratios: 0.3333
        reduce_boxes_in_lowest_layer: true
      }
    }
    image_resizer {
      fixed_shape_resizer {
        height: 300
        width: 300
      }
    }
    box_predictor {
      convolutional_box_predictor {
        min_depth: 0
        max_depth: 0
        num_layers_before_predictor: 0
        use_dropout: true
        dropout_keep_probability: 0.5
        kernel_size: 3
        box_code_size: 4
        apply_sigmoid_to_scores: false
        conv_hyperparams {
          activation: RELU_6,
          regularizer {
            l2_regularizer {
              weight: 0.00004
            }
          }
          initializer {
            truncated_normal_initializer {
              stddev: 0.03
              mean: 0.0
            }
          }
        }
      }
    }
    feature_extractor {
      type: 'ssd_inception_v2'
      min_depth: 16
      depth_multiplier: 1.0
      conv_hyperparams {
        activation: RELU_6,
        regularizer {
          l2_regularizer {
            weight: 0.00004
          }
        }
        initializer {
          truncated_normal_initializer {
            stddev: 0.03
            mean: 0.0
          }
        }
        batch_norm {
          train: true,
          scale: true,
          center: true,
          decay: 0.9997,
          epsilon: 0.001,
        }
      }
    }
    loss {
      classification_loss {
        weighted_sigmoid {
          anchorwise_output: true
        }
      }
      localization_loss {
        weighted_smooth_l1 {
          anchorwise_output: true
        }
      }
      hard_example_miner {
        num_hard_examples: 3000
        iou_threshold: 0.99
        loss_type: CLASSIFICATION
        max_negatives_per_positive: 3
        min_negatives_per_image: 0
      }
      classification_weight: 1.0
      localization_weight: 1.0
    }
    normalize_loss_by_num_matches: true
    post_processing {
      batch_non_max_suppression {
        score_threshold: 1e-8
        iou_threshold: 0.6
        max_detections_per_class: 100
        max_total_detections: 100
      }
      score_converter: SIGMOID
    }
  }
}

train_config: {
  batch_size: 16
  optimizer {
    rms_prop_optimizer: {
      learning_rate: {
        exponential_decay_learning_rate {
          initial_learning_rate: 0.001
          decay_steps: 150720
          decay_factor: 0.95
        }
      }
      momentum_optimizer_value: 0.9
      decay: 0.9
      epsilon: 1.0
    }
  }
  fine_tune_checkpoint: "/home/hite/Downloads/oldverison_TFmodel/models/research/object_detection/ssd_model/ssd_inception_v2_coco_2017_11_17/model.ckpt"

  from_detection_checkpoint: true
  # Note: The below line limits the training process to 200K steps, which we
  # empirically found to be sufficient enough to train the pets dataset. This
  # effectively bypasses the learning rate schedule (the learning rate will
  # never decay). Remove the below line to train indefinitely.
  num_steps: 500000
  data_augmentation_options {
    random_horizontal_flip {
    }
  }
  data_augmentation_options {
    ssd_random_crop {
    }
  }
}

train_input_reader: {
  tf_record_input_reader {
    input_path: "/home/hite/Downloads/oldverison_TFmodel/models/research/object_detection/ssd_model/pascal_train.record"
  }
  label_map_path: "/home/hite/Downloads/oldverison_TFmodel/models/research/object_detection/ssd_model/pascal_label_map.pbtxt"
}

eval_config: {
  num_examples: 8000
  # Note: The below line limits the evaluation process to 10 evaluations.
  # Remove the below line to evaluate indefinitely.
  max_evals: 10
}

eval_input_reader: {
  tf_record_input_reader {
    input_path: "/home/hite/Downloads/oldverison_TFmodel/models/research/object_detection/ssd_model/pascal_train.record"
  }
  label_map_path: "/home/hite/Downloads/oldverison_TFmodel/models/research/object_detection/ssd_model/pascal_label_map.pbtxt"
  shuffle: false
  num_readers: 1
  num_epochs: 1
}

Is there anyone has methods to fix this problem

NVES · March 27, 2019, 3:04pm

Hello,

I believe the sample does a resize to 300x300 itself and then works with the image of this size. In the config.py we are discarding of all the preprocessing and thus we need to make a resize ourselves.

So it’s not possible to make an input of other size than 300x300 as of right now without changing a whole architecture of the network in the demo.

zhhuang · March 28, 2019, 3:04am

Hi NVES,
Thx for the reply.
About the sample code, I have changed the input register part in sampleUffSSD.cpp.

parser->registerInput("Input", DimsCHW(3, 512, 512), UffInputOrder::kNCHW);

And in BatchStreamPPM.h,

static constexpr int INPUT_C = 3;
static constexpr int INPUT_H = 512;
static constexpr int INPUT_W = 512;

But still, I m stuck in the engine building part.
Any thing else that I miss ?

bugo · June 21, 2019, 6:58am

ssd {
    num_classes: 8

sampleUffSSD has defined constant for this static constexpr int OUTPUT_CLS_SIZE = 91;

ivan.ralasic · August 5, 2019, 12:59pm

Hi, I’m trying to convert a tf model to TRT.

I’m able to convert the default SSD_mobilenet_v2 model and SSD_mobilenet_v2 model trained on custom data without a problem. This is true when the input size is fixed to 300x300, but if I try to change the input size to a different size (i.e. 500x500), the conversion to .uff fails with following error:

python3.6: nmsPlugin.cpp:139: virtual void nvinfer1::plugin::DetectionOutput::configureWithFormat(const nvinfer1::Dims*, int, const nvinfer1::Dims*, int, nvinfer1::DataType, nvinfer1::PluginFormat, int): Assertion `numPriors * numLocClasses * 4 == inputDims[param.inputOrder[0]].d[0]' failed.

I see that similar question has been asked previously, but there hasn’t been an update on this issue. Can you give me instructions how to perform model conversion if the input size isn’t the default 300x300.

I hope that there will be much more support on the trt and uff formats and conversions, because the single example with pretrained COCO models is pretty useless.

Thank you very much!

Topic		Replies	Views
sampleUffSSD with newer tensorflow models (2018) TensorRT	16	2515	November 13, 2020
Converting sampleUffSSD for different tensorflow models TensorRT	4	1568	October 10, 2018
How adapt Tensorflow object detection for custom dataset to Deepstream 5.0 DeepStream SDK tensorflow	17	1978	July 27, 2021
how to write config.py for converting ssd-mobilenetv2 to uff format Jetson Nano	19	6877	October 14, 2021
run SSD_MobileNetV2 (Tensorflow object detection API) on TensorRT TensorRT	18	5035	October 12, 2021
Problems with SSD Mobilenet v2 UFF Jetson Nano ssd	35	7935	October 18, 2021
Convert SSD-Mobilenet to UFF Jetson Nano	13	1828	October 14, 2021
How to convert SSD mobilenet v2 to uff,Then use uff in jetson_inference detectnet_camera script？ Jetson Nano	13	2170	October 14, 2021
ERROR: UFFParser: Parser error: BoxPredictor_0/Reshape: Reshape: -1 dimension specified more than 1 ... TensorRT	25	5608	September 24, 2020
sampleUffSSD with custom resolution for ssd_mobilenet_v1 TensorRT tensorrt	3	624	September 25, 2020

UFF models occurs error after changing the input shape

Related topics