When trying to convert a Caffe GoogLeNet model to a TensorRT model, I ran into some issues.
My network is a GoogLeNet used as the feature extractor in the Faster R-CNN algorithm.
The RPROIFused layer provided by the library works fine and is not the problem. I have managed to run the Faster R-CNN sample successfully.

I get the following error message:

ERROR: inception_5a/output: all concat input tensors must have the same dimensions except on the concatenation axis
model_optimizer: ../common/enginehelper.h:89: nvinfer1::DimsCHW enginehelper::getCHW(const nvinfer1::Dims&): Assertion `d.nbDims >= 3' failed.

The problem seems to come from the Concat layer. I think that the blob dimensions are correct.
Bottom blob dimensions are
(1, 256, 7, 7)
(1, 320, 7, 7)
(1, 128, 7, 7)
(1, 128, 7, 7)

and the requested top dimension is
(1, 832, 7, 7)
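
As a sanity check on the shapes listed above, here is a minimal sketch in plain Python, independent of TensorRT. The helper name `concat_output_shape` is hypothetical and not part of any library. It assumes concatenation along the channel axis (axis 1, the Caffe Concat default) and applies the rule quoted in the error message: every dimension except the concatenation axis must match.

```python
from typing import Sequence, Tuple


def concat_output_shape(shapes: Sequence[Tuple[int, ...]], axis: int = 1) -> Tuple[int, ...]:
    """Return the shape obtained by concatenating `shapes` along `axis`.

    Raises ValueError if the inputs differ in rank or in any dimension
    other than the concatenation axis.
    """
    if not shapes:
        raise ValueError("need at least one input shape")
    first = shapes[0]
    total = 0
    for index, shape in enumerate(shapes):
        if len(shape) != len(first):
            raise ValueError(f"input {index} has rank {len(shape)}, expected {len(first)}")
        for dim, (expected, actual) in enumerate(zip(first, shape)):
            if dim != axis and expected != actual:
                raise ValueError(
                    f"input {index} differs on dimension {dim}: {actual} != {expected}"
                )
        total += shape[axis]
    return first[:axis] + (total,) + first[axis + 1:]


# Bottom blob shapes reported for inception_5a/output (N, C, H, W).
bottoms = [(1, 256, 7, 7), (1, 320, 7, 7), (1, 128, 7, 7), (1, 128, 7, 7)]
result = concat_output_shape(bottoms)
print(result)  # (1, 832, 7, 7)
```

With the four shapes above the check passes and yields the expected top shape, which suggests the mismatch reported by the parser does not come from the blob dimensions as written in the prototxt.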

However, in the GoogLeNet sample, a Concat layer with the same number of blobs works without error.
Is this a known issue? Did I miss something?

This issue is related to the following one (but with TensorRT 4)

I use TensorRT 5.0.2, CUDA 10, cuDNN 7.4.2, and an RTX 2080 on Ubuntu 18.04 with driver 410.72.


We are triaging and will keep you updated.


To help us debug, can you please share a repro that contains the model/network and the source code that demonstrates the errors you are seeing? We are not seeing this issue locally:

./infer_perf --deploy=/home/mvillmow/Downloads/faster_rcnn_test_iplugin_googlenet_v3_0.prototxt --output=cls_prob
&&&& RUNNING TensorRT.infer_perf # ./infer_perf --deploy=/home/mvillmow/Downloads/faster_rcnn_test_iplugin_googlenet_v3_0.prototxt --output=cls_prob
[I] deploy: /home/mvillmow/Downloads/faster_rcnn_test_iplugin_googlenet_v3_0.prototxt
[I] output: cls_prob
[I] Running on CUDA device: TITAN V (1.455 GHz, 80 SMs, mem 0.85 GHz, ECC disabled, 3072 bits, Compute Capability 7.0)
[I] Default InternalBuildFlags = 406f
[I] Updating InternalBuildFlags = 406f
[I] Input "data": 3x224x224
[I] Input "im_info": 1x1x3
[I] Output "cls_prob": 300x21x1
&&&& PASSED TensorRT.infer_perf # ./infer_perf --deploy=/home/mvillmow/Downloads/faster_rcnn_test_iplugin_googlenet_v3_0.prototxt --output=cls_prob


I have sent you a private message with the requested data.

Thanks for your help :)


Per engineering:

After getting it to compile and load the attached deploy_plugin.txt and weights_modif.caffemodel, this is the output I get:

***** Optimisation modele *****

Librairie Caffe
Caffe model file : /home/mvillmow/p4/mvillmow-tensorrt/sw/gpgpu/MachineLearning/DIT/release/5.1/build/x86_64-linux/d.pry
Caffe weights file : /home/mvillmow/p4/mvillmow-tensorrt/sw/gpgpu/MachineLearning/DIT/release/5.1/build/x86_64-linux/w.pry
Caffe mean file :
Nb sorties reseau : 3
Batch size : 1
Precision : FP 32
Nb custom layers : 1

Fichier /home/mvillmow/p4/mvillmow-tensorrt/sw/gpgpu/MachineLearning/DIT/release/5.1/build/x86_64-linux/weights.pry genere

Is this correct output?


Per engineering, this has been addressed in the next release of TensorRT. I can’t discuss the release schedule here, but please stay tuned for the announcement.



Yes, this is the correct output without any error message from TensorRT.

Thanks for your help, I will wait for the next release.


Hello Moderator,

Can you please share the .prototxt and .caffemodel files you used for the inference results you posted on 02/07/2019 10:52 PM?