TRT engine - peculiar behaviour

fasmatikos · September 29, 2020, 10:41am

Hi, I am building a TRT engine for a custom SSD-MobileNet-v1 model trained in Caffe (using the TRT Python API). The script I am using builds the engine, serializes and saves it to a *.bin/.engine file, and then, immediately after the build, runs inference on an image stored on the Nano…

The engine builds and executes successfully (returning the correct bbox). However, when, in a separate run, I skip the build step and just load and deserialize the engine from the .bin/.engine file created earlier, I get weird results: several bboxes, each in a fixed position regardless of the input image… This also happens with another custom Caffe SqueezeNet-SSD (!)… I have also used the same script (with appropriate modifications) to build and run a custom Caffe ResNet-50 model, and there everything works as expected (both build-and-run and load-and-run)…

I do not understand why the SSD models run fine in build-and-run mode but not when they are loaded and then executed…
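
One way to narrow this down is to rule out the file round trip first. The sketch below is only an illustration: the helper name `save_and_verify` is hypothetical, and it assumes the serialized engine is available as a bytes-like object (for example the result of `engine.serialize()` in the TensorRT Python API, assuming it supports the buffer protocol). It writes the bytes to disk and checks that they read back unchanged.

```python
import hashlib
from pathlib import Path


def sha256_of(data) -> str:
    """Hex SHA-256 digest of a bytes-like object."""
    return hashlib.sha256(data).hexdigest()


def save_and_verify(serialized, engine_path: str) -> bool:
    """Write serialized engine bytes to disk and confirm they read back unchanged.

    `serialized` is assumed to be bytes-like; this helper is hypothetical and
    not part of TensorRT.
    """
    path = Path(engine_path)
    path.write_bytes(serialized)
    return sha256_of(path.read_bytes()) == sha256_of(serialized)
```

If this returns True, the saved file is a faithful copy of what the builder produced, so the difference between the two runs would have to come from deserialization or from the inference setup rather than from file I/O.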

AastaLLL · September 30, 2020, 2:35am

Hi,

A possible cause is that some parameter is not initialized correctly in deserialization mode.
Would you mind sharing your source so we can check it further?

Here is a good example of deserializing a TensorRT model:

github.com/dusty-nv/jetson-inference/blob/master/c/tensorNet.cpp#L1093

    }
#if NV_TENSORRT_MAJOR < 5
    else if( model_fmt == MODEL_ONNX )
    {
        LogError(LOG_TRT "importing ONNX models is not supported in TensorRT %u.%u (version >= 5.0 required)\n", NV_TENSORRT_MAJOR, NV_TENSORRT_MINOR);
        return false;
    }
    else if( model_fmt == MODEL_UFF )
    {
        LogError(LOG_TRT "importing UFF models is not supported in TensorRT %u.%u (version >= 5.0 required)\n", NV_TENSORRT_MAJOR, NV_TENSORRT_MINOR);
        return false;
    }
#endif
    else if( model_fmt == MODEL_CAFFE && !prototxt_path_ )
    {
        LogError(LOG_TRT "attempted to load caffe model without specifying prototxt file\n");
        return false;
    }
    else if( model_fmt == MODEL_ENGINE )
    {
        if( !LoadEngine(model_path.c_str(), input_blobs, output_blobs, NULL, device, stream) )

Thanks.

fasmatikos · September 30, 2020, 8:24am

Hi, thanks for the response…
The script is rather basic and follows the NVIDIA API guidelines: python_test_TRT_CaffeSSD.txt (7.0 KB)

fasmatikos · September 30, 2020, 10:11am

Do I also need to load some plugin factory in Python? I saw the extra “pluginFactory” argument in your post above…
nvinfer1::ICudaEngine* engine = infer->deserializeCudaEngine(engine_stream, engine_size, pluginFactory);
and here
Runtime — tensorrt 7.2.0.9 documentation (nvidia.com)

How can I do this? Is this only for the PriorBox plugin?
BTW, I am using JetPack 4.4 [L4T 32.4.3] on a Nano with TRT 7.1.3.0, CUDA v10.0.89 and OpenCV v3.4.8.
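
For reference, on the Python side TensorRT's bundled plugins are usually made available through the plugin registry via `trt.init_libnvinfer_plugins`, rather than through a plugin factory object. The following is a minimal sketch under that assumption, not a confirmed fix for this thread; the function name `load_engine_with_plugins` and the logger severity are illustrative choices.

```python
def load_engine_with_plugins(engine_file):
    """Register TensorRT's bundled plugins, then deserialize an engine file.

    Hypothetical helper; assumes the TensorRT 7.x Python API is installed.
    """
    # Imported lazily so this module can be loaded on machines without TensorRT.
    import tensorrt as trt

    logger = trt.Logger(trt.Logger.WARNING)  # severity is an arbitrary choice
    # Register the plugins shipped with TensorRT under the default ("") namespace.
    trt.init_libnvinfer_plugins(logger, "")
    with open(engine_file, "rb") as f, trt.Runtime(logger) as runtime:
        return runtime.deserialize_cuda_engine(f.read())
```

The registration call has to happen before deserialization, so that any plugin layers stored in the engine can be resolved.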

AastaLLL · October 6, 2020, 6:17am

Hi,

pluginFactory is only required when the model uses a customized layer for inference.
Based on your source, the model should work fine without setting it.

Could you first check whether the TensorRT engine is deserialized correctly?

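import tensorrt as trt
TRT_LOGGER = trt.Logger(trt.Logger.WARNING)  # severity level is an assumption

def load_engine(engine_file):
    # Read the serialized engine from disk and deserialize it with a TensorRT runtime.
    with open(engine_file, 'rb') as f, trt.Runtime(TRT_LOGGER) as runtime:
        engine = runtime.deserialize_cuda_engine(f.read())
        if engine is None:
            print("deserialize fails")
        return engine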
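
Beyond the `None` check, it can help to print what the deserialized engine reports about its inputs and outputs and compare that with the freshly built engine. The sketch below assumes the TensorRT 7.x binding API (`num_bindings`, `get_binding_name`, `binding_is_input`, `get_binding_shape`); the helper name `describe_bindings` is hypothetical.

```python
def describe_bindings(engine):
    """Return (name, is_input, shape) for every binding of an engine.

    `engine` is expected to expose the TensorRT 7.x binding API; this helper
    is an illustration and not part of TensorRT.
    """
    info = []
    for i in range(engine.num_bindings):
        info.append((
            engine.get_binding_name(i),
            bool(engine.binding_is_input(i)),
            tuple(engine.get_binding_shape(i)),
        ))
    return info
```

Calling `describe_bindings(load_engine(path))` and `describe_bindings(built_engine)` should give identical lists; a difference in names, order, or shapes would point at the buffer allocation code rather than at the engine itself.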

If the engine is good, please share the Caffe model with us so we can check it in more depth.

Thanks.

fasmatikos · October 6, 2020, 10:46am

Thanks for the response! I can confirm that there is no error when the engine is loaded and deserialized…

The same weird behaviour is also present when I build/load/run the engine with the default MobileNet-SSD network from GitHub (chuanqi305/MobileNet-SSD: Caffe implementation of Google MobileNet SSD detection network, with pretrained weights on VOC0712 and mAP=0.727), with the default caffemodel and a slightly altered prototxt: (a) the Flatten layers are changed to Reshape layers, including their params, and (b) a “keep_count” output has been added right after “detection_out”. Again, when I build the engine and immediately execute it, everything runs smoothly, but when I serialize/save the engine and then load it, things get really messy…

Here you can find a zip file containing:

the Python script I use,
the .prototxt and the .caffemodel,
the test image and the two images with detection results (one from build/run, one from load/run),
the engine file (.bin) I built

Thanks, and I hope we can figure it out!
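
To make the discrepancy between the two modes concrete, the raw output buffers of both runs can be compared on the same input image. This is a minimal sketch assuming each output has been copied back to the host as a flat sequence of floats; the function name `outputs_match` and the tolerances are illustrative.

```python
import math


def outputs_match(a, b, rel_tol=1e-3, abs_tol=1e-5):
    """True if two flat output buffers have equal length and element-wise close values.

    Hypothetical helper; `a` and `b` are flat sequences of floats, e.g. the
    detection output of the build-and-run engine and of the loaded engine.
    """
    if len(a) != len(b):
        return False
    return all(
        math.isclose(x, y, rel_tol=rel_tol, abs_tol=abs_tol)
        for x, y in zip(a, b)
    )
```

Running the loaded engine on two different images and getting `outputs_match(...) == True` would confirm the "fixed positions regardless of input" symptom numerically.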

AastaLLL · October 7, 2020, 3:37am

Thanks for the data.

Will update later.

AastaLLL · October 7, 2020, 6:31am

Hi,

Thanks for sharing the detailed source to reproduce this.

We confirmed that the same issue also occurs in our environment.
The problem has been passed to our internal team.

We will keep you updated once we get feedback.
Thanks.

vassalos · October 15, 2020, 9:09am

Hi @AastaLLL,
Is there any update regarding the issue ?

B.R.

AastaLLL · October 19, 2020, 4:08am

Hi,

We confirmed that there is an issue in our serializer and deserializer.
The detailed root cause is still being investigated.

Thanks.

AastaLLL · November 2, 2020, 6:04am

Hi,

This issue is fixed in our internal branch.
The fix will be available in our future release.

Thanks.

vassalos · November 10, 2020, 1:17pm

thank you for your support!
