How to debug gstnvinfer with custom model？

529683504 · April 16, 2021, 10:02am

Please provide complete information as applicable to your setup.
Hi，
I am trying to build a simple pipeline ( appsrc—> gst-nvinfer(detector)—>fakesink) using an custom model (SSH) I had generated the trt engine file and it can do inferernce correctly base on trt inference API. But when i added this model to the pipeline ,I found the result is huge differ from trt inference API.I checked the preprocess in gst-nvinfer(fetch the result of NvBufSurfTransform in gstnvinfer.cpp) ,its normally. I also checked the properties about preprocess and I did not find error.

MY question is: Is there a way to debug? or good orientation for me to locate error? what makes the difference of result between deepstream and trt ?
• Hardware Platform (Jetson / GPU)
• DeepStream Version
• JetPack Version (valid for Jetson only)
• TensorRT Version
• NVIDIA GPU Driver Version (valid for GPU only)
• Issue Type( questions, new requirements, bugs)
• How to reproduce the issue ? (This is for bugs. Including which sample app is using, the configuration files content, the command line used and other details for reproducing)
• Requirement details( This is for new requirement. Including the module name-for which plugin or for which sample application, the function description)

Fiona.Chen · April 19, 2021, 2:30am

How could your pipeline ( appsrc—> gst-nvinfer(detector)—>fakesink) without nvstreammux run? Can you eleborate more details about your pipeline and application?

529683504 · April 20, 2021, 1:19am

hi Fiona：

sorry for my typo，it‘s appsrc—> streammux—>gst-nvinfer(detector)—>fakesink)

529683504 · April 20, 2021, 1:41am

my application is for image inferance, we use a face detector to detect face and draw the bbox on the image. appsrc use to read and decode image then send the decoded image to streammux.
we also used customed postprocess to get bbox, but the keypoint is here: the result in outputlayerinfo is far away from the result from our result from tensorRT inference.

as I mentioned before I have examined the result after NvBufSurfTransform (in gst-nvinfer.cpp) it looks normaly. I also checked the parameters in my model_config_file.But did not find any unusual or differ with our tensorRT inference code.

So I didn’t have good way to do debug temporarily.Do you have some advice or experience for us to reference?

Fiona.Chen · April 20, 2021, 7:37am

You may refer to DeepStream SDK FAQ - Intelligent Video Analytics / DeepStream SDK - NVIDIA Developer Forums, the item 2. [DS5.0GA_Jetson_GPU_Plugin] Dump the Inference Input can dump the input data of tensorRT for you to compare.

529683504 · April 20, 2021, 7:53am

ok，let me check

529683504 · April 22, 2021, 2:02am

hi Fiona，
thanks for your patch！

this is the original image:

this is the result from our tensorRT inference code

this is the result of NvBufSurfTransform in gstnvinfer:

this the image before input:

it looks abnormal but it not the root cause because when I send this image to our tensorRT inference code as input, it can get normal results:

so,have any ideas?

Fiona.Chen · April 22, 2021, 12:32pm

What is the features of the model? Please describe the input layers and output layers, the pre-processing needed, the nvinfer config file you used. It is better to provide the model too.

529683504 · April 23, 2021, 6:12am

Hi Fiona,
the data you needed can download here: link : 百度网盘-链接不存在
key: uxaw

here are some description:
1.our model is similar like: SSH/scripts at master · mahyarnajibi/SSH · GitHub
an detector use to detect face . we removed the m3-detect head to improve the FPS.
ssh_vgg/vgg16_ssh.caffemodel ssh_vgg/vgg16_ssh.prototxt is the original caffe model. the ssh_vgg.engine is converted by us and the gpu we used is gtx1070. and the ssh_pgie_config.txt is the deepstream pgie config file. the input layer named data (3,540,960) output tensor:ssh_cls_prob (2000,1) for confidence score, ssh_boxes(2000,5) for bounding box , and ssh_boxes[:,1:5] is the location of box :[xmin,ymin,xmax,ymax]( Absolute Coordinate)

because this model has an TRT unspported layer(ssh_proposal) we write an plugin using TRT IpluginV2Ext Api . the lib is in ssh_vgg\modelTRT\modelTRT_custom_plugin\lib
source code:ssh_vgg\modelTRT\modelTRT_custom_plugin\plugins

3.the code of convert tool is in ssh_vgg\modelTRT\modelTRT_custom_plugin

we use TRT python API to test the engine,code is here: ssh_vgg\modelTRT
you can run it for trt test: python3 inter_main.py -c [config json file] -a [author name] -m “test” -s [the plugin lib path] (need to modify infer_sample_ssh_vgg.json

5.the preprocess is in ssh_vgg\modelTRT\preprocess.py function: resize_normalization_preprocess(aka resize_normal_pre), its very simple, just do image resize->substract the mean values-> dimension transpose(from hwc(cv::Mat dimention format) to chw)

Fiona.Chen · April 23, 2021, 6:15am

You nvinfer config file and deepstream pipeline details?

529683504 · April 23, 2021, 6:29am

6 the ssh_pgie_config.txt is the nvinfer config file.
the content is here:

[property]
gpu-id=0
net-scale-factor=1
offsets=102.9801;115.9465;122.7717
model-engine-file=…/…/…/models/ssh/ssh_vgg.engine
labelfile-path=…/…/…/models/ssh/labels.txt
force-implicit-batch-dim=1
batch-size=1
network-mode=0
num-detected-classes=2
interval=0
gie-unique-id=10
output-blob-names=ssh_boxes;ssh_cls_prob
parse-bbox-func-name=parseSSHBox
custom-lib-path=./libprocesslib.so
model-color-format=1
process-mode=1
network-type=0
maintain-aspect-ratio=1
infer-dims=3;540;960
cluster-mode=4
scaling-filter=1

[class-attrs-0]
pre-cluster-threshold=0.6
[class-attrs-1]
pre-cluster-threshold=0.3

7.deepstream pipeline is simple too just appsrc—>streammux—>pgie—>fakesink
appsrc for sending image, streammux params:

529683504 · April 30, 2021, 9:40am

any updates?

Fiona.Chen · May 6, 2021, 2:40am

Can you send us your model?

529683504 · May 6, 2021, 2:43am

the model in this zip file

529683504 · May 10, 2021, 2:15am

any update?

Fiona.Chen · June 10, 2021, 7:38am

I’ve tried with your model (vgg16_ssh.caffemodel and vgg16_ssh.prototxt), it can not be parsed by deepstream internal parser.

529683504 · June 10, 2021, 8:11am

please read my comment CAREFULLY

Fiona.Chen · June 10, 2021, 8:33am

When make modelTRT_custom_plugin, met error:

CMake Error at plugins/CMakeLists.txt:16 (find_package):
By not providing “Findglog.cmake” in CMAKE_MODULE_PATH this project has
asked CMake to find a package configuration file provided by “glog”, but
CMake did not find one.

Could not find a package configuration file provided by “glog” with any of
the following names:

glogConfig.cmake
glog-config.cmake

Add the installation prefix of “glog” to CMAKE_PREFIX_PATH or set
“glog_DIR” to a directory containing one of the above files. If “glog”
provides a separate development package or SDK, be sure it has been
installed.

Fiona.Chen · June 10, 2021, 8:35am

Can you provide the following information?
• Hardware Platform (Jetson / GPU)
• DeepStream Version
• JetPack Version (valid for Jetson only)
• TensorRT Version
• NVIDIA GPU Driver Version (valid for GPU only)

Fiona.Chen · June 10, 2021, 8:37am

If there is unsupported layer in your network, you can use the IPlugin interface in deepstream.Using a Custom Model with DeepStream — DeepStream 6.1.1 Release documentation

Topic		Replies	Views
Issues running custom model in deepstream DeepStream SDK	1	547	February 17, 2021
0x55a9170640 ERROR nvinfer gstnvinfer.cpp:511:gst_nvinfer_logger:<primary-nvinference-engine> NvDsInferContext[UID 1]:log(): UffParser: Could not read buffer. DeepStream SDK	15	4028	June 8, 2021
Custom detection ONNX model gives wrong outputs using nvinfer with DeepStream 5.1 DeepStream SDK	16	3135	September 27, 2021
Using custom model in deepstream DeepStream SDK jetson-inference , python , deepstream	39	807	September 10, 2024
ERROR nvinfer gstnvinfer.cpp:632:gst_nvinfer_logger:<primary-inference> NvDsInferContext[UID 1]: Error in NvDsInferContextImpl::parseBoundingBox() TAO Toolkit	4	4805	December 16, 2021
yolo custom model file and error from primary-nvinference-engine DeepStream SDK	14	3111	December 3, 2019
Custom Model applied on Deepstream TensorRT tensorrt , ubuntu , jetson-inference , gstreamer	2	567	December 30, 2022
Parsing custom tensorflow model DeepStream SDK	30	1039	September 4, 2023
NvDsInferLayerInfo not giving expected no. of outputs DeepStream SDK	59	2756	July 31, 2020
Converting Custom RetinaNet model to TensorRT in DeepStream DeepStream SDK tensorrt , neural-network-framework , jetson , deepstream , net	28	482	January 21, 2025

How to debug gstnvinfer with custom model？

Related topics