In the 04_video_dec_trt example, how do I print the class of a detected object in addition to its bounding box?
I know that resnet_three_class can detect cars, motorbikes, and people.
How do I get the bounding box for a person using the ONNX file provided as a sample?
I am having difficulty understanding the function that parses the output of the ONNX file provided as a sample.
file name: resnet10_dynamic_batch.onnx
In the resnet10 bounding-box parsing function:
Why is the value of bbox_norm 35.0?
What is the meaning of gc_centers_0 and gc_centers_1?
When computing the location of output_x1, I don't understand how the following calculation works.
Can you explain?
These parameters are used to map the output tensor into a bounding box.
ResNet10 is an internal customized model, so its architecture is not publicly available.
But it is very similar to DetectNet or YOLO, e.g. https://i.stack.imgur.com/aUcNf.jpg
First, the image is divided into a grid of size (grid_x, grid_y).
Then the bbox location can be calculated as an offset (e.g. output_x1) plus the grid center.
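As a minimal sketch of what gc_centers_0 and gc_centers_1 could hold (the stride, input size, and grid dimensions below are assumptions for illustration, not values read from the sample):

```python
# Sketch: precompute grid-cell centers for the bbox decode.
# STRIDE, GRID_X, GRID_Y and the implied 960x544 input are ASSUMED
# values for illustration, not taken from the sample code.
BBOX_NORM = 35.0
STRIDE = 16                  # assumed pixels per grid cell
GRID_X, GRID_Y = 60, 34      # assumed: 960/16 by 544/16

# Center of each grid cell in pixels, divided by bbox_norm so the
# centers live in the same scale as the raw outputs (output_x1, ...).
gc_centers_0 = [(i * STRIDE + 0.5) / BBOX_NORM for i in range(GRID_X)]
gc_centers_1 = [(j * STRIDE + 0.5) / BBOX_NORM for j in range(GRID_Y)]
```

With the centers stored in this scale, a raw offset and a grid center can be added directly and the sum rescaled to pixels in one multiplication.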
bbox_norm is a training parameter.
Since output_x1 and the grid center may not be in the same scale, bbox_norm is responsible for the transform.
That means 1.0 in output_x1 equals 35 in grid-center units, and 1 in grid-center units corresponds to 1 pixel, so 1.0 in output_x1 is 35 pixels.
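Putting the two statements together, decoding one grid cell could look like the sketch below. The function name, the sign convention for the left/top offsets, and the 16-pixel stride are assumptions for illustration, not the actual sample code:

```python
BBOX_NORM = 35.0  # 1.0 in the raw output corresponds to 35 pixels
STRIDE = 16       # assumed pixels per grid cell

def decode_cell(out_x1, out_y1, out_x2, out_y2, gx, gy):
    """Hypothetical decode of the raw offsets of grid cell (gx, gy)
    into pixel coordinates; a sketch of the idea, not the sample code."""
    cx = gx * STRIDE + 0.5   # grid center in pixels (1 unit = 1 pixel)
    cy = gy * STRIDE + 0.5
    # Scale each offset by bbox_norm (raw unit -> pixels) and apply it
    # around the center: left/top offsets grow leftward/upward,
    # right/bottom offsets the other way.
    x1 = cx - out_x1 * BBOX_NORM
    y1 = cy - out_y1 * BBOX_NORM
    x2 = cx + out_x2 * BBOX_NORM
    y2 = cy + out_y2 * BBOX_NORM
    return x1, y1, x2, y2
```

For example, zero offsets collapse the box onto the cell center, and a raw offset of 1.0 moves an edge 35 pixels away from it.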