Jetson-inference yolov4-tiny

3629701 · May 15, 2023, 11:10am

hi,
I train a YOLOv4-tiny 288x288 model with helipad data using darknet and convert the trt model using tensorrrt_demos github and compare it with jetson-inference SSD-Mobilnet v2.

Since jetson-inference is written in C/C++ when loading the model with the drone script (using MAVSDK-python) it works very well at up to 44 fps.

However, the YOLOv4-tiny-288 model drops 1/3 fps whenever there is drone action. So here’s a question.
I know that jetson-inference doesn’t support yolo, but it produces results up to 70fps (Amazing!!!) as shown in the image below, and I couldn’t overlook that!!!

The YOLO model has three labels: helipad, Person, and Vehicle. Looking at the terminal output, there are many more ClassIDs. Why?

I think the difference is in the output part (OUTPUTS : Dimension is different).
Comparing SSD-Mobilenet v2 and YOLOv4-tiny-288.onnx using a site called Netron, the output is different.

I think you need to convert the values.
Any good ideas for this??
thank you:)

dusty_nv · May 15, 2023, 1:33pm

Hi @3629701, yes I believe you are correct that YOLO having different output tensor format is leading jetson-inference to think that there are more classes in the model than there actually are. For ONNX models, the pre/post-processing in jetson-inference detectNet object is setup for SSD models from train_ssd.py. This is where it gets the number of classes from the dimensions of the output tensor:

github.com

dusty-nv/jetson-inference/blob/31b35c46205773bc2377bdc37e9b0bcb929968d9/c/detectNet.cpp#L407


      
          

          // allocDetections

          bool detectNet::allocDetections()

          {

          	// determine max detections

          	if( IsModelType(MODEL_UFF) )	// TODO:  fixme

          	{

          		LogInfo(LOG_TRT "W = %u  H = %u  C = %u\n", DIMS_W(mOutputs[OUTPUT_UFF].dims), DIMS_H(mOutputs[OUTPUT_UFF].dims), DIMS_C(mOutputs[OUTPUT_UFF].dims));

          		mMaxDetections = DIMS_H(mOutputs[OUTPUT_UFF].dims) * DIMS_C(mOutputs[OUTPUT_UFF].dims);

          	}

          	else if( IsModelType(MODEL_ONNX) )

          	{

          		mNumClasses = DIMS_H(mOutputs[OUTPUT_CONF].dims);

          		mMaxDetections = DIMS_C(mOutputs[OUTPUT_CONF].dims) /** mNumClasses*/;

          		LogInfo(LOG_TRT "detectNet -- number of object classes: %u\n", mNumClasses);

          	}	

          	else

          	{

          		mNumClasses = DIMS_C(mOutputs[OUTPUT_CVG].dims);

          		mMaxDetections = DIMS_W(mOutputs[OUTPUT_CVG].dims) * DIMS_H(mOutputs[OUTPUT_CVG].dims) * mNumClasses;

          		LogInfo(LOG_TRT "detectNet -- number of object classes: %u\n", mNumClasses);

In addition to modifying that, you would need to adapt the bounding box clustering code to correctly interpret the output format of YOLO model: https://github.com/dusty-nv/jetson-inference/blob/31b35c46205773bc2377bdc37e9b0bcb929968d9/c/detectNet.cpp#L688

3629701 · May 19, 2023, 1:02pm

hi,
The onnx model has been changed to have the same ouput as possible with the existing model.

I confirmed that one big difference is the boxes type of OUTPUTS.
The old boxes are [1,3000,4] , and the current model is [1,1215,1,4] .
The first value of 1 is batch_size, the second value of 3000 is the maximum number of bounding boxes, and the last value is boxes (left, top, right, bottom).
Is my understanding correct?

So can you tell me exactly where I need to modify in detectNet.cpp here?

I need your help.

thank you:)
yolov4_1_3_288_288_static.onnx (22.5 MB)

dusty_nv · May 19, 2023, 5:34pm

@3629701 I don’t think it’s realistic to attempt to alter the network topology of YOLO model to match SSD-Mobilenet output and expect the model to still produce valid results or be as accurate. And even if the output tensor dimensions match, they might still carry different data.

Instead, you would need to re-implement the YOLOv4 post-processing code in detectNet.cpp. Alternatively, it may just be easier to run your YOLOv4 model in a project that uses TensorRT Python API directly like this tutorial: https://jkjung-avt.github.io/tensorrt-yolov4/

system · June 14, 2023, 2:39am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Low fps when doing object detection on jetson nano Jetson Nano jetson-inference	19	8880	March 1, 2022
Python wrapper for tensorrt implementation of Yolo (currently v2) Jetson Nano	32	8006	July 2, 2020
Yolov3 is very slow Jetson Nano	21	20217	October 14, 2021
Run YoloV8 with Jetson Inference on Jetson Nano Jetson Nano yolo	3	5866	March 29, 2023
Tiny Yolo v3 in Python for Jetson Nano Jetson Nano	10	3151	March 19, 2020
Yolov6 Slow inference speed on the Nvidia Jetson NX board Jetson Xavier NX yolo	8	1610	August 24, 2022
run yolov3-tiny with tensorRT model Jetson Nano	7	3394	January 4, 2020
YoloV3 in pretrained Detection Models Available Jetson Nano	4	2398	October 14, 2021
YoloV4 with OpenCV Jetson Nano yolo	12	6159	October 15, 2021
Custom YoloV4 Tiny Model with DeepStream DeepStream SDK tensorrt , yolo , onnx	2	1253	October 12, 2021

Jetson-inference yolov4-tiny

Related topics