• Hardware:
Razer Blade 15 RTX 3080 Ti
• Network Type:
Detectnet_v2 (DashcamNet)
• TLT Version:
nvcr.io/nvidia/tao/tao-toolkit-tf:v3.22.05-tf1.15.4-py3
Hi, I am trying to retrain DashCamNet with my custom dataset because I am seeing confidences around 0.5 and want to see if I can boost them.
However, with the retrained model I am seeing very low inference confidence, around 0.02.
First of all, my camera images have a resolution of 1920x1080.
I am using ros_deep_learning to run DashCamNet in a ROS environment.
I have a relatively small dataset ~ 500 images with a single car in the FOV. (I am not interested in other classes.)
The training images are 960x544 .jpg files.
image0000.txt (79 Bytes)
I am trying to confirm this works as intended before creating more data.
Let me walk through what I have done.
- I convert the dataset:
detectnet_v2 dataset_convert -d /tlt_ws/src/model/configs/kitti_config_960x544.txt -o /tlt_ws/src/model/converted_960x544/tf
kitti_config_960x544.txt (238 Bytes)
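For reference, my kitti_config_960x544.txt follows the standard DetectNet_v2 dataset_convert spec layout; a sketch with placeholder paths (not my exact attached file):

```
kitti_config {
  root_directory_path: "/tlt_ws/src/model/dataset_960x544"
  image_dir_name: "images"
  label_dir_name: "labels"
  image_extension: ".jpg"
  partition_mode: "random"
  num_partitions: 2
  val_split: 20
  num_shards: 10
}
image_directory_path: "/tlt_ws/src/model/dataset_960x544"
```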
- I train the pretrained DashCamNet:
detectnet_v2 train -e /tlt_ws/src/model/configs/training_config_960x544.txt -r ./demo -k tlt_encode --gpus 1
training_config_960x544.txt (5.4 KB)
(Unpruned DashCamNet Acquired: DashCamNet | NVIDIA NGC)
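The relevant parts of my training spec follow the standard DetectNet_v2 layout; a sketch with placeholder paths and values (not my exact 5.4 KB file):

```
augmentation_config {
  preprocessing {
    output_image_width: 960
    output_image_height: 544
    output_image_channel: 3
  }
}
model_config {
  pretrained_model_file: "/tlt_ws/src/model/pretrained/resnet18_dashcamnet.tlt"
  arch: "resnet"
  num_layers: 18
}
dataset_config {
  target_class_mapping {
    key: "car"
    value: "car"
  }
}
```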
- I evaluate the .tlt files and pick the one with the highest average precision (which is model.step-7700.tlt in my case)
detectnet_v2 evaluate -e demo/experiment_spec.txt -m ./demo/model.step-7700.tlt -k tlt_encode
experiment_spec.txt (6.2 KB)
Validation cost: 0.000251
Mean average_precision (in %): 22.7273
class name average precision (in %)
bicycle 0
car 90.9091
person 0
road_sign 0
Median Inference Time: 0.011957
- I prune the acquired .tlt while checking the average precision:
detectnet_v2 prune -m demo/model.step-7700.tlt -o ./demo_pruned/pruned08.tlt -pth 0.08 -k tlt_encode
-pth 0.08 seems to maintain the average precision while the file size drops from 46.5 MB to 8.7 MB.
If I evaluate the pruned .tlt again using
detectnet_v2 evaluate -e demo/experiment_spec.txt -m ./demo_pruned/pruned08.tlt -k tlt_encode
Validation cost: 0.000251
Mean average_precision (in %): 22.7273
class name average precision (in %)
bicycle 0
car 90.9091
person 0
road_sign 0
Median Inference Time: 0.006338
- I retrain the pruned .tlt
detectnet_v2 train -e ./configs/training_config_demo_960x544.txt -r ./demo_pruned -k tlt_encode --gpus 1
training_config_demo_960x544.txt (5.4 KB)
- I evaluate the retrained .tlt files and pick the one with the highest average precision:
detectnet_v2 evaluate -e demo/experiment_spec.txt -m ./demo_pruned/model.step-9900.tlt -k tlt_encode
Validation cost: 0.000094
Mean average_precision (in %): 21.3554
class name average precision (in %)
bicycle 0
car 85.4216
person 0
road_sign 0
Median Inference Time: 0.012592
- I export the .tlt file to .etlt
detectnet_v2 export -m demo_pruned/model.step-9900.tlt -k tlt_encode -e configs/training_config_pruned_960x544.txt -o export_960x544/pruned_retrained_960x544.etlt
- I convert .etlt to .engine
./tao-converter -d 3,544,960 -k tlt_encode -o output_cov/Sigmoid,output_bbox/BiasAdd pruned_retrained_960x544.etlt
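One thing I notice here: with no -t flag, tao-converter builds an FP32 engine by default, while the stock DashCamNet deployable on NGC ships with an INT8 calibration file. A sketch of both explicit variants (calibration.bin is a placeholder name, not something I have actually run):

```shell
# FP16 engine -- no calibration file needed (sketch, not yet tried)
./tao-converter -d 3,544,960 -k tlt_encode \
  -o output_cov/Sigmoid,output_bbox/BiasAdd \
  -t fp16 pruned_retrained_960x544.etlt

# INT8 engine -- needs a calibration cache produced during export
./tao-converter -d 3,544,960 -k tlt_encode \
  -o output_cov/Sigmoid,output_bbox/BiasAdd \
  -t int8 -c calibration.bin pruned_retrained_960x544.etlt
```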
- Inference result with a 0.01 threshold
This is confusing, because if I convert the vanilla DashCamNet to an .engine file and launch it with the roslaunch file, it works as intended.
However, if I switch to the custom-trained model, it does not work as intended, even though the average precision is high enough during evaluation.
I believe I am clearly doing something wrong.
Suspicious things:
- I did not specify data_type int8 anywhere?
- I did not use a calibration .tensor or .bin file anywhere?
- The resolution is off? If I want to run inference on 1920x1080 images, should I be training at 1920x1088 instead of 960x544, with the dataset at 1920x1080 resolution?
- The dataset is too small? My goal is to keep the original inference quality of DashCamNet while adapting it to my specific environment, but could too small a dataset ruin both?
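On the resolution point, one concrete detail: DetectNet_v2 input dimensions must be multiples of 16, which is why 1920x1080 would round up to 1920x1088. A quick check of that rounding rule in plain Python (nothing TAO-specific):

```python
def valid_input_dim(x: int, multiple: int = 16) -> int:
    """Round a dimension up to the nearest multiple (DetectNet_v2 requires multiples of 16)."""
    return ((x + multiple - 1) // multiple) * multiple

# Camera resolution 1920x1080 -> nearest valid model input
print(valid_input_dim(1920), valid_input_dim(1080))  # 1920 1088

# The 960x544 training resolution is already valid
print(valid_input_dim(960), valid_input_dim(544))    # 960 544
```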
This might be too wordy, but I tried to be thorough.
Please feel free to let me know if you think anything, however random, might be a factor.
Thanks for the attention!