I have been training two models with TAO Toolkit, one with efficientnetB4 and one with efficientnetB5.
After following the steps and pruning both models, I generated a TensorRT engine for each one with DeepStream and loaded it on the Jetson Nano.
For the B4 model the speed is 14 fps and for the B5 model it is 9 fps. This seems very slow considering that the models have been pruned.
Do you know if there is any benchmark to validate these values? If these are the expected values, is there any way to optimize the models and improve the frame rate?
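In case it's useful, this is roughly how I could double-check the raw engine throughput outside of DeepStream with trtexec. It's only a sketch: the engine paths below are placeholders from my setup, and it assumes trtexec is at the usual JetPack location.

```python
# Sketch: time the serialized TensorRT engines outside DeepStream with trtexec.
# Engine paths are placeholders for my setup; adjust as needed.
import subprocess

ENGINES = {
    "efficientnet_b4": "/opt/models/effnet_b4.engine",  # hypothetical path
    "efficientnet_b5": "/opt/models/effnet_b5.engine",  # hypothetical path
}

for name, engine in ENGINES.items():
    # trtexec prints throughput and latency figures at the end of its log.
    result = subprocess.run(
        ["/usr/src/tensorrt/bin/trtexec",
         f"--loadEngine={engine}",
         "--iterations=200",
         "--warmUp=500"],
        capture_output=True, text=True, check=True,
    )
    print(f"=== {name} ===")
    # Keep only the summary lines with the timing numbers.
    for line in result.stdout.splitlines():
        if "Throughput" in line or "Latency" in line:
            print(line)
```

That would at least tell me whether the bottleneck is the engine itself or the rest of the DeepStream pipeline.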
Yes, I’m using both networks in classification mode.
We’re considering several options and we thought efficientnet would be the best for us. Right now the model I’ve used at 30 fps is B1, but I’d like to know whether it would be possible to use more complex architectures like B4 or B5 at a similar fps on the Nano.
Yes. I pruned both models with a threshold of 0.6 and then retrained them following your tutorials, with a few fewer epochs. As I said, with B1 I’m getting 30 fps and the pruning threshold is the same.
You can try several thresholds to prune more aggressively and then run retraining.
Usually the threshold is not the same for different backbones. After pruning, you can find the pruning ratio in the log. You can also find the new number of trainable parameters in the retraining log.
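If it helps, a loop like the one below could automate the threshold sweep. This is only a sketch: it assumes the tao launcher is installed and the classification prune entry point is used as in the TAO docs; the model path, output directory, key and threshold values are placeholders for your setup.

```python
# Sketch: prune the same .tlt model at several thresholds, then retrain each
# result before comparing accuracy and fps. Paths and key are placeholders.
import subprocess

MODEL = "/workspace/output/weights/efficientnet_b5.tlt"  # hypothetical path
OUTDIR = "/workspace/output_pruned"                      # hypothetical path
KEY = "nvidia_tlt"                                       # your encoding key

for pth in (0.5, 0.6, 0.68, 0.7):
    out = f"{OUTDIR}/efficientnet_b5_pruned_{pth}.tlt"
    subprocess.run(
        ["tao", "classification", "prune",
         "-m", MODEL,
         "-o", out,
         "-k", KEY,
         "-pth", str(pth),
         "-eq", "union"],
        check=True,
    )
    # The pruning ratio is reported in the command's log; each pruned model
    # still needs retraining before its accuracy/fps are meaningful.
    print(f"pruned with threshold {pth} -> {out}")
```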
Hi again and sorry for the delay. I’ve tried several thresholds on the B5 model, and with a 0.7 threshold the size of the .etlt file is about 6 MB. When I put it into the DeepStream application the frame rate is 15 fps, which is much better than the 9 fps we had with the 0.6 threshold. My question now is:
with a file of only 6 MB, is it normal to get only 15 fps? Isn’t that a really small file?