How to achieve a good performance of MaskRCNN on Jetson Nano

Hello, I’m using TLT with MaskRCNN to train a lane detection model, and I’d like to run this model on the Jetson Nano platform. However, I’m getting very low FPS (the best result was 2 FPS). I followed this tutorial to understand the training process: https://developer.nvidia.com/blog/training-instance-segmentation-models-using-maskrcnn-on-the-transfer-learning-toolkit/

I tried different backbone depths (resnet10, resnet18, and resnet50) and different resolutions, but the performance is still low.
I would like to know if there are other parameters in the spec file that could help me improve the performance of the MaskRCNN model on Jetson Nano. My spec file looks very much like the one in the link mentioned above.

I’m also using DeepStream SDK for inference. Any help would be great. Thanks in advance.

Hi @fredericolms,
Which image_size did you set in the training spec?
If you were using the value mentioned in https://developer.nvidia.com/blog/training-instance-segmentation-models-using-maskrcnn-on-the-transfer-learning-toolkit/, see its “Figure 4. Performance of the Mask R-CNN model with DeepStream”; the FPS reported there seems to match your result.
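For reference, the DeepStream-side settings that usually matter most on Nano are running the engine in FP16 (network-mode=2) and skipping inference on some frames via the nvinfer interval property. A minimal sketch of the relevant lines in a hypothetical nvinfer config (the file paths and model key below are placeholders, not taken from this thread):

```ini
[property]
# Placeholder path and key - replace with your own exported model
tlt-encoded-model=./models/mask_rcnn_lane.etlt
tlt-model-key=<your_ngc_key>
# 0=FP32, 1=INT8, 2=FP16; FP16 is the practical choice on Nano
network-mode=2
# Skip 1 frame between inferences (i.e., infer on every 2nd frame)
interval=1
```

Note that interval trades per-frame detection freshness for throughput, so whether it is acceptable depends on how quickly the lanes change in your video.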

Hello, thanks for the reply! I tried different image_size values. For the 2 FPS result I mentioned, I used an image_size of 256x256 with resnet50 as the backbone. I also used an image_size of 1344x832 (as in the tutorial link above) and the performance was 0.6 FPS on average. That made me think there might be a parameter or something to change in the spec file that would help improve performance. I have two questions:

1 - As you mentioned, Figure 4 shows the FPS on Jetson Nano. But the model trained in that tutorial has 91 classes, while I am training only one class, the “lane” class. Should I expect a difference in performance between a MaskRCNN model trained with 91 classes and one trained with a single class (which is my case)?

2 - If this is how the Jetson Nano performs with MaskRCNN models, is there a more lightweight segmentation model that I can train and still use DeepStream as my “platform” for inference?

Thank you in advance.

On your Nano, have you set max power mode and boosted the clocks?

$ sudo nvpmodel -m 0

$ jetson_clocks

Yes, I got those results while in max power mode and with jetson_clocks running.

Then I suggest you test the Mask R-CNN model mentioned in the blog. It is trained on one class.

The configuration file and label file for the model are provided in the SDK. These files can be used with the generated model as well as your own trained model. A sample Mask R-CNN model trained on a one-class dataset is provided on GitHub: NVIDIA-AI-IOT/deepstream_tao_apps - Sample apps to demonstrate how to deploy models trained with TAO on DeepStream

cd deepstream_tlt_apps/
wget https://nvidia.box.com/shared/static/8k0zpe9gq837wsr0acoy4oh3fdf476gq.zip -O models.zip
unzip models.zip
rm models.zip

Thank you for all the help, @Morganh! I tested this model on my Jetson Nano and got 0.53 FPS on average. So it seems this is the expected performance of MaskRCNN on Jetson Nano, right? Or am I doing something wrong here?
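In case it is useful to anyone reproducing this: the average here is just the reciprocal of the mean per-frame processing time. A tiny sketch (the frame times below are illustrative numbers chosen to land near ~0.53 FPS, not an actual log):

```python
# Average FPS from per-frame processing times.
# Illustrative latencies only (seconds per frame), not a real measurement log.
frame_times = [1.9, 1.85, 1.95, 1.88]

avg_time = sum(frame_times) / len(frame_times)   # mean seconds per frame
avg_fps = 1.0 / avg_time                         # FPS = 1 / latency
print(round(avg_fps, 2))  # → 0.53
```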

I’m afraid your result is similar to Figure 4.

Ok, thank you for the support!