Memory usage when loading unet for inference on jetson nano

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU) Jetson
• DeepStream Version 5.1
• JetPack Version (valid for Jetson only) 4.5.1
• TensorRT Version 7.1.3.0-1+cuda10.2
• NVIDIA GPU Driver Version (valid for GPU only)
• Issue Type (questions, new requirements, bugs) questions
• How to reproduce the issue ? (This is for bugs. Including which sample app is using, the configuration files content, the command line used and other details for reproducing)
• Requirement details( This is for new requirement. Including the module name-for which plugin or for which sample application, the function description)

Hi, I am trying to run inference with a custom UNet model on my Jetson Nano. I trained it with TLT, then exported it and created an engine file on the device. The engine file is around 37 MB. I use the deepstream-segmentation example from deepstream-python-apps to run a DeepStream pipeline with this model. When loading the engine, memory usage goes from 1.5 GB idle to 3.8 GB, so the system almost freezes. This happens before any actual inference takes place, during the model loading stage.

When I try dstest_segmentation_config_industrial.txt instead, memory consumption only goes up to 2.7 GB from the 1.5 GB idle. I checked the .engine file for this config and it is 25 MB. So I have two questions:

  1. Why would a model that weighs 37 MB almost cause OOM while a 25 MB one does not?
  2. Why is the memory consumed in the range of GBs while the model size is around 20-30 MB?

Would there be any comment on this? Our project development is currently stalled because we do not know how we should optimize the model to reach the performance stated in NVIDIA's benchmarks.
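For what it is worth, system-wide numbers (e.g. from `free` or tegrastats) also include cache, so it can help to measure the resident memory of the DeepStream process itself around the engine-load step. A minimal, Linux-only sketch; `rss_mib` is a hypothetical helper, not part of DeepStream:

```python
# Hypothetical helper: read this process's resident set size from /proc
# so the engine-load cost can be measured in isolation (Linux only).
def rss_mib() -> float:
    """Return the current resident set size of this process in MiB."""
    with open("/proc/self/status") as f:
        for line in f:
            if line.startswith("VmRSS:"):
                return int(line.split()[1]) / 1024.0  # value is in kB
    return 0.0

before = rss_mib()
# ... build the pipeline and load the engine here ...
after = rss_mib()
print(f"engine load added ~{after - before:.0f} MiB to this process")
```

Comparing this per-process delta against the system-wide growth shows how much of the jump is really attributable to the pipeline.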


The memory usage depends on the algorithms TensorRT selects when building the engine.
Moreover, loading the TensorRT/cuDNN libraries themselves also takes some memory (>600 MB).
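As a back-of-envelope illustration of why a ~37 MB engine file can cost GBs at run time: the file stores only the weights, while at run time TensorRT must also hold intermediate activation buffers (large for UNet-style models) and scratch workspace for its algorithms, on top of the library load cost above. All sizes below are illustrative assumptions, not measurements of this model:

```python
# Back-of-envelope estimate: engine file size vs. runtime memory.
# Every number here is an illustrative assumption, not a measurement.
MiB = 1024 * 1024

weights   = 37 * MiB    # roughly what the .engine file stores
libraries = 600 * MiB   # TensorRT/cuDNN load cost, loaded once

# UNet keeps large intermediate feature maps; a single fp32 buffer of
# 512x512 spatial size with 64 channels already costs:
act_one = 512 * 512 * 64 * 4       # bytes for one activation tensor
activations = 20 * act_one         # assume ~20 such buffers alive at once

workspace = 1024 * MiB             # scratch space for TensorRT tactics

total = weights + libraries + activations + workspace
print(f"one activation buffer: {act_one // MiB} MiB")   # 64 MiB
print(f"estimated total:       {total // MiB} MiB")     # 2941 MiB
```

Under these assumed numbers the activations and workspace, not the weights, dominate the footprint, which is also why a slightly larger engine can tip a 4 GB Nano toward OOM.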

You can create the engine file with a different workspace size to limit which algorithms TensorRT can use:

/usr/src/tensorrt/bin/trtexec --workspace=1024 ...
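One way to pick the `--workspace` value (trtexec takes it in MiB on TensorRT 7) is to budget against the Nano's 4 GB of shared memory, using the numbers reported in this thread as rough inputs. A sketch; the idle and headroom figures are assumptions:

```python
# Rough workspace budget for a 4 GB Jetson Nano, using the numbers
# reported in this thread (illustrative, not exact).
total_ram  = 4 * 1024   # MiB; the Nano shares 4 GB between CPU and GPU
idle_usage = 1536       # ~1.5 GB used before the pipeline starts
libraries  = 600        # TensorRT/cuDNN load cost mentioned above
headroom   = 512        # assumed margin for activations and the rest of the app

budget = total_ram - idle_usage - libraries - headroom
print(f"try --workspace={budget} or lower")  # prints: try --workspace=1448 or lower
```

A smaller workspace restricts TensorRT to less memory-hungry algorithms, so peak memory drops, possibly at some cost in throughput; it is worth trying a few values (e.g. 256, 512, 1024) and measuring both memory and FPS.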

