Hi! I have a large NN-inference program that runs on Jetson. Unfortunately there’s not enough memory for it to operate and I need to optimize it somehow. Are there well-known techniques and advice to reduce memory consumption on a Jetson device? Thanks!
More info: TensorRT, Cuda 10.2, L4T 32.4.4, Tegra194
How much memory are you missing? Which Jetson device? (They each have different amounts of RAM.)
If you’re missing a dozen megabytes, then turning off some daemons you might not need (Wi-Fi or Bluetooth or whatever) might help, but if you’re off by gigabytes, then chances are there isn’t enough left to remove – the Ubuntu overhead really isn’t all that large, even once you include all the things you DO need that it provides.
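For example (assuming a stock JetPack/L4T image with the desktop enabled – exact service names and savings vary by image), booting to a text console instead of the GUI is a common way to claw back a couple hundred megabytes:

```shell
# Free RAM by not starting the desktop; revert with graphical.target.
sudo systemctl set-default multi-user.target
sudo reboot

# Smaller wins: disable individual services you don't need, e.g.
sudo systemctl disable bluetooth
```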
If you’re asking how to make a model smaller, then reducing the layer sizes and the number of classes the model detects is typically how one does that. You can also look into the approaches MobileNet took to create a smaller network, and maybe apply something like that to your own model.
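As a rough illustration of why the MobileNet approach helps (the layer sizes here are made up), factoring a standard convolution into a depthwise plus a pointwise step cuts the weight count dramatically:

```python
# Weight count of a standard 3x3 convolution vs. the depthwise-separable
# factorization used by MobileNet (weights only, biases ignored).
# The channel counts below are hypothetical, just for illustration.
k, c_in, c_out = 3, 64, 128

standard = k * k * c_in * c_out   # one dense k x k convolution
depthwise = k * k * c_in          # one k x k filter per input channel
pointwise = c_in * c_out          # 1x1 convolution to mix channels
separable = depthwise + pointwise

print(standard)                        # 73728
print(separable)                       # 8768
print(round(standard / separable, 1))  # 8.4x fewer weights
```

Activations dominate memory for some networks, so the saving at runtime is smaller than the weight ratio suggests, but it goes in the right direction.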
Thank you for the answer!
I’m missing around 200–400 MB, out of 8 GB total on the device. The OS has already been slimmed down.
The application was written for the regular CPU + discrete GPU scheme and thus incurs a lot of copying back and forth. Maybe I should dig in that direction…
That sounds quite fruitful! It might also make for a snappier application overall. Fewer copies are almost always better :-)
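One reason this pays off on Jetson in particular: the CPU and GPU share the same physical DRAM, so explicit host↔device copies cost time *and* double the buffer footprint. Mapped "zero-copy" allocations (in CUDA, `cudaHostAlloc` with `cudaHostAllocMapped`, or unified memory) let both sides use one buffer. A toy sketch of the pattern change, with plain numpy arrays standing in for host and device memory:

```python
import numpy as np

def infer(x):
    # Stand-in for the GPU inference kernel.
    return x * 2.0

# Discrete-GPU pattern: separate host and device buffers, two copies per frame.
host_in = np.ones(4, dtype=np.float32)
dev_in = host_in.copy()     # cudaMemcpy host -> device
dev_out = infer(dev_in)
host_out = dev_out.copy()   # cudaMemcpy device -> host

# Jetson zero-copy pattern: one shared buffer, no copies, no duplicate
# allocation (in real code this buffer would come from cudaHostAlloc).
shared = np.ones(4, dtype=np.float32)
out = infer(shared)         # GPU reads the same physical memory

assert np.array_equal(host_out, out)  # same result, half the buffers
```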
Which model do you use?
In our latest TensorRT v8.0, we provide an option to run inference without loading cuDNN.
This can save a lot of memory, but only a limited set of layers is currently supported.
It’s recommended to give it a try. The package is integrated in JetPack 4.6.
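Presumably the option being described is TensorRT's tactic-sources setting, which controls which tactic libraries the builder may use. A minimal sketch, assuming the TensorRT 8.x Python API (engine/network construction omitted):

```python
import tensorrt as trt  # requires TensorRT >= 8.0 (JetPack 4.6)

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
config = builder.create_builder_config()

# Enable only the cuBLAS/cuBLASLt tactic sources. With cuDNN excluded,
# the runtime never loads libcudnn, saving its memory footprint; layers
# that only have a cuDNN implementation will fail to build.
config.set_tactic_sources(
    (1 << int(trt.TacticSource.CUBLAS)) | (1 << int(trt.TacticSource.CUBLAS_LT))
)
```

With `trtexec` the equivalent should be the `--tacticSources=-CUDNN` flag.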
We use TX2 and Nano, and optimization is needed for both. cuDNN is used for inference. Thanks!
Do you use TensorRT for inference or other frameworks (e.g. TensorFlow)?
If TensorRT is used, you can try running the model with that cuDNN-free option.
The memory usage of MNIST decreases from 832.742 MB to 527.469 MB on JetPack 4.6.
Thank you! Does it reduce performance?
We don’t see a performance regression on the MNIST model.
Please note that this feature was only introduced in TensorRT v8.0, so the set of supported layers is still limited.