Description
I am trying to run inference with a SAMNet model (GitHub - yun-liu/FastSaliency: Code for "SAMNet: Stereoscopically Attentive Multi-scale Network for Lightweight Salient Object Detection" and "Lightweight Salient Object Detection via Hierarchical Visual Perception Learning") converted to a TensorRT engine. On my desktop, with TensorRT 8.6.1 and CUDA 12.0, running my code shows about 0.4 GB of memory usage in htop and 87 MB in nvidia-smi. When I run the same code on the Jetson TX2 NX (with the model converted on the TX2), with TensorRT 8.2.1 and CUDA 10.2, top reports about 1.6 GB of memory used.
Is this the expected behavior? I have at most about 1 GB available to run the model in production; is there a way to reduce the memory usage? This is my first time using TensorRT and CUDA, so am I doing something wrong in my code?
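For context, my code follows the standard deserialize-and-execute pattern; below is a trimmed sketch of that flow, not the exact attached code (the engine path "samnet.trt" is a placeholder and the buffer handling is omitted; the full source is in samnetTest.tar.gz).

```cpp
#include <NvInfer.h>
#include <cstdio>
#include <fstream>
#include <iterator>
#include <vector>

// Minimal logger required by the TensorRT runtime.
class Logger : public nvinfer1::ILogger
{
    void log(Severity severity, const char* msg) noexcept override
    {
        if (severity <= Severity::kWARNING)
            std::printf("%s\n", msg);
    }
};

int main()
{
    Logger logger;

    // Load the serialized engine produced by trtexec (placeholder path).
    std::ifstream file("samnet.trt", std::ios::binary);
    std::vector<char> blob((std::istreambuf_iterator<char>(file)),
                           std::istreambuf_iterator<char>());

    // Deserializing the engine and creating the execution context is where
    // the runtime memory (weights plus activation workspace) gets allocated.
    nvinfer1::IRuntime* runtime = nvinfer1::createInferRuntime(logger);
    nvinfer1::ICudaEngine* engine =
        runtime->deserializeCudaEngine(blob.data(), blob.size());
    nvinfer1::IExecutionContext* context = engine->createExecutionContext();

    // ... cudaMalloc the input/output bindings, copy the preprocessed image in,
    // run context->enqueueV2(bindings, stream, nullptr), and copy the result back ...

    return 0;
}
```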
I am grateful for any help! Thanks.
I did some more digging and found that it is because trtexec builds the engine using cuDNN and cuBLAS on the TX2 but not on my desktop. Lazy loading is also used on my desktop but not on the TX2, since the TX2 is limited to CUDA 10.2.
I rebuilt the engine with cuBLAS disabled (roughly as sketched below) and reduced the memory usage to about 1.2 GB. Any other tips to reduce memory?
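For reference, this is roughly how the library tactic sources can be dropped when building the engine through the C++ API instead of trtexec (the function name disableLibraryTactics is just an illustration). With trtexec the equivalent is the --tacticSources flag, e.g. --tacticSources=-CUBLAS,-CUBLAS_LT,-CUDNN.

```cpp
#include <NvInfer.h>
#include <cstdint>

// Remove the cuBLAS / cuBLAS_LT / cuDNN tactic sources from a builder config
// so the generated engine does not need to load those libraries at runtime.
// "config" is assumed to come from builder->createBuilderConfig().
void disableLibraryTactics(nvinfer1::IBuilderConfig& config)
{
    nvinfer1::TacticSources sources = config.getTacticSources();
    sources &= ~(1U << static_cast<std::uint32_t>(nvinfer1::TacticSource::kCUBLAS));
    sources &= ~(1U << static_cast<std::uint32_t>(nvinfer1::TacticSource::kCUBLAS_LT));
    sources &= ~(1U << static_cast<std::uint32_t>(nvinfer1::TacticSource::kCUDNN));
    config.setTacticSources(sources);
}
```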
Environment
TensorRT Version: 8.2.1
GPU Type: NVIDIA Pascal™ architecture GPU with 256 CUDA cores (Jetson TX2 NX)
CUDA Version: 10.2
Operating System + Version: L4T R32
Relevant Files
C++ file used to run the model:
samnetTest.tar.gz (4.7 MB)
ONNX model:
samnet.onnx.tar.gz (3.5 MB)