TensorRT and CuDNN library sizes are very big

deblauwetom · May 1, 2020, 8:55am

Description

The TensorRT libraries (libnvinfer) and the cuDNN libraries they link against are very large: together roughly 650 MB. That alone already takes up a lot of memory. I have tried linking against the static versions instead, but then my own program simply grows to 650 MB. Is there anything that can be done about this?

Environment

TensorRT Version: 6
GPU Type: nvidia jetson nano
Nvidia Driver Version: jetpack 4.3
CUDA Version: jetpack 4.3 (10.0)
CUDNN Version: jetpack 4.3
Operating System + Version: custom OS based on yocto + meta-tegra layer
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag): nvidia jetson nano

Steps To Reproduce

Link a C++ program against TensorRT’s libnvinfer library and run it.

SunilJB · May 1, 2020, 6:22pm

Moving to Jetson Nano forum so that Jetson team can take a look.

dusty_nv · May 1, 2020, 8:19pm

Hi @deblauwetom, the latest JetPack 4.4 Developer Preview release with cuDNN 8.0 splits up the cuDNN libraries into several sub-libraries which are smaller in size:

$ ls -ll /usr/lib/aarch64-linux-gnu/libcudnn*.so.8.0.0
-rw-r--r-- 1 root root  98767336 Apr 18 04:02 /usr/lib/aarch64-linux-gnu/libcudnn_adv_infer.so.8.0.0
-rw-r--r-- 1 root root  51032536 Apr 18 04:02 /usr/lib/aarch64-linux-gnu/libcudnn_adv_train.so.8.0.0
-rw-r--r-- 1 root root 177698176 Apr 18 04:02 /usr/lib/aarch64-linux-gnu/libcudnn_cnn_infer.so.8.0.0
-rw-r--r-- 1 root root  31817240 Apr 18 04:02 /usr/lib/aarch64-linux-gnu/libcudnn_cnn_train.so.8.0.0
-rw-r--r-- 1 root root 138606184 Apr 18 04:02 /usr/lib/aarch64-linux-gnu/libcudnn_etc.so.8.0.0
-rw-r--r-- 1 root root 108440120 Apr 18 04:02 /usr/lib/aarch64-linux-gnu/libcudnn_ops_infer.so.8.0.0
-rw-r--r-- 1 root root  27284344 Apr 18 04:02 /usr/lib/aarch64-linux-gnu/libcudnn_ops_train.so.8.0.0
-rw-r--r-- 1 root root    149480 Apr 18 04:02 /usr/lib/aarch64-linux-gnu/libcudnn.so.8.0.0

The master libcudnn.so then dynamically loads only the needed cuDNN sub-libraries at runtime based on what API calls the application is performing. This should significantly reduce the memory usage from the libraries (e.g. if only inferencing is used).
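To put rough numbers on that, here is a small sketch that adds up the file sizes from the listing above. It assumes that an inference-only application ends up loading the master library plus the three `*_infer` sub-libraries and nothing else; whether `libcudnn_etc` is also pulled in is not something the listing tells us, so treat the inference-only figure as a lower bound. File size on disk is also only a proxy for resident memory.

```python
# Sizes in bytes, copied from the ls listing above (cuDNN 8.0.0).
SIZES = {
    "libcudnn_adv_infer": 98767336,
    "libcudnn_adv_train": 51032536,
    "libcudnn_cnn_infer": 177698176,
    "libcudnn_cnn_train": 31817240,
    "libcudnn_etc": 138606184,
    "libcudnn_ops_infer": 108440120,
    "libcudnn_ops_train": 27284344,
    "libcudnn": 149480,
}

MIB = 1024 * 1024


def total_mib(names):
    """Sum the on-disk sizes of the named libraries, in MiB."""
    return sum(SIZES[name] for name in names) / MIB


# Assumption: inference only needs the master library and the *_infer parts.
INFER_ONLY = ["libcudnn"] + [n for n in SIZES if n.endswith("_infer")]

all_mib = total_mib(SIZES)
infer_mib = total_mib(INFER_ONLY)

print(f"all sub-libraries: {all_mib:.0f} MiB")
print(f"inference only:    {infer_mib:.0f} MiB")
print(f"difference:        {all_mib - infer_mib:.0f} MiB")
```

Under that assumption, the inference-only subset comes to about 367 MiB out of about 604 MiB in total.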

deblauwetom · May 6, 2020, 11:40am

Hello,

That is good news indeed. However, it seems the “infer” libraries are still the biggest in your list, so I won’t get my hopes up too much.

For example, when I load an engine with deserializeCudaEngine, memory usage goes up by about 800 MB, even though the saved network itself is only 6 MB. The extra usage appears as soon as I call this function once. Would JetPack 4.4 solve this case? Are there any memory usage benchmarks available? I would just like to know whether upgrading to JetPack 4.4 is worth the trouble before I try it.
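For anyone who wants to compare numbers, a jump like this can be measured by reading the process's resident set size before and after the call. The sketch below is Linux-only and reads `VmRSS` from `/proc/self/status`. The helper names are made up for illustration, and `stand_in_load` is only a placeholder allocation, not a TensorRT call; in a real test it would be replaced by the code that deserializes the engine. Note that RSS only covers memory resident in this process, so it is an approximation of the total footprint.

```python
def read_rss_kib(status_path="/proc/self/status"):
    """Return the resident set size of this process in KiB (Linux only)."""
    with open(status_path) as f:
        for line in f:
            if line.startswith("VmRSS:"):
                # Line looks like: "VmRSS:     12345 kB"
                return int(line.split()[1])
    raise RuntimeError("VmRSS not found in " + status_path)


def rss_delta_kib(fn):
    """Call fn() and return (result, change in RSS in KiB)."""
    before = read_rss_kib()
    result = fn()
    after = read_rss_kib()
    return result, after - before


def stand_in_load():
    # Hypothetical placeholder for the engine load: allocates and fills
    # 32 MiB so there is something to measure. Replace with the real call.
    return bytearray(b"\x01") * (32 * 1024 * 1024)


engine, delta = rss_delta_kib(stand_in_load)
print(f"RSS change: {delta / 1024:.1f} MiB")
```

Running the same measurement on JetPack 4.3 and 4.4 with the same engine file would give a direct before/after comparison.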

Thanks,
Best regards
