TensortRT Memory Utilization

whanafy · August 18, 2020, 7:43pm

Description

I am using TensortRT on Nvidia-1080 GPU. I was trying to find out the memory usage of the inference engine on the GPU.
I was trying to find this using the approach described in Developer Guide :: NVIDIA Deep Learning TensorRT Documentation which roughly says the memory usage is The Serialized Engine + 2 * Bindings Sizes + Device Engine Memory Size. However, I found that the actual allocated GPU memory by the whole process is quite different.

The memory consumption computed by the nvidia-smi command nvidia-smi --query-compute-apps=pid,used_memory --format=csv is not the same, to be exact it is 270 MB greater (this is the same value when computed across different engines).

I don’t know why there is a difference in the values, or what is missing in this memory computation?

AakankshaS · August 19, 2020, 7:01am

Hi @whanafy,
Please refer to the below link for FAQ section of the TRT document.

Note: The CUDA infrastructure and device code also consume device memory. The amount of memory will vary by platform, device, and TensorRT version. Use cudaGetMemInfo to determine the total amount of device memory in use.
Thanks!

Topic		Replies	Views
TensorRT Inference Consuming Large Amount of System Resources TensorRT	1	575	July 5, 2022
The memory usage of tensorrt algorithm model is different on different hardware! TensorRT	1	421	November 8, 2022
GPU memory difference between 1070 and 2070 for YOLOv3 TensorRT tensorrt	3	564	May 21, 2020
Would tensorrt optimize the memory consumption？ TensorRT	2	421	May 4, 2020
Tensorrt take much more cpu ram in RTX3070 GPU-Accelerated Libraries cublas	7	1802	October 15, 2021
GPU vs CPU deep learning memory usage Jetson Nano cudnn	5	651	March 26, 2024
GPU Utilization TensorRT tensorrt	3	715	August 29, 2023
TRT inference fp32 vs fp16 TensorRT	4	2555	June 17, 2020
Memory Usage Discrepancy with TensorRT 8.6 and 8.2 Jetson TX2 tensorrt	3	334	March 27, 2024
The same model consumes different sizes of GPU memory in different GPU TensorRT	8	1702	August 8, 2022

TensortRT Memory Utilization

Description

Related topics