TensorRT takes much more CPU RAM on RTX3070

827786283 · October 11, 2021, 9:59am

Description

I ran the MNIST sample on an RTX1060ti and it takes just 789.4 MB of CPU memory, but on an RTX3070 it takes 2406.2 MB of CPU memory.

Environment

TensorRT Version: 7.2.3.4
GPU Type: RTX1060TI, RTX3070
Nvidia Driver Version:
CUDA Version: 11.1
Operating System + Version: Windows 10

Relevant Files

NVES · October 11, 2021, 1:38pm

Hi,
Please refer to the link below for the sample guide.

Refer to the installation steps from the link in case you are missing anything.

However, the suggested approach is to use TRT NGC containers to avoid any issues related to system dependencies.

To run the Python samples, make sure the TRT Python packages are installed while using the NGC container:
/opt/tensorrt/python/python_setup.sh

If you are trying to run a custom model, please share your model and script with us so that we can assist you better.
Thanks!

827786283 · October 12, 2021, 2:57am

Hi,

1. I followed the instructions for installing TensorRT from a zip package on Windows 10.
Installation Guide :: NVIDIA Deep Learning TensorRT Documentation
2. I followed "Running C++ Samples on Windows" and the README.md in sampleMNIST. I can run the executable both directly and through Visual Studio.
Sample Support Guide :: NVIDIA Deep Learning TensorRT Documentation
3. I am not familiar with TRT NGC, but I think using NGC would make deploying a TRT C++ program on Windows more complicated, and I found that NGC does not support Microsoft Windows.
Frequently Asked Questions · NVIDIA/nvidia-docker Wiki · GitHub

I’m just trying to figure out why the same C++ program occupies more CPU RAM when running on a 30-series GPU than on a 10-series GPU. Is there any way to reduce the very large CPU RAM usage on 30-series GPUs on Windows?

Thanks

spolisetty · October 12, 2021, 1:37pm

Could you please confirm whether you are facing the same issue on the latest TRT version, 8.2 EA?

827786283 · October 13, 2021, 3:48am

I still face the same issue when testing on the latest TRT version, 8.2 EA. I also tested on an RTX3060 and got almost the same problem. I also changed CUDA 11.1 to CUDA 11.4, and nothing improved. Below is the program's verbose output.

The attachment is the program I compiled with VS2017 from \TensorRT-8.2.0.6\samples\sampleMNIST

spolisetty · October 14, 2021, 10:22am

Hi,

We have developed more kernels for Ampere GPUs. Some of the memory is consumed by cuDNN and other libraries such as cuBLAS. Newer GPUs also need more memory.
Based on the above screenshots, it looks like cuBLAS and cuDNN are consuming a large amount of CPU memory.

spolisetty · October 14, 2021, 10:25am

Moving this post to the cuBLAS tag to get better help on memory management.

827786283 · October 15, 2021, 1:33am

Hi spolisetty:
Thanks a lot. The verbose output does indeed show that cuBLAS, cuDNN and CUDA initialization take much more CPU memory. The much higher CPU memory usage on 30-series GPUs really causes trouble when deploying our program. I will keep looking for better solutions.
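
For anyone who wants to see which initialization step accounts for the host memory on their own machine, here is a minimal sketch that records the process's resident memory before and after a stage. It is not taken from sampleMNIST or from TensorRT. The names `residentBytes`, `stageGrowthMiB` and `placeholderInit` are hypothetical, and `placeholderInit` is only a stand-in that allocates 64 MiB; in a real program the stage would wrap whatever call is under test (for example CUDA context creation or the engine build). The Windows branch uses `GetProcessMemoryInfo` and assumes the program is linked with psapi.lib; the other branch assumes a Linux-style `/proc/self/statm`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdio>
#include <functional>
#include <vector>

#ifdef _WIN32
#include <windows.h>
#include <psapi.h>   // GetProcessMemoryInfo; link with psapi.lib
#else
#include <unistd.h>  // sysconf
#endif

// Returns the resident host memory of the current process in bytes,
// or 0 if the value could not be read.
std::size_t residentBytes()
{
#ifdef _WIN32
    PROCESS_MEMORY_COUNTERS pmc{};
    if (!GetProcessMemoryInfo(GetCurrentProcess(), &pmc, sizeof(pmc)))
        return 0;
    return static_cast<std::size_t>(pmc.WorkingSetSize);
#else
    // /proc/self/statm: first field is total pages, second is resident pages.
    std::FILE* f = std::fopen("/proc/self/statm", "r");
    if (!f)
        return 0;
    unsigned long totalPages = 0, residentPages = 0;
    const int n = std::fscanf(f, "%lu %lu", &totalPages, &residentPages);
    std::fclose(f);
    if (n != 2)
        return 0;
    return static_cast<std::size_t>(residentPages)
         * static_cast<std::size_t>(sysconf(_SC_PAGESIZE));
#endif
}

// Runs one stage, prints and returns how much resident memory grew (in MiB).
// The result can be negative if the stage released memory.
double stageGrowthMiB(const char* name, const std::function<void()>& stage)
{
    assert(stage && "stage must be callable");
    const std::size_t before = residentBytes();
    stage();
    const std::size_t after = residentBytes();
    const double mib = (static_cast<double>(after) - static_cast<double>(before))
                     / (1024.0 * 1024.0);
    std::printf("%-24s %+.1f MiB\n", name, mib);
    return mib;
}

// Hypothetical stand-in for a library initialization step: allocates and
// touches 64 MiB that stays alive after the call returns.
std::vector<char>& placeholderInit()
{
    static std::vector<char> buffer;
    buffer.assign(64u * 1024u * 1024u, 1);
    return buffer;
}
```

Calling `stageGrowthMiB("placeholder init", [] { placeholderInit(); });` once per stage gives a per-stage breakdown that can be compared between the two GPUs. The numbers are resident (working set) sizes, so they may not match exactly what Task Manager shows in its default column.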
