High RAM consumption with CUDA and TensorRT on Jetson Xavier NX

marek.lipovsky · July 8, 2021, 11:18am

Hello.

We are having issues with high memory consumption on Jetson Xavier NX especially when using TensorRT via ONNX RT.

By default our NN models are in FP32, so we tried converting to FP16 which makes the NN model smaller. However, during the model inference the memory consumption is the same as with FP32.

I did enable FP16 inference using ORT_TENSORRT_FP16_ENABLE=1 as suggested by ONNX Runtime documentation (Redirecting…). But it didn’t help.

Does Jetson Xavier NX support both FP16 and FP32, especially for CUDA and TensorRT?
Is there any other way to reduce the memory consumption when using CUDA and TensorRT?

We also found out that with any NN model the process using ONNX RT with CUDA execution provider utilizes at least 1.5GB of RAM and with TensorRT execution provider at least 2 GB of RAM. Is this expected?

Is there maybe any light version of the libraries to be used on a device with limited RAM like Jetson?

Thank you in advance
Marek

NVES · July 8, 2021, 11:37am

Hi,
This looks like a Jetson issue. We recommend you to raise it to the respective platform from the below link

Thanks!

Topic		Replies	Views
High RAM consumption with CUDA and TensorRT on Jetson Xavier NX Jetson Xavier NX tensorrt	10	2838	October 18, 2021
Same memory usage for fp16 and int8 Jetson Xavier NX tensorrt	4	2144	September 27, 2021
TensorRT model consuming more amount of RAM Jetson TX2 tensorrt	3	886	October 18, 2021
Excessive RAM usage Jetson Xavier NX pytorch , docker-machine-learning	4	867	February 12, 2024
CenterNet keypoint detector not giving good FPS in jetson xavier NX Jetson Xavier NX tensorrt , tensorflow	4	1621	August 29, 2021
Tensorrt Engine use too much memory TensorRT tensorrt	1	1595	December 13, 2021
Optimizing memory consumption on Jetson Jetson AGX Xavier jetson-inference	10	1082	October 18, 2021
Memory Usage Discrepancy with TensorRT 8.6 and 8.2 Jetson TX2 tensorrt	3	341	March 27, 2024
GPU vs CPU deep learning memory usage Jetson Nano cudnn	5	692	March 26, 2024
Using ONNX Runtime with TensorRT on Jetson Devices Jetson AGX Xavier tensorrt	5	1110	October 18, 2021

High RAM consumption with CUDA and TensorRT on Jetson Xavier NX

Related topics