Out of memory error when trying to convert model using INT8 precision mode

Hi,

I’ve been following your example to speed up inference with TensorRT, but when I try conversion with INT8 precision mode, my program is killed during the .build() call.

Output of sudo dmesg -T is:

[Thu Sep 14 04:40:02 2023] oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),cpuset=/,mems_allowed=0,global_oom,task_memcg=/user.slice/user-1000.slice/session-54.scope,task=python,pid=218008,uid=1000
[Thu Sep 14 04:40:02 2023] Out of memory: Killed process 218008 (python) total-vm:87797408kB, anon-rss:0kB, file-rss:1177636kB, shmem-rss:0kB, UID:1000 pgtables:38348kB oom_score_adj:0
[Thu Sep 14 04:40:07 2023] oom_reaper: reaped process 218008 (python), now anon-rss:0kB, file-rss:1179088kB, shmem-rss:0kB

Moreover, whenever I use TensorFlow, my memory gets used up almost completely. Is this normal? Here’s the output of jtop:
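(Aside: TensorFlow reserves nearly all GPU memory at startup by default, and on Jetson the CPU and GPU share the same physical RAM. A hedged sketch of the usual workaround, to be run before any other TF operations:)

```python
import tensorflow as tf

# TensorFlow pre-allocates almost all GPU memory by default.
# Enabling memory growth makes it allocate on demand instead,
# which matters on Jetson where CPU and GPU share physical RAM.
# This must run before any TF ops are executed.
for gpu in tf.config.list_physical_devices('GPU'):
    tf.config.experimental.set_memory_growth(gpu, True)
```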

My code:

import sys
import os
import cv2
import numpy as np
import tensorflow as tf
from tensorflow.python.compiler.tensorrt import trt_convert as trt
from pose_estimation.pose_net import PoseNet

POSE_SAVED_MODEL_DIR="./weights/pose_native_saved_model"

def calibration_input():
    for i in range(100):
        batched_input = np.random.random((1, 224, 224, 3)).astype(np.float32)
        batched_input = tf.constant(batched_input)
        yield (batched_input,)

# Instantiate the TF-TRT converter
converter = trt.TrtGraphConverterV2(
    input_saved_model_dir=POSE_SAVED_MODEL_DIR,
    use_dynamic_shape=True,
    dynamic_shape_profile_strategy='Optimal',
    max_workspace_size_bytes=4000000000,
    precision_mode=trt.TrtPrecisionMode.INT8
)
 
# Convert the model into TRT compatible segments
trt_func = converter.convert(calibration_input_fn=calibration_input)
converter.summary()

POSE_TRT_MODEL_DIR="./weights/pose_trt_09.14_01_int8"

def input_fn():
    batched_input = np.random.random((1, 224, 224, 3)).astype(np.float32)
    # build() expects input_fn to yield a tuple/list of input tensors
    yield (tf.constant(batched_input),)

converter.build(input_fn=input_fn)
converter.save(output_saved_model_dir=POSE_TRT_MODEL_DIR)

Thanks for sharing your issue.
Our internal team will check and update you with more info.

Dear @nikola2,
May I know the JetPack version?

• Hardware Platform Jetson
• DeepStream Version 6.3
• JetPack Version 5.1
• TensorRT Version 5.1

Dear @nikola2,
From jtop I see it is a Jetson AGX Orin. Is it a Jetson AGX Orin or an AGX Orin NX? Could you please also share the model so we can reproduce the issue?

It’s a Jetson AGX Orin.
Sure, here’s the link

Dear @nikola2 ,
I see similar behavior on my machine.


Dear @nikola2,
Did you check whether the TF → ONNX → TRT path works without the OOM (out-of-memory) issue?
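For reference, that route can be sketched roughly as follows (flags and output names are placeholders; this assumes JetPack 5.x with the tf2onnx package installed):

```shell
# Export the SavedModel to ONNX with the tf2onnx package
python -m tf2onnx.convert \
    --saved-model ./weights/pose_native_saved_model \
    --opset 13 \
    --output pose_net.onnx

# Build an INT8 engine with trtexec (ships with TensorRT on JetPack)
/usr/src/tensorrt/bin/trtexec \
    --onnx=pose_net.onnx \
    --int8 \
    --saveEngine=pose_net_int8.engine
```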


Hi, I switched to DeepStream and TensorRT, and my application no longer has the OOM issue.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.