Hello,
I am trying to convert a Mobile_SAM ONNX model to TensorRT, and I am facing the following error.
[08/07/2023-17:58:33] [I] [TRT] [MemUsageChange] TensorRT-managed allocation in IExecutionContext creation: CPU +0, GPU +45, now: CPU 0, GPU 60 (MiB)
[08/07/2023-17:58:33] [W] [TRT] CUDA lazy loading is not enabled. Enabling it can significantly reduce device memory usage and speed up TensorRT initialization. See “Lazy Loading” section of CUDA documentation CUDA C++ Programming Guide
[08/07/2023-17:58:33] [I] Setting persistentCacheLimit to 0 bytes.
[08/07/2023-17:58:33] [I] Using values loaded from /content/image_embeddings_1.bin for input image_embeddings
[08/07/2023-17:58:33] [I] Input binding for image_embeddings with dimensions 1x256x64x64 is created.
[08/07/2023-17:58:33] [I] Using values loaded from /content/points_coords_1.bin for input point_coords
[08/07/2023-17:58:33] [E] Uncaught exception detected: Cannot open file /content/points_coords_1.bin!
I have verified the file. It works fine, and there are other binary files which are loading without issue, but I don't understand why I am getting this error for this one. Below is the command line execution:
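For context, each file passed to --loadInputs is, as far as I understand, just the raw bytes of the flattened tensor. A minimal sketch using only the Python standard library of how such a file can be written and sanity-checked (the filename and values here are made up for illustration):

```python
from array import array

# Hypothetical example: a float32 tensor flattened to 10 values.
values = [float(i) for i in range(10)]

# Write the raw float32 bytes, which is the layout trtexec's --loadInputs reads.
with open("point_coords_example.bin", "wb") as f:
    array("f", values).tofile(f)

# Read it back and verify the size: 10 floats * 4 bytes each.
with open("point_coords_example.bin", "rb") as f:
    data = f.read()
assert len(data) == 10 * 4

restored = array("f")
restored.frombytes(data)
assert list(restored) == values
```

If a file's size does not equal (element count * bytes per element) for the shape given in --shapes, trtexec can fail while setting up that binding.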
Hi,
Please share the ONNX model and the script, if not already shared, so that we can assist you better.
Alongside, you can try a few things:
1) Validate your model with the below snippet:
check_model.py
import onnx

# Load the model and validate it; this raises an exception if the model is malformed.
model = onnx.load("your_model.onnx")  # replace with the path to your ONNX model
onnx.checker.check_model(model)
2) Try running your model with the trtexec command.
In case you are still facing the issue, please share the trtexec --verbose log for further debugging.
Thanks!
Hi! Sure, thank you for the reply. The files (ONNX and binary files) are attached below. I have just executed the trtexec line mentioned in the previous comment.
[08/10/2023-15:51:09] [I] [TRT] [MS] The main stream provided by execute/enqueue calls is the first worker stream
[08/10/2023-15:51:09] [I] [TRT] [MemUsageChange] TensorRT-managed allocation in IExecutionContext creation: CPU +0, GPU +45, now: CPU 0, GPU 60 (MiB)
[08/10/2023-15:51:09] [W] [TRT] CUDA lazy loading is not enabled. Enabling it can significantly reduce device memory usage and speed up TensorRT initialization. See "Lazy Loading" section of CUDA documentation https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#lazy-loading
[08/10/2023-15:51:09] [I] Setting persistentCacheLimit to 0 bytes.
[08/10/2023-15:51:09] [E] Cannot find input tensor with name "‘has_mask_input’" in the engine bindings! Please make sure the input tensor names are correct.
[08/10/2023-15:51:09] [E] Invalid tensor names found in --loadInputs flag.
[08/10/2023-15:51:09] [E] Inference set up failed
&&&& FAILED TensorRT.trtexec [TensorRT v8601] # trtexec --onnx=sam_onnx_example_without_quantization.onnx --loadInputs=‘image_embeddings’:./image_embeddings_1.bin,‘point_coords’:./points_coords_1.bin,‘point_labels’:./point_labels_1.bin,‘mask_input’:./mask_input_1.bin,‘has_mask_input’:./has_mask_input_1.bin,‘orig_im_size’:./orig_im_size_1.bin --shapes=image_embeddings:1x256x64x64,point_coords:1x5x2,point_labels:1x5,mask_input:1x1x256x256,has_mask_input:1,orig_im_size:1x2
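Note that the error log shows the engine looking for a tensor literally named "‘has_mask_input’", including the typographic quotes, which suggests the curly quotes (‘…’) from the copy-pasted command were passed through to trtexec as part of the tensor names rather than being stripped by the shell. A small stdlib sketch of detecting and removing such characters before building the command:

```python
# Typographic quote characters that editors/forums often substitute for
# plain ASCII quotes; a shell does NOT strip these, so they end up in
# the tensor name trtexec receives.
CURLY_QUOTES = "\u2018\u2019\u201c\u201d"  # i.e. the characters ‘ ’ “ ”

def clean_name(name: str) -> str:
    """Strip surrounding typographic quotes from a tensor name."""
    return name.strip(CURLY_QUOTES)

# The name from the failing log, with curly quotes, versus the clean name.
assert clean_name("\u2018has_mask_input\u2019") == "has_mask_input"
# Names without stray quotes are left unchanged.
assert clean_name("image_embeddings") == "image_embeddings"
```

Retyping the --loadInputs argument in a plain-text editor with ASCII quotes (or no quotes at all) should make the tensor names match the engine bindings.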