Failed to convert the SAM2.1 decoder ONNX model due to Error Code 4: Internal Error

Greetings, everyone.
Here is an onnx file: https://github.com/ibaiGorordo/ONNX-SAM2-Segment-Anything/releases/download/0.2.0/decoder.onnx

I am trying to convert this file to a TensorRT engine using this CLI:
trtexec --onnx=decoder.onnx --saveEngine=decoder.engine

Hardware information output by TensorRT:
[04/20/2025-17:31:16] [I] === Device Information ===
[04/20/2025-17:31:16] [I] Available Devices:
[04/20/2025-17:31:16] [I] Device 0: "Orin" UUID: GPU-a00bb704-da56-555b-a79c-65a4e3662de8
[04/20/2025-17:31:16] [I] Selected Device: Orin
[04/20/2025-17:31:16] [I] Selected Device ID: 0
[04/20/2025-17:31:16] [I] Selected Device UUID: GPU-a00bb704-da56-555b-a79c-65a4e3662de8
[04/20/2025-17:31:16] [I] Compute Capability: 8.7
[04/20/2025-17:31:16] [I] SMs: 16
[04/20/2025-17:31:16] [I] Device Global Memory: 62840 MiB
[04/20/2025-17:31:16] [I] Shared Memory per SM: 164 KiB
[04/20/2025-17:31:16] [I] Memory Bus Width: 256 bits (ECC disabled)
[04/20/2025-17:31:16] [I] Application Compute Clock Rate: 1.3 GHz
[04/20/2025-17:31:16] [I] Application Memory Clock Rate: 1.3 GHz

The TensorRT version is shown here (the CUDA version is 12.6):
[04/20/2025-17:31:16] [I] Note: The application clock rates do not reflect the actual clock rates that the GPU is currently running at.
[04/20/2025-17:31:16] [I] TensorRT version: 10.7.0
[04/20/2025-17:31:16] [I] Loading standard plugins
[04/20/2025-17:31:16] [I] [TRT] [MemUsageChange] Init CUDA: CPU +2, GPU +0, now: CPU 31, GPU 13403 (MiB)
[04/20/2025-17:31:18] [I] [TRT] [MemUsageChange] Init builder kernel library: CPU +928, GPU +749, now: CPU 1002, GPU 14197 (MiB)
[04/20/2025-17:31:18] [I] Start parsing network model.
[04/20/2025-17:31:18] [I] [TRT] ----------------------------------------------------------------
[04/20/2025-17:31:18] [I] [TRT] Input filename: /home/nvidia/projects/segment-anything-2/decoder.onnx
[04/20/2025-17:31:18] [I] [TRT] ONNX IR version: 0.0.8
[04/20/2025-17:31:18] [I] [TRT] Opset version: 16
[04/20/2025-17:31:18] [I] [TRT] Producer name: pytorch
[04/20/2025-17:31:18] [I] [TRT] Producer version: 2.6.0
[04/20/2025-17:31:18] [I] [TRT] Domain:
[04/20/2025-17:31:18] [I] [TRT] Model version: 0
[04/20/2025-17:31:18] [I] [TRT] Doc string:
[04/20/2025-17:31:18] [I] [TRT] ----------------------------------------------------------------
[04/20/2025-17:31:18] [E] Error[4]: ITensor::getDimensions: Error Code 4: Internal Error (/OneHot: an IIOneHotLayer cannot be used to compute a shape tensor)
[04/20/2025-17:31:18] [E] [TRT] ModelImporter.cpp:948: While parsing node number 146 [Tile -> "/Tile_output_0"]:
[04/20/2025-17:31:18] [E] [TRT] ModelImporter.cpp:950: --- Begin node ---
input: "/Unsqueeze_11_output_0"
input: "/Reshape_4_output_0"
output: "/Tile_output_0"
name: "/Tile"
op_type: "Tile"

[04/20/2025-17:31:18] [E] [TRT] ModelImporter.cpp:951: --- End node ---
[04/20/2025-17:31:18] [E] [TRT] ModelImporter.cpp:953: ERROR: ModelImporter.cpp:195 In function parseNode:
[6] Invalid Node - /Tile
ITensor::getDimensions: Error Code 4: Internal Error (/OneHot: an IIOneHotLayer cannot be used to compute a shape tensor)
[04/20/2025-17:31:18] [E] Failed to parse onnx file
[04/20/2025-17:31:18] [I] Finished parsing network model. Parse time: 0.037032
[04/20/2025-17:31:18] [E] Parsing model failed
[04/20/2025-17:31:18] [E] Failed to create engine from model or file.
[04/20/2025-17:31:18] [E] Engine set up failed

Is anyone willing to share any ideas about this?
Thank you, kind community. :)

Hi @noname.mark09,
Can you please try specifying min, opt, and max shapes for the model's dynamic inputs in the trtexec command, and confirm whether you are still facing this issue?
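For reference, here is a hedged sketch of such a command; the dynamic input names (point_coords, point_labels) and their shapes are assumptions based on typical SAM2 decoder exports, so please confirm the actual names and dimensions first with Netron or polygraphy inspect model decoder.onnx:

trtexec --onnx=decoder.onnx --saveEngine=decoder.engine \
  --minShapes=point_coords:1x1x2,point_labels:1x1 \
  --optShapes=point_coords:1x2x2,point_labels:1x2 \
  --maxShapes=point_coords:1x10x2,point_labels:1x10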
Alternatively

  1. Review Layer Support:
  • The OneHot layer and related operations sometimes cause compatibility issues in TensorRT. According to a GitHub issue, an IIOneHotLayer cannot be used to compute a shape tensor, which is exactly the ITensor::getDimensions error in your log. This indicates that TensorRT does not support the OneHot operation in the position your ONNX model uses it (here it feeds the repeats of a Tile).
  • Solution: Consider simplifying or replacing this part of the model before conversion, for example by constant-folding the subgraph or using alternative logic for the one-hot encoding (see the constant-folding sketch after this list).
  2. Verify ONNX Model Version and Operations:
  • Ensure your ONNX model uses operations supported by the TensorRT version you are running; TensorRT tends to lag behind the latest ONNX features and operations.
  • Solution: Check the opset version of your ONNX model (your log shows opset 16). It might be beneficial to re-export or convert to an earlier opset that is known to work smoothly with TensorRT (see the opset sketch after this list).
  3. Use Latest Versions:
  • Make sure you are using the latest versions of both TensorRT and the ONNX-TensorRT parser, as improvements and bug fixes could resolve your issue.
  • Solution: Update TensorRT and ensure you are using CUDA and cuDNN versions compatible with your TensorRT installation.
  4. Inspect Input Types:
  • Ensure that the input types for your layers are supported and correctly formatted. As seen in logs for similar errors, TensorRT does not natively support certain data types such as UINT8.
  • Solution: Cast all relevant inputs to supported types like INT32 or FLOAT32 prior to conversion (see the input-inspection sketch after this list).
  5. Alter the Model:
  • If the error continues, consider modifying the original ONNX model before conversion. Using libraries like PyTorch, onnx, or onnx-graphsurgeon to manually alter the operations could help create a more compatible model for TensorRT.
  • Example: You might replace or remove the layers that trigger the incompatibility, or reconfigure how data flows through them (see the graph-inspection sketch after this list).
  6. Use Workaround Techniques:
  • Some users have reported success by exporting their models with alternate flags or by passing additional parameters to the trtexec command.
  • Solution: Try modifying how the inputs are defined during conversion. (The --explicitBatch flag is usually not needed anymore: explicit batch has been the default for some time and the flag may be unrecognized on TensorRT 10.)
  7. Debugging Tools:
  • Use TensorRT's debugging facilities to gain insight into where the conversion is failing. This usually means checking the verbose output during the conversion.
  • Solution: Run trtexec --onnx=decoder.onnx --verbose to get detailed logs that highlight the exact failure point.
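For item 1, a minimal Python sketch, assuming the onnx and onnxsim (onnx-simplifier) packages are installed. Simplification often constant-folds the shape computation that feeds the Tile repeats, including the offending /OneHot node, which can make the graph parseable by TensorRT:

import onnx
from onnxsim import simplify  # pip install onnxsim

# Load the exported decoder and try to constant-fold / simplify the graph.
model = onnx.load("decoder.onnx")
simplified, ok = simplify(model)
if ok:
    onnx.save(simplified, "decoder_simplified.onnx")  # retry trtexec on this file
else:
    print("Simplification check failed; inspect the graph manually.")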
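For item 2, a sketch of checking and (best-effort) converting the opset with the onnx package. Whether a downgrade from opset 16 succeeds depends on the operators in the graph, so this may raise an exception for some nodes:

import onnx
from onnx import version_converter

model = onnx.load("decoder.onnx")
# Print the opset imports; your trtexec log reports opset 16.
print([(imp.domain, imp.version) for imp in model.opset_import])

# Best-effort conversion to an older opset (13 here is an arbitrary choice).
converted = version_converter.convert_version(model, 13)
onnx.save(converted, "decoder_opset13.onnx")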
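For item 4, a sketch that prints each declared graph input with its data type and shape; this is also handy for picking the names and dimensions to pass to --minShapes/--optShapes/--maxShapes:

import onnx

model = onnx.load("decoder.onnx")
for inp in model.graph.input:
    t = inp.type.tensor_type
    dims = [d.dim_param or d.dim_value for d in t.shape.dim]
    print(inp.name, onnx.TensorProto.DataType.Name(t.elem_type), dims)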
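For item 5, a sketch using onnx-graphsurgeon (pip install onnx-graphsurgeon) to locate the /OneHot and /Tile nodes named in the error, so you can see what feeds the Tile repeats and decide how to rewrite that subgraph:

import onnx
import onnx_graphsurgeon as gs

graph = gs.import_onnx(onnx.load("decoder.onnx"))
for node in graph.nodes:
    if node.op in ("OneHot", "Tile"):
        print(node.op, node.name, "inputs:", [t.name for t in node.inputs])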

Thanks