ONNX/TensorRT INT64 Clamping. Why?

Models exist which use INT64 values.
Inference with those models works on my 4090 GPU.
Hardware computation on a GPU or CPU involving INT64 values doesn't care whether the code invoking it is non-compiled Python or highly optimized TRT-compiled code.
So why doesn't TRT support INT64?

Also, if there were a good reason, wouldn't a conversion to float32 better represent large-magnitude INT64 values than clamping to INT32? I'm not sure if this might be a contributing factor in the poor quality of SD-inferenced images when TRT is used.
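The trade-off can be sketched in plain Python: clamping a large INT64 value to the INT32 range saturates it and loses all magnitude information, while a float32 conversion keeps the magnitude with only a small relative error. This is just an illustrative sketch of the numeric behavior; the value and helper names are made up for the example.

```python
import struct

INT32_MAX = 2**31 - 1

def to_float32(x):
    """Round-trip a number through IEEE-754 single precision."""
    return struct.unpack('<f', struct.pack('<f', float(x)))[0]

def clamp_int32(x):
    """Saturate a value to the INT32 range (what INT64 clamping does)."""
    return max(-2**31, min(INT32_MAX, x))

value = 2**40 + 12345            # an INT64 value far outside INT32 range

clamped = clamp_int32(value)     # 2147483647: magnitude information is gone
as_f32 = to_float32(value)       # 1099511627776.0: tiny relative error

print(clamped, as_f32)
```

Whether that relative error matters depends on how the tensor is used (e.g. as an index vs. as an arithmetic operand), which is presumably why a blanket float32 conversion isn't done either.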

Could you please share the ONNX model and the script, if not shared already, so that we can assist you better.
Alongside, you can try a few things:

  1. Validate your model with the below snippet:


import onnx
filename = yourONNXmodel
model = onnx.load(filename)
onnx.checker.check_model(model)
  2. Try running your model with the trtexec command.

In case you are still facing the issue, please share the trtexec "--verbose" log for further debugging.
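For step 2, a minimal trtexec invocation might look like the following (the model and log file names are placeholders; trtexec ships in the TensorRT bin directory):

```shell
# Attempt to build a TensorRT engine from the ONNX model,
# capturing the verbose log for debugging.
trtexec --onnx=model.onnx --verbose > trtexec_verbose.log 2>&1
```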

It is well known that TensorRT doesn't support INT64. Are you doubting that non-TensorRT models sometimes use INT64? Do you really want me to post a 1.7 GB unet_fp16.onnx model converted from Hugging Face runwayml/stable-diffusion-v1-5?

I'm not trying to track down a bug for which providing a test case would be appropriate. This situation is obvious: TRT does NOT support INT64. Why? My NVIDIA hardware has no problem doing inference with non-TensorRT engine models that happen to contain INT64 values.

Also, onnx-checker didn't output anything, as if there were no problem with the model.


INT64 support will be added in a future major release version. Please stay tuned for the update.

Thank you.