Description
In NLP tasks, many of the input tensors are INT64 ID tensors, but TensorRT currently doesn't support the INT64 datatype. Are there any workarounds or tips for this?
Thanks.
Environment
TensorRT Version : 8.0.1.6
GPU Type : T4
Nvidia Driver Version : 460.32.03
CUDA Version : 11.2
CUDNN Version : 8.0.5.39
Operating System + Version : Ubuntu 18.04.5 LTS
Hi,
TensorRT will attempt to cast INT64 down to INT32. For your reference, see this related GitHub issue:
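Since the cast-down is only lossless when every ID fits in the int32 range, one practical workaround is to narrow the ID tensors yourself before export and fail loudly on overflow. Below is a minimal stdlib-only sketch of that check; the helper name `narrow_ids_to_int32` is illustrative, not a TensorRT or PyTorch API.

```python
INT32_MIN, INT32_MAX = -2**31, 2**31 - 1

def narrow_ids_to_int32(ids):
    # Verify every ID fits in int32 before casting, since TensorRT's
    # INT64 -> INT32 cast-down is only safe when no value exceeds that range.
    for v in ids:
        if not (INT32_MIN <= v <= INT32_MAX):
            raise OverflowError(f"ID {v} does not fit in int32")
    return [int(v) for v in ids]
```

Vocabulary IDs in NLP models are far below 2^31, so in practice this check passes and the narrowed tensor can be fed to the exported model.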
(GitHub issue, opened 10 May 2018, closed 20 Oct 2020; labels: question, triaged)
I am trying to deploy an ONNX model using TensorRT, but cannot find TensorRT support for the int64 data type.
nvinfer1::ITensor supports the following types:
kFLOAT FP32 format.
kHALF FP16 format.
kINT8 INT8 format.
kINT32 INT32 format.
In the ONNX model exported from PyTorch, I have several usages of int64:
1. The Shape operator generates an int64 tensor per its definition.
2. Some constants are int64.
Any thoughts on how to address that? Should we modify the PyTorch export to avoid int64, assuming we never use a number > 2G? Or should we add that into onnx-tensorrt?
Thank you.
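For the int64 constants the issue mentions, another option is to rewrite the initializers to int32 in the exported ONNX file. ONNX stores initializer `raw_data` in little-endian byte order, so the repacking can be sketched with the stdlib alone; the function name below is hypothetical, and in a real script you would apply it to each `TensorProto` via the `onnx` package.

```python
import struct

def repack_int64_raw_to_int32(raw: bytes) -> bytes:
    # Unpack little-endian int64 values ("q"), verify each fits in int32,
    # then repack them as little-endian int32 ("i") -- half the bytes.
    n = len(raw) // 8
    vals = struct.unpack(f"<{n}q", raw)
    for v in vals:
        if not (-2**31 <= v < 2**31):
            raise OverflowError(f"constant {v} exceeds int32 range")
    return struct.pack(f"<{n}i", *vals)

# Example: three int64 constants repacked as int32.
src = struct.pack("<3q", 1, 512, 2**20)
dst = repack_int64_raw_to_int32(src)
```

Note that a real conversion must also update the tensor's `data_type` field to INT32, which the onnx-tensorrt parser now handles automatically via the same cast-down.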