Hi, I want to cast an INT32 tensor to float. How can I do that in TensorRT? I tried the Identity layer, but it is not supported.
Hi, Please refer to the link below to perform inference in INT8:
https://github.com/NVIDIA/TensorRT/blob/master/samples/opensource/sampleINT8/README.md
Thanks!
Hi, I am not doing INT8 quantization. What I asked is how to convert an INT32 tensor to a float-precision tensor. More specifically, I want to feed the second output tensor of ITopKLayer (INT32) as the input of IResizeLayer.
Thanks.
Hi @v.hunglx2,
INT32 is for indices.
Could you please let us know whether there is a specific reason you want to convert indices to float?
Thank you.
In some semantic-segmentation cases, performing TopK (argmax) on the full-size segmentation output may slow down the pipeline. We want to perform TopK on the small-size segmentation output and then scale the argmax index mask up to the original size, all with TensorRT layers.
Hi @v.hunglx2,
INT32 -> kFLOAT is actually a supported operation of nvinfer1::IIdentityLayer,
but it looks like the documentation does not mention it. Unfortunately, we couldn't find a sample to share.
https://docs.nvidia.com/deeplearning/tensorrt/api/c_api/classnvinfer1_1_1_i_identity_layer.html
Please try it, and if you face any issues, share the logs and related scripts for better assistance.
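For reference, the approach above can be sketched with the TensorRT Python API (which mirrors the C++ network API). This is a minimal, untested sketch assuming TensorRT 7.x; `logits`, the axis layout, and the output shape are hypothetical placeholders for the real network:

```python
# Sketch (assumes TensorRT 7.x): cast the INT32 index output of a TopK
# layer to FP32 via an Identity layer so it can feed an IResizeLayer.
import tensorrt as trt

def add_argmax_upsample(network, logits, out_h, out_w):
    # TopK with k=1 over the class axis gives the argmax mask.
    # Here we assume logits has shape (C, H, W); axes is a bit mask,
    # so 1 << 0 reduces axis 0 (the class dimension).
    topk = network.add_topk(logits, trt.TopKOperation.MAX, 1, 1 << 0)
    indices = topk.get_output(1)  # second output: INT32 indices

    # The Identity layer performs the INT32 -> FP32 conversion when the
    # output type is set explicitly (supported in TRT 7+, though not
    # called out in the documentation).
    cast = network.add_identity(indices)
    cast.set_output_type(0, trt.float32)

    # Resize the float mask back to the original resolution.
    resize = network.add_resize(cast.get_output(0))
    resize.shape = (1, out_h, out_w)
    resize.resize_mode = trt.ResizeMode.NEAREST
    return resize.get_output(0)
```

Nearest-neighbor resize is used here because interpolating class indices would produce meaningless fractional labels.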
Thank you.
Hi, I tried again with a simple ONNX model containing a Cast operator, which is parsed as an nvinfer1::IIdentityLayer by the trtexec tool.
TensorRT 7.1.3 (JetPack): the TRT engine built successfully.
TensorRT 6.3.1 (DriveOS 5.2 release): I got some errors. A sample ONNX model and log are attached below.
So maybe INT32-to-FLOAT conversion is not supported by TensorRT 6? temp.onnx (175 Bytes) TensorRT6.3.1-trtexec.log (6.8 KB)
Hi @v.hunglx2,
With the latest TRT version, we could successfully generate the engine.
Yes, conversion between FP32 and INT32 is a TRT 7.0+ feature; it is not supported in TRT 6.
Thank you.