Shufflenet_v2_x1_0 on TensorRT 7.0: FP32 and FP16 inference results are quite different

taoze_happy · June 12, 2020, 3:14am

Description

Environment

TensorRT Version: 7.0
GPU Type: Tesla T4
Nvidia Driver Version: 440.33.01
CUDA Version: 10.2
CUDNN Version: 7.6.5.32
Operating System + Version: Ubuntu 18.04 + 4.15.0-101-generic
Python Version (if applicable): 3.7.3
PyTorch Version (if applicable): 1.2.0

1. Generate the ONNX model
import torch
import torchvision.models as models

# Load the pretrained model and switch it to inference mode
model = models.shufflenet_v2_x1_0(pretrained=True).cuda()
model.eval()
dummy_input = torch.randn(1, 3, 224, 224).cuda()
input_names = ["input"]
output_names = ["output"]
torch.onnx.export(model, dummy_input, "shufflenet_v2_x1_0.onnx", verbose=False,
                  opset_version=9, input_names=input_names, output_names=output_names)
2. Convert the ONNX model to TensorRT engines
    FP32:
    trtexec --onnx=shufflenet_v2_x1_0.onnx --saveEngine=32.trt
    FP16:
    trtexec --onnx=shufflenet_v2_x1_0.onnx --fp16 --saveEngine=16.trt
3. Run inference
    trtexec --loadEngine=32.trt --exportOutput=result_32.trt
    trtexec --loadEngine=16.trt --exportOutput=result_16.trt
4. Compare the results
    The FP32 and FP16 outputs differ by much more than the last few decimal places.
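To put a number on the difference in step 4, the two exported files can be compared with a short script. This is only a sketch: it assumes that the files written by --exportOutput are JSON lists whose entries carry "name" and "values" fields, which should be checked against the actual files. The function names load_trtexec_output and compare_outputs are made up for illustration, and "output" is the tensor name chosen in step 1.

```python
import json
import math


def load_trtexec_output(path, tensor_name="output"):
    """Read one tensor from a trtexec --exportOutput file.

    Assumption: the file is a JSON list of entries such as
    {"name": ..., "dimensions": ..., "values": [...]}.
    """
    with open(path) as f:
        entries = json.load(f)
    for entry in entries:
        if entry["name"] == tensor_name:
            return [float(v) for v in entry["values"]]
    raise KeyError("tensor %r not found in %s" % (tensor_name, path))


def compare_outputs(a, b):
    """Summarize how far two flat output vectors are from each other."""
    if len(a) != len(b):
        raise ValueError("outputs have different lengths")
    # Largest element-wise deviation
    max_abs_diff = max(abs(x - y) for x, y in zip(a, b))
    # Cosine similarity: close to 1.0 when only rounding noise separates them
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    cosine = dot / (norm_a * norm_b) if norm_a and norm_b else float("nan")
    # Whether both runs predict the same class
    top1_a = max(range(len(a)), key=a.__getitem__)
    top1_b = max(range(len(b)), key=b.__getitem__)
    return {
        "max_abs_diff": max_abs_diff,
        "cosine": cosine,
        "top1_match": top1_a == top1_b,
    }
```

For the files from step 3, this could be called as compare_outputs(load_trtexec_output("result_32.trt"), load_trtexec_output("result_16.trt")). A cosine value far below 1.0 or a top-1 mismatch points to something beyond ordinary FP16 rounding.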

SunilJB · June 12, 2020, 6:57am

The issue is fixed, and the fix should be available in the next TRT release.
Please stay tuned for the TRT release announcement.

Thanks

taoze_happy · June 12, 2020, 7:11am

Thanks for your reply. When will the next version be released?

SunilJB · June 12, 2020, 7:17am

I don’t have information about the exact release date.
Please stay tuned for the TRT release announcement.

Thanks

taoze_happy · June 12, 2020, 7:18am

thank you
