Comparing Embedding Output Between PyTorch and TensorRT

Description

Questions:

  1. How should I compare the embedding output of a PyTorch model loaded from TorchScript with the embedding output of a TensorRT engine to evaluate my implementation?
  2. How close should I expect the embedding outputs to be?

I converted a PyTorch model loaded from TorchScript and serialized it into a TensorRT engine plan using the Python API with FP32 precision. I would now like to compare the outputs to make sure my implementation is correct. I understand that some difference in output is expected, based on the instructional video on converting TensorFlow models to TensorRT for classification. I ran the same image through both models, flattened each embedding to a 1D array, and compared them using cosine distance (1 - cosine similarity), which came out to 0.0013010502 (see the sketch below). Is this a good method for comparing the inferences between the two models to verify that I performed the conversion correctly and that my TensorRT engine output is what I should expect? How close should I expect the outputs to be?
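For reference, this is roughly how I am comparing the two embeddings. It is a minimal sketch; `pt_embedding` and `trt_embedding` are placeholder names for the host-side NumPy arrays I copy out of the TorchScript model and the TensorRT engine after running the same preprocessed image through both.

```python
import numpy as np
from scipy.spatial.distance import cosine


def compare_embeddings(pt_embedding: np.ndarray, trt_embedding: np.ndarray) -> None:
    """Compare two embedding tensors after flattening them to 1D float32 arrays."""
    pt_flat = np.asarray(pt_embedding, dtype=np.float32).ravel()
    trt_flat = np.asarray(trt_embedding, dtype=np.float32).ravel()

    # Cosine distance = 1 - cosine similarity between the flattened embeddings.
    cos_dist = cosine(pt_flat, trt_flat)

    # Element-wise absolute error statistics as an additional sanity check.
    abs_err = np.abs(pt_flat - trt_flat)

    print(f"cosine distance: {cos_dist:.10f}")
    print(f"max abs error:   {abs_err.max():.6e}")
    print(f"mean abs error:  {abs_err.mean():.6e}")
```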

Environment

TensorRT Version: 10.8.0.43
GPU Type: A5000
Nvidia Driver Version: 565.57.01
CUDA Version: 12.7
CUDNN Version: n/a
Operating System + Version: Ubuntu 22.04.5 LTS
Python Version (if applicable): Python 3.12.8
TensorFlow Version (if applicable):
PyTorch Version (if applicable): 2.5.1+cu124
Baremetal or Container (if container which image + tag): none

Relevant Files

None, as this is not a bug report.

Steps To Reproduce

Please see the questions above.