Trained model gives slightly different values when tested on P100 and V100. Is there a way to make it consistent?

Hi all,
I have trained a model on a V100 GPU, but my inference device is a P100. When I run the test dataset, the two GPUs give slightly different values, which ends up reducing my overall score on the P100 hardware. As a workaround, I computed the differences between the two sets of predictions and used their mean to shift my P100 prediction values. That helped a little, but it does not solve the problem.
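
For reference, this is roughly what my workaround looks like (the tensor names and file paths are just placeholders):

```python
import torch

# Predictions for the same test set from each GPU (placeholder file names)
preds_v100 = torch.load("preds_v100.pt")  # reference values from the training GPU
preds_p100 = torch.load("preds_p100.pt")  # values from the inference GPU

# Mean offset between the two hardwares' outputs
offset = (preds_v100 - preds_p100).mean()

# Shift the P100 predictions by that mean difference
preds_p100_adjusted = preds_p100 + offset
```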
I am using PyTorch 1.8.0 for this work. Is there a better way to address this difference in results caused by the hardware mismatch between the training and inference environments?
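
For clarity, the determinism flags I know of in PyTorch 1.8 look like this, though as far as I know they only control run-to-run reproducibility on the same GPU, not differences between GPU models:

```python
import torch

# PyTorch 1.8: request deterministic kernel implementations where available.
# As far as I know, this only makes repeated runs reproducible on the SAME GPU;
# it does not make a P100 and a V100 produce bit-identical outputs.
torch.use_deterministic_algorithms(True)
torch.backends.cudnn.deterministic = True
torch.backends.cudnn.benchmark = False
```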

I am using the native PyTorch stack for inference; I am not using any optimization framework such as TensorRT.

Please see my post about TensorRT.

Thanks,
yuvaram