Good to hear that the original issue has been resolved. Regarding the CUDA out of memory error, it may be caused by insufficient GPU memory. Please check the available GPU memory using nvidia-smi and make sure enough memory is free. We recommend sharing the error logs and a model/script that reproduces the issue so we can assist better.
Please refer to the following and make sure your engine serialization and inference code is correct.
Regarding the DeepStream deployment, we recommend posting your query on the DeepStream forum, where you may get better help.
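As a sketch, the nvidia-smi memory check can be scripted from Python. This assumes nvidia-smi is on the PATH (its query flags below are standard) and returns None when it is not available:

```python
import shutil
import subprocess

def gpu_memory_report():
    """Return a CSV report of used/free GPU memory, or None if
    nvidia-smi is not available on this machine."""
    if shutil.which("nvidia-smi") is None:
        return None
    result = subprocess.run(
        ["nvidia-smi",
         "--query-gpu=memory.used,memory.free",
         "--format=csv"],
        capture_output=True, text=True,
    )
    # stdout looks like:
    # memory.used [MiB], memory.free [MiB]
    # 1234 MiB, 10000 MiB
    return result.stdout

report = gpu_memory_report()
if report is not None:
    print(report)
```

Running this before building or deserializing the engine makes it easy to confirm enough memory is free at that moment.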
Hi @spolisetty
The CUDA out of memory error is gone now.
But I am still getting random probabilities from TensorRT inference.
What should I do now to solve this?
I am also getting different values every time I rerun the inference code. How can I set a random seed so that I get the same predictions every time I run TensorRT inference?
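For reference, a minimal sketch of seeding the host-side RNGs (Python's `random` and NumPy) that are typically used in preprocessing or data shuffling. Note this is an assumption about where the randomness comes from: engine execution itself takes no seed, so if outputs still vary run to run, the cause is usually on the host side (e.g. uninitialized input buffers or random preprocessing) rather than inside the engine:

```python
import random
import numpy as np

def set_seeds(seed: int = 42) -> None:
    # Seed both host-side RNGs so any random preprocessing
    # (augmentation, shuffling, dummy inputs) repeats exactly.
    random.seed(seed)
    np.random.seed(seed)

# Demonstration: the same seed yields identical draws.
set_seeds(0)
a = np.random.rand(3)
set_seeds(0)
b = np.random.rand(3)
assert np.array_equal(a, b)
```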
@spolisetty I am still getting the same results in TensorRT, while the ONNX model works fine and gives the same accuracy as the original Keras model.
But the TensorRT model is still not giving bad accuracy like before.
Sorry, but it’s a little confusing. Could you please let us know whether you are facing an accuracy difference issue?
If yes, we request you to share the latest ONNX file and an issue-reproducing inference script that shows the ONNX inference output and the TensorRT inference output on the same input data.
We will also try to reproduce the issue from our end for better assistance.
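As a starting point for such a script, a minimal NumPy-only comparison helper is sketched below. The placeholder arrays stand in for the real outputs you would get from ONNX Runtime and the TensorRT engine on the same input; the tolerances are illustrative defaults, not a prescribed threshold:

```python
import numpy as np

def compare_outputs(onnx_out, trt_out, rtol=1e-3, atol=1e-5):
    """Compare two backend outputs elementwise.

    Returns (match, max_abs_diff) where `match` is True when the
    outputs agree within the given tolerances.
    """
    onnx_out = np.asarray(onnx_out, dtype=np.float32)
    trt_out = np.asarray(trt_out, dtype=np.float32)
    max_abs_diff = float(np.max(np.abs(onnx_out - trt_out)))
    match = np.allclose(onnx_out, trt_out, rtol=rtol, atol=atol)
    return match, max_abs_diff

# Placeholder arrays standing in for the two backends' outputs
# on the same input batch.
ok, diff = compare_outputs([0.1, 0.9], [0.1001, 0.8999])
print(f"match={ok}, max abs diff={diff:.6f}")
```

Feeding both backends the identical preprocessed tensor (same dtype, same layout) before calling this helper is essential; a dtype or NCHW/NHWC mismatch is a common cause of large reported differences.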