Same input different output with Tensorrt4.0.1.6

hl1008 · May 21, 2020, 7:46am

hi,

I have upgraded my code to TensorRT-7.0.0.11. The issue now looks slightly different. I ran a test with 20 worker threads, 100 rounds each. Among the 2000 result files, 1937 have the same md5 (fa6d87b1f5b220a30f7647654a60f6c0) and 63 have a different md5 (9d661f06c6a3c2f6c117c487227cbb9e).
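For reference, the md5 comparison above can be reproduced with a small script along these lines. This is only a sketch: the directory layout and the `*.bin` file pattern are assumptions for illustration, not the actual names used in the test.

```python
import hashlib
from collections import Counter
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex md5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def tally_md5(result_dir, pattern="*.bin"):
    """Count how many result files share each md5 digest.

    `result_dir` and `pattern` are hypothetical; adjust them to wherever
    the worker threads write their outputs.
    """
    counts = Counter()
    for path in sorted(Path(result_dir).glob(pattern)):
        counts[md5_of_file(path)] += 1
    return counts


if __name__ == "__main__":
    # Print digests from most to least common.
    for digest, n in tally_md5("results").most_common():
        print(digest, n)
```

With a run like the one described, this would list two digests, the majority one first.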

I notice that the different md5 seems to happen together with the following runtime error:

[TRT] FAILED_EXECUTION: std::exception
FAILED_EXECUTION: std::exception
FAILED_EXECUTION: std::exception
FAILED_EXECUTION: std::exception
[05/21/2020-15:04:09] [E] [TRT] FAILED_EXECUTION: std::exception
[05/21/2020-15:04:09] [F] [TRT] Assertion failed: *refCount > 0
../rtSafe/WeightsPtr.cpp:20
Aborting...

I guess that is the reason I get the wrong result. So now the situation is:

1. The slight precision difference between parallel runs has disappeared. All the valid results are identical.
2. There is some concurrency issue that leads to the runtime error and to invalid results, which I still need to resolve.
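One pattern that may be worth checking for item 2 is whether each worker thread uses its own execution context. The TensorRT documentation describes an engine as shareable across threads, while an execution context should not be used by several threads at the same time. The sketch below illustrates the per-thread pattern only; `FakeContext` is a hypothetical stand-in (in real code the context would come from something like `engine.create_execution_context()`), and whether this resolves the `refCount` assertion above is not confirmed.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class FakeContext:
    """Hypothetical stand-in for a non-thread-safe execution context."""

    def __init__(self):
        self.owner = threading.get_ident()

    def execute(self, x):
        # Fail loudly if a context is used by a thread that did not create it.
        assert self.owner == threading.get_ident(), "context shared across threads"
        return x * 2


# Thread-local storage: each worker thread lazily creates its own context.
_local = threading.local()


def get_context():
    ctx = getattr(_local, "ctx", None)
    if ctx is None:
        ctx = FakeContext()
        _local.ctx = ctx
    return ctx


def worker(x):
    return get_context().execute(x)


def run_all(inputs, num_threads=20):
    """Run every input through the pool; results keep the input order."""
    with ThreadPoolExecutor(max_workers=num_threads) as pool:
        return list(pool.map(worker, inputs))
```

The ownership check in `execute` is only there to make accidental sharing visible in the sketch; a real context has no such guard.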

After searching for the runtime error, I was led to this post, which seems similar to my case:

So that's the latest progress on my issue.

Thanks.

runtime env:

CentOS 7.5.1804
GPU: TITAN V
Driver Version: 410.48
CUDA version: 10.0
TensorRT-7.0.0.11
Cudnn: 7.6.5
