I had a quick try on T4 and observed the same issue. And we will investigate what’s happening.
BTW, I was able to run the model in FP32 mode, so can it be a temp option for you to get unblocked?
Sorry for the late updating.
Is this issue remaining, or what is the current status?
Could you get a chance to try the latest TensorRT 5.1, as I couldn’t find the context to validate the original issue against TensorRT 5.1?
I did observe your issue and it seems to be caused by very hidden bug within our building system.
Would you consider to run your model in FP32 or INT8 mode?
(INT8 running log shown below)
[I] [TRT] Debug synchronize completed successfully after build for layer: probability_colorize/Conv/Conv2D input reformatter 0 (type=9, tactic=0)
[I] [TRT] Debug synchronize completed successfully after build for layer: probability_colorize/Conv/Conv2D (type=14, tactic=-6980047749615980934)
[I] [TRT] Debug synchronize completed successfully after build for layer: probability_colorize/Conv/Tanh (type=0, tactic=0)
[I] [TRT] Debug synchronize completed successfully after build for layer: probability_colorize/out/Add (type=10, tactic=0)
[I] [TRT] Debug synchronize completed successfully after build for layer: probability_colorize/out/Mul_HL_1804289383 (type=10, tactic=0)
[I] [TRT] Debug synchronize completed successfully after build for layer: probability_colorize/out/Mul (type=19, tactic=0)
[I] [TRT] Data initialization and engine generation completed in 0.938048 seconds.
[I] Average over 10 runs is 9.04309 ms (host walltime is 9.27157 ms, 99% percentile time is 9.06378).
[I] Average over 10 runs is 9.11002 ms (host walltime is 9.34709 ms, 99% percentile time is 9.17792).
[I] Average over 10 runs is 9.20339 ms (host walltime is 9.45544 ms, 99% percentile time is 9.2665).
[I] Average over 10 runs is 9.22274 ms (host walltime is 9.48167 ms, 99% percentile time is 9.28752).
[I] Average over 10 runs is 9.23295 ms (host walltime is 9.49235 ms, 99% percentile time is 9.27085).
[I] Average over 10 runs is 9.20772 ms (host walltime is 9.46407 ms, 99% percentile time is 9.23286).
[I] Average over 10 runs is 9.23249 ms (host walltime is 9.49017 ms, 99% percentile time is 9.33901).
[I] Average over 10 runs is 9.28647 ms (host walltime is 9.54139 ms, 99% percentile time is 9.34672).
[I] Average over 10 runs is 9.23318 ms (host walltime is 9.4913 ms, 99% percentile time is 9.2841).
[I] Average over 10 runs is 9.38855 ms (host walltime is 9.64644 ms, 99% percentile time is 9.48998).
&&&& PASSED TensorRT.trtexec # ./trtexec --uff=iter_color_two_side_disp_160_ema_from_tf1_7_woabs.uff --uffInput=pan,1,136,240 --uffInput=color,3,136,240 --output=probability_colorize/out/Mul --int8 --verbose