Hi,
Thanks for providing more details about this issue.
We try to reproduce the accuracy drop with polygraphy on the ONNX model attached (ubt_20251229.onnx).
However, we found the output is nan for both ONNXRuntime and TensorRT backends.
Could you help us check why the output is not valid?
$ polygraphy run ubt_20251229.onnx --onnxrt --trt --verbose
...
[I] Accuracy Comparison | onnxrt-runner-N0-01/05/26-05:25:14 vs. trt-runner-N0-01/05/26-05:25:14
[I] Comparing Output: 'scores' (dtype=float32, shape=(1, 1)) with 'scores' (dtype=float32, shape=(1, 1))
[I] Tolerance: [abs=1e-05, rel=1e-05] | Checking elemwise error
[I] onnxrt-runner-N0-01/05/26-05:25:14: scores | Stats: mean=nan, std-dev=nan, var=nan, median=nan, min=nan at (0, 0), max=nan at (0, 0), avg-magnitude=nan, p90=nan, p95=nan, p99=nan
[I] ---- Values ----
[[nan]]
[V] Could not generate histogram. Note: Error was: supplied range of [nan, nan] is not finite
[I]
[I] trt-runner-N0-01/05/26-05:25:14: scores | Stats: mean=nan, std-dev=nan, var=nan, median=nan, min=nan at (0, 0), max=nan at (0, 0), avg-magnitude=nan, p90=nan, p95=nan, p99=nan
[I] ---- Values ----
[[nan]]
[V] Could not generate histogram. Note: Error was: supplied range of [nan, nan] is not finite
[I]
[I] Error Metrics: scores
[I] Minimum Required Tolerance: elemwise error | [abs=nan] OR [rel=nan] (requirements may be lower if both abs/rel tolerances are set)
[I] Absolute Difference | Stats: mean=nan, std-dev=nan, var=nan, median=nan, min=nan at (0, 0), max=nan at (0, 0), avg-magnitude=nan, p90=nan, p95=nan, p99=nan
[I] ---- Values ----
[[nan]]
[V] Could not generate histogram. Note: Error was: autodetected range of [nan, nan] is not finite
[I]
[I] Relative Difference | Stats: mean=nan, std-dev=nan, var=nan, median=nan, min=nan at (0, 0), max=nan at (0, 0), avg-magnitude=nan, p90=nan, p95=nan, p99=nan
[I] ---- Values ----
[[nan]]
[V] Could not generate histogram. Note: Error was: autodetected range of [nan, nan] is not finite
[I]
[E] FAILED | Output: 'scores' | Difference exceeds tolerance (rel=1e-05, abs=1e-05)
[E] FAILED | Mismatched outputs: ['scores']
[E] Accuracy Summary | onnxrt-runner-N0-01/05/26-05:25:14 vs. trt-runner-N0-01/05/26-05:25:14 | Passed: 0/1 iterations | Pass Rate: 0.0%
[E] FAILED | Runtime: 12.508s | Command: /home/nvidia/topic_356086/env/bin/polygraphy run ubt_20251229.onnx --onnxrt --trt --verbose
The tool helps our internal team to check the issue.
It can be installed via the following command:
$ pip3 install polygraphy
Thanks.