Please provide complete information as applicable to your setup.
• Hardware Platform (Jetson / GPU) Jetson NX
• DeepStream Version 5.0
• JetPack Version (valid for Jetson only) 4.4 DP
• TensorRT Version 7.1
• NVIDIA GPU Driver Version (valid for GPU only)
I am running the sample in /opt/nvidia/deepstream/deepstream-5.0/sources/objectDetector_SSD
with both int8 and fp16 mode, batch = 1. DLA not used.
I use 15W 6CORE power mode.
Both of the detection results are correct. I expect the int8 performance will be higher than fp16.
However, I found int8 and fp16 shows the similar performance, both is around 30fps.
Could you let me know why int8 has the same perf with fp16? and how to achieve higher fps with int8 than fp16?