I am comparing a Tesla V100 with an RTX 2080 Super.
Tesla V100 : CUDA 10.0 & TensorRT 5.1.5.0
RTX 2080 Super : CUDA 10.2 & TensorRT 7.0.0.0
I followed the same procedure as the one used here.
The Tesla V100 has very good hardware, so I expected it to perform well.
But the RTX 2080 Super is faster than the V100. I think the difference is about 1.5 times.
Why did this happen? Can the software version really make such a big difference?
The software versions (TensorRT 5 vs TensorRT 7, and CUDA 10.0 vs CUDA 10.2) can make significant performance differences, depending on the model. Can you compare both GPUs using the same versions? If you still see the performance difference, please share the scripts used to measure it so we can reproduce the result.
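
For reference, a measurement script could look roughly like the sketch below. It is not the script used in this thread, and it does not call TensorRT itself: `benchmark` and `run_inference` are hypothetical names, and `run_inference` stands for whatever function executes one inference on your engine. The sketch assumes that this function blocks until the GPU work has finished (for example by synchronizing the CUDA stream before returning); otherwise only the launch time is measured. Running the same script on both cards, with the same CUDA and TensorRT versions, should make the numbers comparable.

```python
import statistics
import time


def benchmark(run_inference, warmup=10, iterations=100):
    """Time a blocking inference callable and return latency statistics in ms."""
    # Warm-up runs are excluded so that one-time costs (lazy initialization,
    # clock ramp-up, memory allocation) do not skew the measurement.
    for _ in range(warmup):
        run_inference()

    latencies_ms = []
    for _ in range(iterations):
        start = time.perf_counter()
        run_inference()  # assumed to return only after the GPU work is done
        latencies_ms.append((time.perf_counter() - start) * 1000.0)

    return {
        "iterations": iterations,
        "mean_ms": statistics.mean(latencies_ms),
        "median_ms": statistics.median(latencies_ms),
        "min_ms": min(latencies_ms),
        "max_ms": max(latencies_ms),
    }


if __name__ == "__main__":
    # Placeholder CPU workload; replace it with the real inference call.
    def dummy_inference():
        sum(i * i for i in range(10000))

    print(benchmark(dummy_inference))
```

Reporting the median and the minimum alongside the mean helps to spot outliers, and posting the full output from both machines together with the exact CUDA, TensorRT and driver versions makes the comparison easier to reproduce.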