RTX 3070 vs RTX 3070 laptop

My environment :

Windows 10 64bit, CUDA 11.1, Cudnn 8.0.5, TensorRT 7.2.1.6, OpenCV 4.3.0

I test yolo v4 tiny model using RTX 3070 desktop and RTX 3070 laptop.

The inference speed in the desktop was slower than in the laptop.
It’s almost twice as slow.

The performance of the desktop is better,
I don’t understand how this result came about.

Hi,
Request you to share the model, script, profiler and performance output if not shared already so that we can help you better.
Alternatively, you can try running your model with trtexec command.
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/trtexec

While measuring the model performance, make sure you consider the latency and throughput of the network inference, excluding the data pre and post-processing overhead.
Please refer below link for more details:
https://docs.nvidia.com/deeplearning/tensorrt/archives/tensorrt-722/best-practices/index.html#measure-performance
https://docs.nvidia.com/deeplearning/tensorrt/best-practices/index.html#model-accuracy

Thanks!