Darknet slower using Jetpack 4.4 (cuDNN 8.0.0 / CUDA 10.2) than Jetpack 4.3 (cuDNN 7.6.3 / CUDA 10.0)

Same problem on PyTorch 1.4
I am follow this page to install PyTorch:

I am testing the YOLOv5:

and found JetPack 4.4 inference time is about 0.25s
the JetPack 4.3 inference time is about 0.14s