Hi
This might be an issue directly with darknet, I’ve also opened an issue there: Darknet slower using cuDNN 8.0.0 / CUDA 10.2 than cuDNN 7.6.3 / CUDA 10.0 on Jetson · Issue #5426 · AlexeyAB/darknet · GitHub , just let me know in this case I will close this.
Problem
When I use darknet and yolov3-tiny on my jetson nano with the latest Jetpack 4.4 DP I get worse performance than with Jetpack 4.3. My guess is that is related to cuDNN 8.0.0 vs 7.x
I get 6.6 FPS with Jetpack 4.4 DP whereas I get 16.3 FPS with Jetpack 4.3
Complete procedure to reproduce
jetpack 4.4 (cuDNN: 8.0.0 CUDA 10.2) with CUDNN=1 flag:
darknet build with:
GPU=1
CUDNN=1
OPENCV=1
ARCH= -gencode arch=compute_53,code=[sm_53,compute_53]
result for yolov3-tiny ./darknet detector demo cfg/coco.data cfg/yolov3-tiny.cfg yolov3-tiny.weights <videoinput>
CUDA-version: 10020 (10020), cuDNN: 8.0.0, GPU count: 1
OpenCV version: 4.1.1
FPS:6.3
cuDNN: 8.0.0 CUDA 10.2 with CUDNN=0 flag (jetpack 4.4):
darknet build with:
GPU=1
CUDNN=0
OPENCV=1
ARCH= -gencode arch=compute_53,code=[sm_53,compute_53]
result for yolov3-tiny ./darknet detector demo cfg/coco.data cfg/yolov3-tiny.cfg yolov3-tiny.weights <videoinput>
CUDA-version: 10020 (10020), GPU count: 1
OpenCV version: 4.1.1
FPS:13.3
cuDNN: 7.6.3 CUDA 10 with CUDNN=1 flag (jetpack 4.4):
GPU=1
CUDNN=1
OPENCV=1
ARCH= -gencode arch=compute_53,code=[sm_53,compute_53]
result for yolov3-tiny ./darknet detector demo cfg/coco.data cfg/yolov3-tiny.cfg yolov3-tiny.weights <videoinput>
CUDA-version: 10000 (10000), cuDNN: 7.6.3, GPU count: 1
OpenCV version: 4.1.1
FPS:16.4
cuDNN: 7.6.3 CUDA 10 with CUDNN=0 flag (jetpack 4.3):
GPU=1
CUDNN=0
OPENCV=1
ARCH= -gencode arch=compute_53,code=[sm_53,compute_53]
result for yolov3-tiny ./darknet detector demo cfg/coco.data cfg/yolov3-tiny.cfg yolov3-tiny.weights <videoinput>
CUDA-version: 10000 (10000), cuDNN: 7.6.3, GPU count: 1
OpenCV version: 4.1.1
FPS:13.3
Thanks