How to use GPU + 2 DLA can be 100FPS for YoloV3 on Xavier

qmara781128 · March 6, 2020, 8:06am

Hi

Continues this topic https://devtalk.nvidia.com/default/topic/1069371/tensorrt/yolov3-fps-on-tensorrt/.

In this link: https://devblogs.nvidia.com/jetson-xavier-nx-the-worlds-smallest-ai-supercomputer/, report inferencing FPS is close to 100 FPS of YOLO-V3(608x608) on AGX Xavier with TensorRT.(Figure 3)

we try more method in this issue topic link https://devtalk.nvidia.com/default/topic/1069371/tensorrt/yolov3-fps-on-tensorrt/
, but not solve this problem, so we create new topic focus on “How to use GPU + 2 DLA can be 100FPS for YoloV3 on Xavier”.

So far we have tried to use GPU+2DLA follow command:

sudo nvpmodel -m 0
sudo jetson_clocks

Terminal 1 command:

./trtexec --onnx='yolov3.onnx' --workspace=26 --int8 --useSpinWait --iterations=100

Terminal 2 command:

./trtexec --onnx=yolov3.onnx --workspace=30 --int8 --useSpinWait --iterations=100 --useDLACore=0 --allowGPUFallback

Terminal 3 command:

./trtexec --onnx=yolov3.onnx --workspace=26 --int8 --useSpinWait --iterations=100 --useDLACore=1 --allowGPUFallback

and result of FPS in this figure

Can the official provide actual example code?
Because this problem has been going on for a long time.

AastaLLL · March 9, 2020, 5:20am

Hi,

It looks like the GPU performance drop once the DLA launched.
We are reproducing this issue and will update more information asap.

Thanks.

AastaLLL · March 24, 2020, 8:36am

Hi,

Sorry for keeping waiting.

The performance drop is caused by some GPU resource is occupied by the DLA fallback layer.
We are checking this with our internal team. Will update more information with you later.

Thanks.

qmara781128 · March 30, 2020, 9:34am

Ok, we will wait for your reply!

Thanks.

AastaLLL · July 16, 2020, 7:55am

Hi,

Sorry for keeping you waiting.

We have a new software release (JetPack4.4 GA) and a benckmark script here:

Based on this script, we can get 1098fps on the YOLOv3-tiny with 416 resolution.
More detail, please check our latest benchmark report here:

Thanks.

mhk50 · October 12, 2020, 1:53pm

@AastaLLL I was able to make the benchmark scripts work!
How can I actually develop a customized Yolov3 (tiny or not) to achieve such FPS?
i.e. Run it on a video stream.

kayccc · October 22, 2020, 3:47am

Hi mhk5,

Please help to open a new topic for your issue. Thanks