Yolov3 on Xavier is slower than on Xavier NX

lingchao.zhu · August 28, 2020, 6:42am

Description

Hello,

I have transferred a Yolov3(ONNX) model to Xavier(Jetpack4.2.2) and Xavier NX(Jetpack4.4), but the model running on XavierNX is faster than on Xavier.

After some test, I have found that on Xavier a function named,
voidcuPointwise::launchPointwise<cuPointwise::SimpleAlgo<char, int>>(cuPointwise::LaunchParams, nvinfer1::VirtualMachineProgram) occupied the most time.

But on XavierNX this function hasn’t been invoked.

I also use another model to test, HigherHRNet(ONNX), but this will not call voidcuPointwise::launchPointwise<cuPointwise::SimpleAlgo<char, int>>(cuPointwise::LaunchParams, nvinfer1::VirtualMachineProgram) on Xavier.

Any ideas?

Environment

TensorRT Version: Xavier: TensorRT5.1, XavierNX: TensorRT7.1
CUDA Version: Xavier: 10.0, XavierNX: 10.2
CUDNN Version: Xavier: 7.5, XavierNX: 8.0.0

Relevant Files

Xavier

XavierNX

Steps To Reproduce

I test with trtexec and the command is:
Xavier:

./trtexec --onnx=/home/ets/Documents/yolov3/yolov3_bn16_m.onnx  --loadEngine=/home/ets/Documents/yolov3/yolov3_bn16_int8_m.engine --workspace=4096 --int8 --fp16 --batch=16

XavierNX:

./trtexec --onnx=/home/ets/Documents/yolov3/yolov3_bn16_m.onnx --loadEngine=/home/ets/Documents/yolov3/yolov3_bn16_in8t.engine --explicitBatch --workspace=4096 --fp16 --int8 --batch=16 --verbose

AakankshaS · August 28, 2020, 8:05am

Hi @lingchao.zhu,
Jetson Xavier team will be able to help you better here, hence moving your query to the respective forum.
Thanks!

AastaLLL · August 28, 2020, 8:37am

Hi,

There are always some performance improvement within each TensorRT package release.
So the improvement in NX may come from newer TensorRT API.

Would you mind to run the same test with Xavier + JetPack4.4 first?

Thanks.

Topic		Replies	Views
Jetson Xavier NX slower than Jetson TX2 at pytorch inferences Jetson Xavier NX benchmarks	4	644	June 29, 2023
Performance Expectation for Xavier NX Jetson Xavier NX tensorrt	2	511	October 18, 2021
FPS of yolov3 on xavier nx Jetson AGX Xavier yolo	5	1847	April 29, 2022
Xavier NX JP4.6 , Yolo v10 with c++n Jetson Xavier NX yolo	3	150	October 29, 2024
Why there is no difference in performance between tx2 and xavier?(Deep Learning Speed) Jetson AGX Xavier	9	1895	November 9, 2018
Yolov5 slow inference on Jetson Xavier NX16 Jetson Xavier NX ai	10	1770	October 26, 2022
TensorRT + YOLOv3 performance issue TensorRT	2	1469	June 13, 2019
Jetson Xavier NX yolov4 benchmark Jetson AGX Xavier yolo	4	1571	October 18, 2021
Run VIT model on NX tensorrt Jetson Xavier NX tensorrt	2	1211	March 25, 2022
TRT inference speed on two AGX Xavier TensorRT	1	348	September 12, 2021