DLA allows only same dimensions inputs to Elementwise

venkatkrishna772 · May 23, 2022, 8:07am

Description

When I am trying to create the engine file on DLA using below command
trtexec --onnx=./test_mul.onnx --explicitBatch --workspace=1024 --saveEngine=./test_mul_fp16.trt --verbose --fp16 --useDLACore=0 --allowGPUFallback

the multiplication layer in the onnx model is falling back to GPU instead of running on DLA with this warning
DLA allows only same dimensions inputs to Elementwise

how can we make the multiplication layer in this model to run on DLA?

Environment

TensorRT Version: 7.1.3
GPU Type: xavier
Nvidia Driver Version: Package:nvidia-jetpack, Version: 4.4
CUDA Version: 10.2
CUDNN Version: 8.0
Operating System + Version: Ubuntu 18.04
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

Relevant Files

onnx file test_mul.onnx - Google Drive

Steps To Reproduce

run below command

trtexec --onnx=./test_mul.onnx --explicitBatch --workspace=1024 --saveEngine=./test_mul_fp16.trt --verbose --fp16 --useDLACore=0 --allowGPUFallback

NVES · May 23, 2022, 8:37am

Hi,
Please check the below links, as they might answer your concerns.

Thanks!

venkatkrishna772 · May 23, 2022, 9:36am

Thanks, but my problem is that the multiplication operator was running fine on GPU, but when I am trying to run it on DLA, it’s automatically falling back to GPU. It’s saying that Pointwise multiplication with broadcast is not supported on DLA, Does this issue is fixed in the latest version of TensorRT or it’s still a limitation on DLA.

spolisetty · May 24, 2022, 10:21am

Hi,

Moving this post to the Jetson Xavier forum so the Jetson team can take a look for a better help.

Thank you.

AastaLLL · May 25, 2022, 3:37am

Hi,

Unfortunately no.
We test your model on TensorRT 8.2 and 8.4, the same error occurs.
Please enable --allowGPUFallback flag to use GPU instead.

Thanks.

venkatkrishna772 · May 25, 2022, 7:03am

Okay, thanks for the reply. I have enabled that flag and it ran on GPU, but Nowhere in the DLA documentation, there is not mentioned this broadcasting issue right?

AastaLLL · June 2, 2022, 6:00am

Hi,

On the contrary, we document that GPU supports broadcast feature if one of the input tensors has lengths equal to 1.

Since DLA is a hardware-based inference engine, it is not as flexible as GPU.
It can only support the basic elementwise operators.

Thanks.

venkatkrishna772 · June 6, 2022, 9:38am

Sorry for the delay and thanks for your reply

system · June 29, 2022, 5:51am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Trtexec problem Jetson AGX Xavier tensorrt , jetson-inference	6	1684	September 27, 2021
[Need help] An error occurred while using DLA Jetson Xavier NX nvbugs , dla	10	949	October 18, 2021
General Question about jetson Xavier NX Jetson Xavier NX dla	15	1574	October 18, 2021
We want to use GPU+DLA. How do I use DLA when converting onnx to trt model? Is there a python sample Jetson Xavier NX jetson-inference	4	1066	September 19, 2021
TensorRT run DLA on Xavier Jetson AGX Xavier nvbugs	11	1620	October 18, 2021
Resize layer for DLA Jetson Xavier NX dla	4	985	October 18, 2021
FP16 builder does not work, DLA does not accept anything, How to accelerate Deep Learning? Jetson AGX Xavier tensorrt	7	1188	February 9, 2022
Using DLA for Peoplenet tlt model in Deepstream DeepStream SDK jetson-inference , dla	2	1154	October 12, 2021
Accessing Jetson's DLA from python TensorRT tensorrt , jetson-inference , python	3	2194	December 1, 2020
Wrong results when running network on DLA instead of GPU Jetson AGX Xavier	14	1150	October 18, 2021