TensorRT optimization for DLA

We are trying to use the tensorRT_optimization tool to optimize some fairly standard segmentation models (lots of convolution/deconvolution layers), which works fine when optimizing for GPU. However, when optimizing for DLA with the --useDLA flag, the tool fails with some odd errors, including
Error (conv2d/kernel not running on DLA)
and
Error (truediv/y not running on DLA)

What’s the situation with the DLA support in the tool?

The NVIDIA DRIVE 9.0 software release notes mention:
6.15 TensorRT Deep Learning Accelerator Performance Limitation
In this release of DRIVE Software, TensorRT does not include full Deep Learning
Accelerator (DLA) support and therefore it is not at full performance.

Is this what we are encountering when using the tool? Is there a workaround?

Hi,

Could you add --winograd 0 to see if it helps?

Thanks.

Unfortunately, this didn’t help; the errors are exactly the same.

We can’t seem to find any record of this --winograd option. Is there complete documentation of all the options somewhere?

Hi,

You can find some related information in our TensorRT documentation.
The TensorRT version of DRIVE Software 9.0 is v5.0.
Here is the support matrix for DLA in v5.0:
TensorRT Support Matrix :: Deep Learning SDK Documentation

The support matrix you linked states that 2D convolution layers are supported on DLA, so we’re not sure why we get the “conv2d/kernel not running on DLA” error. There are broadcasting limitations for convolution layers stated here, but we don’t seem to have an issue with those, since the limitations are not DLA specific (and our model does optimize for GPU). We have also found more details about DLA support in the Developer Guide :: NVIDIA Deep Learning TensorRT Documentation (this doesn’t seem to be specific to version 5.0.3). Again, we comply with these guides in our conv2D use.
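
To narrow this down on our side, we are considering querying per-layer DLA support directly through the TensorRT C++ API. A minimal sketch, assuming the TensorRT 5.x IBuilder interface (the function name is ours, not part of the API):

```cpp
// Minimal sketch, assuming the TensorRT 5.x C++ API: after parsing the
// network, ask the builder which layers DLA can actually execute.
#include <NvInfer.h>
#include <iostream>

void reportDlaSupport(nvinfer1::IBuilder& builder, nvinfer1::INetworkDefinition& network)
{
    builder.setFp16Mode(true);  // DLA requires reduced precision (FP16 or INT8)
    for (int i = 0; i < network.getNbLayers(); ++i)
    {
        nvinfer1::ILayer* layer = network.getLayer(i);
        std::cout << layer->getName() << ": "
                  << (builder.canRunOnDLA(layer) ? "OK on DLA" : "not supported on DLA")
                  << "\n";
    }
}
```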

Also, we can’t find any mention of the winograd option in the TensorRT documentation. We are not sure whether you are referring to the trtexec TensorRT tool or to the DRIVE AGX specific /usr/local/driveworks/tools/dnn/tensorRT_optimization tool. We are trying to use the latter, tensorRT_optimization, as described in the NVIDIA DRIVE documentation.

Dear tonci.antunovic,
What’s the situation with the DLA support in the tool?

If you are generating a TensorRT model for DLA using TensorRT_Optimization, all layers in your DNN need to be supported on DLA. With trtexec you can use the --allowGPUFallback flag to allow unsupported layers to run on the GPU; this is not supported by the TensorRT_Optimization tool.
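
If building through the TensorRT C++ API directly is an option for you, the same fallback behavior can be requested at build time. A minimal sketch, assuming the TensorRT 5.x IBuilder interface (the function name and workspace size are illustrative):

```cpp
// Minimal sketch, assuming the TensorRT 5.x C++ API: build an engine that
// targets DLA but lets unsupported layers fall back to the GPU, i.e. the
// equivalent of trtexec's --allowGPUFallback flag.
#include <NvInfer.h>

nvinfer1::ICudaEngine* buildDlaEngineWithFallback(nvinfer1::IBuilder& builder,
                                                  nvinfer1::INetworkDefinition& network)
{
    builder.setFp16Mode(true);                                 // DLA requires FP16 or INT8
    builder.setDefaultDeviceType(nvinfer1::DeviceType::kDLA);  // place layers on DLA by default
    builder.setDLACore(0);                                     // use DLA core 0
    builder.allowGPUFallback(true);                            // unsupported layers run on GPU
    builder.setMaxBatchSize(1);
    builder.setMaxWorkspaceSize(1 << 28);                      // 256 MiB scratch space
    return builder.buildCudaEngine(network);
}
```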

As you are trying to get a TensorRT model for DLA, could you double-check that all layers in your DNN are supported on DLA? Please share the layer details so we can understand the issue. It would be great if you could share the network architecture file so that we can reproduce the issue on our end.