Please provide complete information as applicable to your setup.
**• Hardware Platform (Jetson / GPU)** Jetson AGX Orin
**• DeepStream Version** 6.2
**• JetPack Version (valid for Jetson only)** 5.1
**• TensorRT Version** 8.5.2
I have converted a FaceNet model (face recognition) from PyTorch to ONNX to be used in a DeepStream configuration file. However, running the config file to build an engine targeting the DLA causes shuffle layers, which are not part of the model itself, to fall back to the GPU. Shuffle layers are added 28 times and constant layers are added 28 times, while the model itself has only 22 layers, as shown below.
What exactly does the DLA require that causes these shuffle/constant layers to be added? Is it due to the reconstruction process? This many shuffle layers has caused processing delays, because they fall back to the GPU, which forces data to be transferred back and forth between the GPU and the DLA.
Per the TensorRT documentation, the DLA does support shuffle layers, so why do they fall back to the GPU in my case? FaceNet_DLAEngine_unsupportedLayers.txt (9.8 KB)
I have attached the log file generated during engine file creation.
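For reference, the same DLA engine build can be reproduced outside DeepStream with trtexec, whose build log reports which layers are placed on the DLA and which fall back to the GPU (the file names below are placeholders for your own paths):

```shell
trtexec --onnx=facenet.onnx \
        --fp16 \
        --useDLACore=0 \
        --allowGPUFallback \
        --saveEngine=facenet_dla.engine \
        --verbose
```

`--fp16` is needed because the DLA does not run FP32, and `--allowGPUFallback` mirrors what DeepStream does when a layer cannot be placed on the DLA.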
Does this mean that these shuffle layers also exist in the engine file built to run the model on the GPU, not only on the DLA?
And is there a way to avoid the creation of these shuffle layers? As mentioned, the model itself consists of only 22 layers, so generating 28 shuffle layers exceeds even the model's own layer count.
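For what it's worth, the TensorRT Python API can ask the builder, layer by layer, whether each of the original layers is DLA-eligible. A minimal sketch, assuming the exported model is named `facenet.onnx` (note this inspects only the 22 layers present in the parsed network; the extra shuffle/constant layers reported in the build log are inserted later, during engine compilation):

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.INFO)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

# Parse the ONNX model (path is a placeholder).
with open("facenet.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise SystemExit("ONNX parse failed")

config = builder.create_builder_config()
config.default_device_type = trt.DeviceType.DLA
config.DLA_core = 0
config.set_flag(trt.BuilderFlag.FP16)          # DLA requires FP16 or INT8
config.set_flag(trt.BuilderFlag.GPU_FALLBACK)  # allow unsupported layers on GPU

# Report, per layer, whether the builder considers it runnable on DLA.
for i in range(network.num_layers):
    layer = network.get_layer(i)
    placement = "DLA" if config.can_run_on_DLA(layer) else "GPU (fallback)"
    print(f"{i:3d}  {str(layer.type):28}  {layer.name:40}  -> {placement}")
```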
> Does this mean that these shuffle layers also exist in the engine file built to run the model on the GPU, not only on the DLA?
Please build the engine for the GPU and check the information in the compile log.
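For example, a plain GPU build with trtexec prints the layer information in its verbose log, which you can then compare against the DLA build log (file names are placeholders):

```shell
trtexec --onnx=facenet.onnx --fp16 --saveEngine=facenet_gpu.engine --verbose
```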
GPU and DLA have quite different support matrices so the underlying implementation won’t be the same…
We have a repo that demonstrates how to optimize a model for DLA.
Please give it a check:
It means the operator is supported by the DLA hardware, but the compiler does not enable that functionality yet.
If you want to request support for a certain layer, please file an RFE.