Deploy three AI model engines on both DLAs and GPU

Omid306 · September 20, 2023, 10:42am

Hi,
I want to concurrently and independently of each other run and inference 3 networks. 1 on DLA0, 1 on DLA1 and the third on the GPU, using the python on Jetson Xavier NX.
It would be appreciated if anyone let me know the possibility and implementation of that and help and give me some sample codes.

AastaLLL · September 21, 2023, 3:37am

Hi,

DLA is a hardware-based accelerator so it has some constraints in the layers:

You can find a Python inference sample below:
https://elinux.org/Jetson/L4T/TRT_Customized_Example#OpenCV_with_ONNX_model

Setting DeviceType to DLA can create a DLA engine:

https://docs.nvidia.com/deeplearning/tensorrt/api/python_api/infer/Core/BuilderConfig.html?highlight=dla#tensorrt.DeviceType

Thanks.

Omid306 · September 25, 2023, 7:52am

Hi again,
thank you for your response.
In TRTInference class, when I make an object to execute my_model.engin inference on DLA, does it matter what device I have configured for Device in this line:
“self.cfx = cuda.Device(0).make_context()”
(this line is in TRTInference class not multi-thread)
If yes, how can I config that? I used “dla:0” in cuda.Device and got an error about the argument being wrong.
On the other hand, I think that since I built and save the engine to run on DLA, I don’t need to make other settings when I execute inference on DLA.
I am completely confused. :(

I have another question about multi-threading code, when I use thread1 and thread2, after calling start and join, will both threads be executed at the same time?

AastaLLL · September 26, 2023, 7:37am

Hi,

You can create a DLA engine with trtexec command and run it with Python.
In general, the following configure need to be set:

config.dla_core = 0
config.set_flag(trt.BuilderFlag.GPU_FALLBACK)

Below is our DLA tutorial for your reference:

Thanks.

system · October 10, 2023, 7:37am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Run a part of DNN on DLA and part of DNN on GPU Jetson AGX Xavier dla	7	1227	February 14, 2023
How to run two inferences on different DLAs？ TensorRT	2	609	October 27, 2020
General Question about jetson Xavier NX Jetson Xavier NX dla	15	1592	October 18, 2021
How does the TRT inference run on both DLA and GPUs? Jetson Orin NX tensorrt , dla	2	853	August 30, 2023
How to Execute both Deep Learning Accelerator(DLA) and GPU at the same time in python Jetson Nano tensorrt , jetson-inference , python , dla	3	803	April 26, 2023
Multiple models on DLAs in AGX Xavier 32TOPs Jetson AGX Xavier	13	1369	October 18, 2021
Tensorrt Python API has a bug in DLA usage Jetson AGX Xavier tensorrt	11	658	August 17, 2022
Jetson Orin: Running DLA and GPU cores at the same time Jetson AGX Orin dla	4	961	October 19, 2022
DLA1 is offline on my Jetson Orin Jetson Orin NX dla	4	962	June 13, 2023
How to use GPU+2 * DLA inference model TensorRT	0	221	February 4, 2024

Deploy three AI model engines on both DLAs and GPU

Related topics