I want to deploy super-resolution DNNs on an NVIDIA Jetson AGX Orin 32GB.
The super-resolution code is written in PyTorch, and it uses two data types (float32 and int16).
So I would like to ask the following five questions.
Q1) If I port this code without quantization, will it run on the CUDA cores?
Q2) What should I do to make this code run on the Tensor Cores?
Q3) What should I do to make this code run on the DLA?
Q4) If I convert my FP32 code to TF32, how much will the performance improve on the Jetson?
Q5) If I convert my INT16 code to FP16, how much will the performance improve on the Jetson?
1. If you run inference on the GPU, the model will use the CUDA cores.
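For reference, a minimal PyTorch inference sketch. `SRNet` here is a hypothetical ESPCN-style stand-in for your model; moving the model and input to the CUDA device is all that is needed for the GPU (CUDA core) path:

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the super-resolution network (ESPCN-style).
class SRNet(nn.Module):
    def __init__(self, scale=2):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 3 * scale * scale, 3, padding=1),
            nn.PixelShuffle(scale),  # rearranges channels into a 2x upscale
        )

    def forward(self, x):
        return self.body(x)

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = SRNet().to(device).eval()

with torch.no_grad():
    lr = torch.rand(1, 3, 64, 64, device=device)  # low-res input
    sr = model(lr)                                # runs on CUDA cores when device is "cuda"

print(sr.shape)  # torch.Size([1, 3, 128, 128])
```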
2. TensorRT will use the Tensor Cores whenever a layer can run on them.
3. Use TensorRT or the cuDLA API.
4 & 5. Please check the data formats supported by TensorRT below. For example, INT16 is not supported.
Performance is expected to increase when quantization is applied.
But the speedup is model- and layer-dependent, so please benchmark it directly to get an idea.
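To get a first-order idea before building TensorRT engines, you can compare precisions directly in PyTorch. This is a rough timing harness (the toy model is a placeholder for your network), not a substitute for trtexec measurements:

```python
import time
import torch
import torch.nn as nn

def time_inference(model, x, iters=50):
    # Synchronize around the timed region so asynchronous GPU kernels are counted.
    if x.is_cuda:
        torch.cuda.synchronize()
    start = time.perf_counter()
    with torch.no_grad():
        for _ in range(iters):
            model(x)
    if x.is_cuda:
        torch.cuda.synchronize()
    return (time.perf_counter() - start) / iters

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = nn.Sequential(
    nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
    nn.Conv2d(32, 3, 3, padding=1),
).to(device).eval()
x = torch.rand(1, 3, 256, 256, device=device)

fp32_ms = time_inference(model, x) * 1e3
print(f"FP32: {fp32_ms:.2f} ms")

if device.type == "cuda":
    # FP16 comparison only makes sense on the GPU (Tensor Cores on Orin).
    half_ms = time_inference(model.half(), x.half()) * 1e3
    print(f"FP16: {half_ms:.2f} ms")
```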