General Question about Jetson Xavier NX

Hello.

1- The Jetson Xavier NX has three accelerators for deep learning inference (the GPU and two DLA cores). Can I run three separate models, one on each accelerator, simultaneously?
2- To run AI models on the DLA, do I need to change the code I use for the GPU on a Jetson Nano or a desktop GPU? Does using the DLA require different code? Can these two accelerators run any deep learning framework, or only TensorRT?
3- Do the two DLAs only support INT8, or FP16 as well? What about the GPU: does it support INT8, or only FP16?
4- For inference, how can I assign model1 to DLA0, model2 to DLA1, and model3 to the GPU?
5- Is the general workflow on this device similar to the Jetson Nano, or are there large differences?

Until you receive a more qualified answer, maybe the answers in a post I made will shed some light on some of your questions.

I’m quite sure the GPU supports FP16 and INT8.
I’m quite sure the DLAs can only run TensorRT models optimized for INT8.
The ISP, the DLAs, the 7-way VPU, and the x2 PCIe make it quite different from the Jetson Nano, even for camera support. I think you should look at the AGX Xavier to find usable material.

Thanks a lot.
I want to know how I can assign one model to DLA0 and a second model to DLA1.
On a multi-GPU desktop, we can assign a specific model to GPU0 with the command below, or set it with the os package.

export CUDA_VISIBLE_DEVICES=0

For GPU 1:

export CUDA_VISIBLE_DEVICES=1

We can also use this:

os.environ["CUDA_VISIBLE_DEVICES"] = "0"  # for GPU0
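For example, on a multi-GPU desktop each model can be pinned to its own GPU by launching one process per model. This is a sketch; the worker script names are just placeholders:

```python
import os
import subprocess

def gpu_env(gpu_id):
    """Build a child-process environment that exposes only one GPU.

    CUDA_VISIBLE_DEVICES must be set before the CUDA runtime in the
    child initializes, so it is passed at process-creation time.
    """
    env = dict(os.environ)
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    return env

if __name__ == "__main__":
    # Hypothetical worker scripts, one model each:
    # subprocess.Popen(["python3", "model1.py"], env=gpu_env(0))
    # subprocess.Popen(["python3", "model2.py"], env=gpu_env(1))
    print(gpu_env(0)["CUDA_VISIBLE_DEVICES"])  # prints 0
```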

For the DLAs and the GPU of the Jetson Xavier NX, how can I make a specific assignment like the above?

Hi,

1. We suppose yes.
But please note that the DLA is a hardware-based inference engine with a limited range of supported operations.
It’s recommended to check whether your model can be fully deployed on the DLA first.
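As a concrete sketch: once one engine per accelerator has been built, the three inferences can run as three independent processes (the engine file names below are hypothetical):

```shell
# Three independent inference jobs, one per accelerator.
# The engines are placeholders built beforehand with trtexec.
trtexec --loadEngine=model1_dla0.engine --useDLACore=0 &
trtexec --loadEngine=model2_dla1.engine --useDLACore=1 &
trtexec --loadEngine=model3_gpu.engine &
wait
```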

2. The DLA can be enabled directly with the TensorRT API:

nvinfer1::IBuilder* builder = nvinfer1::createInferBuilder(gLogger);
builder->setFp16Mode(true);    // DLA requires FP16 (or INT8) precision
builder->setDLACore(0);        // or builder->setDLACore(1) for the second DLA core
...
nvinfer1::IRuntime* runtime = nvinfer1::createInferRuntime(gLogger);
runtime->setDLACore(0);        // select the same core when deserializing the engine
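The same selection is also available from the TensorRT Python bindings. A minimal sketch, assuming TensorRT 7.x (JetPack 4.4+) and an ONNX model whose path is a placeholder:

```python
def build_dla_engine(onnx_path, dla_core=0):
    """Build a TensorRT engine targeting one DLA core (0 or 1 on Xavier NX)."""
    import tensorrt as trt  # available on the device once JetPack is installed

    logger = trt.Logger(trt.Logger.WARNING)
    builder = trt.Builder(logger)
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, logger)
    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            raise RuntimeError("failed to parse " + onnx_path)

    config = builder.create_builder_config()
    config.set_flag(trt.BuilderFlag.FP16)          # DLA runs FP16 or INT8
    config.set_flag(trt.BuilderFlag.GPU_FALLBACK)  # unsupported layers -> GPU
    config.default_device_type = trt.DeviceType.DLA
    config.DLA_core = dla_core                     # 0 or 1 on Xavier NX
    return builder.build_engine(network, config)
```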

3. FP16 and INT8.
4. Please check answer No. 2.
5. Similar.

Thanks.

Hi, LoveNvidia

Please note that the DLA is a separate hardware engine, not part of the GPU.
So the export command won’t assign the inference job to the DLA; it only selects among GPUs.

Currently, the DLA must be enabled through the TensorRT API.
Details can be found in our document here:
https://docs.nvidia.com/deeplearning/tensorrt/developer-guide/index.html#dla_topic

Thanks.

Are there Python bindings for the TensorRT API so that I can assign the DLA from Python?

Thanks a lot, @AastaLLL

Hi @AastaLLL
Is it possible to set the DLA core to 0/1 with the TensorFlow-TensorRT integration API?
Which TensorRT version is needed, at minimum, for DLA?

Hi @AastaLLL
How can I use the Tensor Cores of the Jetson Xavier NX? For the DLA I have to use TensorRT only; do the Tensor Cores also require TensorRT only? Is it possible to run models on them directly, like on the GPU?

Hi,

1.
You will need to use JetPack 4.4 to get Xavier NX support.
So please use TensorRT 7.1.x or later to get DLA support on the NX.

2.
TF-TRT doesn’t expose DLA support.
For this, you can add the DLA calls below in the TF-TRT source and rebuild the TensorFlow package:

builder->allowGPUFallback(true);   // fall back to GPU for unsupported layers
builder->setDLACore(0);            // choose DLA core 0 or 1

3.
Tensor Cores are part of the GPU, so you can access them directly with CUDA. See these samples:

/usr/local/cuda-10.2/samples/0_Simple/cudaTensorCoreGemm
/usr/local/cuda-10.2/samples/0_Simple/immaTensorCoreGemm

Thanks.

@AastaLLL, Thanks so much,
1- How can I generate a TensorRT engine with the trtexec tool for TLT models? Is it possible? How can I do it?
2- With trtexec, is it possible to set the converted engine to run on a given DLA?

Hi,

1. You will need to use DeepStream for TLT format support.

2. Yes. Please add --useDLACore=0 or --useDLACore=1 when executing.
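For example, a typical invocation might look like this (the model and engine file names are placeholders):

```shell
# Build an engine for DLA core 0, falling back to the GPU for
# layers the DLA does not support:
trtexec --onnx=model.onnx --fp16 --useDLACore=0 --allowGPUFallback \
        --saveEngine=model_dla0.engine

# Run the saved engine, selecting the same DLA core at load time:
trtexec --loadEngine=model_dla0.engine --useDLACore=0
```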

Thanks.


Thanks @AastaLLL,
Do I also need to set allowGPUFallback=1? What does it mean? What happens with allowGPUFallback?

Hi,

If your model cannot be fully supported on the DLA, this setting moves the unsupported layers back to the GPU implementation.

Thanks.


@AastaLLL,
Is NVVM memory only related to Jetson devices?