1- On the Jetson Xavier NX, we have three accelerators (GPU/DLA1/DLA2) for running deep learning models. I want to know: can we run three separate models, one on each accelerator, simultaneously?
2- To run AI models on the DLA, do we need to change the code we use for the GPU on the Jetson Nano or a desktop GPU? Does using the DLA to run a deep model require different code? Can these two accelerators run any deep learning framework, or only TensorRT?
3- Do these two DLAs only support INT8, or can they also support FP16? What about the GPU? Does it support INT8, or only FP16?
4- For the inference code, how can I set DLA1 to run model1, DLA2 to run model2, and the GPU to run model3?
5- Is the general structure of working with this device similar to the Jetson Nano, or is there a large difference?
Until you receive a more qualified answer, maybe the answers in this post I made will bring you some light on some of your questions.
I’m quite sure the GPU supports FP16 and INT8.
I’m quite sure the DLAs can only run TensorRT models optimized for INT8.
The ISP, DLAs, 7-way VPU, and x2 PCIe make it quite different from the Jetson Nano, even for camera support. I think you should look at AGX Xavier resources to find usable information.
Thanks a lot,
I want to know how I can assign model1 to DLA1 and model2 to DLA2.
On a multi-GPU desktop, we can assign a specific model to GPU0 with the command below; we can also set it with the os package.
export CUDA_VISIBLE_DEVICES=0
For GPU1:
export CUDA_VISIBLE_DEVICES=1
We can also use this:
os.environ["CUDA_VISIBLE_DEVICES"] = "0"  # for GPU0
How can I do something like the above for the DLAs and the GPU of the Jetson Xavier NX?
1. We suppose yes.
But please note that the DLA is a hardware-based inference engine, which limits the range of supported operations.
It's recommended to check whether your model can be fully deployed on the DLA first.
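For example, here is a minimal sketch of such a check with the pre-7.x TensorRT builder API (it assumes a builder and a parsed network already exist, as in the code in answer no. 2 below):
#include <NvInfer.h>
#include <iostream>
// canRunOnDLA is only meaningful once a DLA-compatible precision and device type are set.
builder->setFp16Mode(true);
builder->setDefaultDeviceType(nvinfer1::DeviceType::kDLA);
// Print every layer the DLA cannot execute; these would need GPU fallback.
for (int i = 0; i < network->getNbLayers(); ++i)
{
    nvinfer1::ILayer* layer = network->getLayer(i);
    if (!builder->canRunOnDLA(layer))
        std::cout << "Needs GPU fallback: " << layer->getName() << std::endl;
}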
2. DLA can be enabled directly with the TensorRT API:
// Build-time: the DLA only runs FP16/INT8, and unsupported layers need GPU fallback.
nvinfer1::IBuilder* builder = nvinfer1::createInferBuilder(logger);
builder->setFp16Mode(true);
builder->setDefaultDeviceType(nvinfer1::DeviceType::kDLA);
builder->setDLACore(0); // or builder->setDLACore(1) for the second DLA
builder->allowGPUFallback(true); // let unsupported layers fall back to the GPU
...
// Run-time: the runtime must target the same DLA core the engine was built for.
nvinfer1::IRuntime* infer = nvinfer1::createInferRuntime(logger);
infer->setDLACore(0); // or infer->setDLACore(1) for the second DLA
3. FP16 and INT8.
4. Please check answer no. 2; a rough sketch for three models is below.
5. Similar.
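For no. 4, here is a rough sketch of the deserialization side (model1Data/model1Size etc. are placeholder buffers holding each serialized engine; each engine must have been built for its target device as in answer no. 2):
// model1 -> DLA core 0
nvinfer1::IRuntime* runtime1 = nvinfer1::createInferRuntime(logger);
runtime1->setDLACore(0);
nvinfer1::ICudaEngine* engine1 = runtime1->deserializeCudaEngine(model1Data, model1Size, nullptr);
// model2 -> DLA core 1
nvinfer1::IRuntime* runtime2 = nvinfer1::createInferRuntime(logger);
runtime2->setDLACore(1);
nvinfer1::ICudaEngine* engine2 = runtime2->deserializeCudaEngine(model2Data, model2Size, nullptr);
// model3 -> GPU (no DLA core set)
nvinfer1::IRuntime* runtime3 = nvinfer1::createInferRuntime(logger);
nvinfer1::ICudaEngine* engine3 = runtime3->deserializeCudaEngine(model3Data, model3Size, nullptr);
The three engines can then be executed from separate threads with their own execution contexts, so the GPU and both DLA cores can run concurrently.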
Hi @AastaLLL
How can I use the Tensor Cores of the Jetson Xavier NX? For the DLA I have to use TensorRT only; do the Tensor Cores also require TensorRT only? Is it possible to run models on these cores directly, like on the GPU?
@AastaLLL, Thanks so much. 1- How can I generate a TensorRT engine with the trtexec tool for TLT models? Is it possible, and how can I do it? 2- With trtexec, is it possible to set the converted engine to run on a given DLA?
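For no. 2, I guess it is something like the command below, but I am not sure about the exact flags (model.onnx and the engine file name are just example placeholders):
trtexec --onnx=model.onnx --fp16 --useDLACore=0 --allowGPUFallback --saveEngine=model_dla0.engine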