DLA for object detection supported with TF-TRT on Xavier?

I’m trying to use TF-TRT to run inference on object detection networks on the Jetson AGX Xavier Developer Kit. To understand how the different models perform on the Xavier, I benchmarked all of the models from the Object Detection Model Zoo: I downloaded each model and converted it using the code linked below. However, the performance numbers I get do not seem consistent with the DLA inference benchmarks I’ve seen (also linked below). The numbers were collected by timing the session.run() call to TensorFlow (a sketch of the methodology appears after the questions), running on the Xavier in MAXN mode with all clocks maxed (after jetson_clocks.sh). My hypothesis is that either the conversion step or the inference step is not making use of the DLA engines, and that the GPU is handling the entire inference stage. Based on that, I have a few questions:

  1. Is there a way to verify my hypothesis? That is, can I determine whether TF-TRT is executing the model on the DLA or on the GPU?
  2. Assuming the GPU is being used, is there a way to convert models such that the DLA is used as much as possible?
  3. If it is not possible to use the DLA with tf-trt yet, is there an ETA on when support may be enabled?
  4. If it is not possible to use the DLA with tf-trt yet, what is the recommended way of running tensorflow object detection models with the DLA? As this is still early access hardware, with lots of things in flux, I’m not sure what the current best practice is.
  5. Another thing to note: only some of the models could be converted with the conversion script (which just calls trt.create_inference_graph internally), and batch sizes greater than 1 only seem to be supported on SSD topologies. Is this expected?
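
For reference, here is roughly how the timing numbers above were collected. This is a minimal sketch assuming a TF 1.x frozen graph produced by the converter; the file name and tensor names (frozen_graph.pb, image_tensor, detection_boxes, and so on — the standard TF Object Detection API names) should be substituted with your model's actual values. It also counts TRTEngineOp nodes, which is one crude way to approach question 1: each TRTEngineOp is a fused TensorRT subgraph, and everything outside them runs as ordinary TensorFlow ops.

```python
import time
import numpy as np
import tensorflow as tf

# Load the already-converted frozen graph ("frozen_graph.pb" is a placeholder).
graph_def = tf.GraphDef()
with tf.gfile.GFile("frozen_graph.pb", "rb") as f:
    graph_def.ParseFromString(f.read())

# How much of the graph did TF-TRT actually convert? Each TRTEngineOp node is
# a fused TensorRT subgraph; all remaining nodes stay as ordinary TF ops.
n_trt = sum(1 for n in graph_def.node if n.op == "TRTEngineOp")
print("TRTEngineOp nodes: %d of %d total" % (n_trt, len(graph_def.node)))

with tf.Graph().as_default() as graph:
    tf.import_graph_def(graph_def, name="")
    # Standard TF Object Detection API tensor names (placeholders).
    inp = graph.get_tensor_by_name("image_tensor:0")
    outs = [graph.get_tensor_by_name(name + ":0") for name in
            ("detection_boxes", "detection_scores", "detection_classes")]

batch = np.random.randint(0, 256, size=(1, 300, 300, 3), dtype=np.uint8)

with tf.Session(graph=graph) as sess:
    for _ in range(10):  # warm-up runs (engine deserialization, autotuning)
        sess.run(outs, feed_dict={inp: batch})
    runs = 50
    start = time.time()
    for _ in range(runs):
        sess.run(outs, feed_dict={inp: batch})
    print("mean latency: %.1f ms" % ((time.time() - start) / runs * 1e3))
```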

Object Detection Model Zoo - https://github.com/tensorflow/models/blob/master/research/object_detection/g3doc/detection_model_zoo.md
Conversion code - https://github.com/NVIDIA-AI-IOT/tf_trt_models
Performance numbers - trt_graphs (Google Sheets)
Benchmark 1 - https://developer.nvidia.com/embedded/jetson-agx-xavier-dl-inference-benchmarks
Benchmark 2 - NVIDIA Jetson AGX Xavier Benchmarks - Incredible Performance On The Edge Review (Phoronix)

Hi theholyhades1, DLA is not supported in TF-TRT. To run a TensorFlow model on the DLAs, you would need to use the UFF workflow and import the model with the TensorRT C++ API.

Note that currently, the only networks officially verified on DLA are ResNet-50, GoogLeNet, AlexNet, and LeNet, so you will probably need GPU fallback enabled so that unsupported layers can run on the GPU instead. A sketch of those builder settings follows.
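
This is roughly what the DLA placement settings look like on the TensorRT side once you have a UFF file. A minimal sketch using the TensorRT Python API; the exact knobs have moved between releases (on TensorRT 5 they sit on the builder itself, on later releases on the builder config as shown here), and the file name, input/output names, and input shape are placeholders:

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.INFO)

builder = trt.Builder(TRT_LOGGER)
network = builder.create_network()  # implicit-batch network, as UFF requires
parser = trt.UffParser()

# Placeholder names/shapes -- substitute your model's actual values.
parser.register_input("image_tensor", (3, 300, 300))
parser.register_output("detection_out")
parser.parse("model.uff", network)

builder.max_batch_size = 1
config = builder.create_builder_config()
config.max_workspace_size = 1 << 28
config.set_flag(trt.BuilderFlag.FP16)            # DLA requires FP16 or INT8
config.default_device_type = trt.DeviceType.DLA  # place layers on the DLA...
config.DLA_core = 0                              # ...on core 0 (Xavier has two)
config.set_flag(trt.BuilderFlag.GPU_FALLBACK)    # unsupported layers fall back to GPU

engine = builder.build_engine(network, config)
```

With GPU_FALLBACK set, TensorRT logs at build time which layers it placed on the DLA and which fell back to the GPU, which is a useful sanity check for how much of the network the DLA can actually run.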

Hi Dusty,

Thanks for the clarification. I’ll try the UFF workflow and post my results. Am I correct in assuming that TF-TRT currently uses only the GPU on the Xavier?

Yes, that is correct. FYI, here is a GitHub issue about it filed against TensorFlow master: https://github.com/tensorflow/tensorflow/issues/23437

BTW, here is a tutorial on using the UFF workflow: https://github.com/NVIDIA-AI-IOT/tf_to_trt_image_classification
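
For anyone following along, the first step of that workflow, converting a frozen TensorFlow graph to a .uff file, boils down to a call into the uff package that ships with TensorRT. A minimal sketch; the file and node names are placeholders:

```python
import uff

# Convert a frozen TensorFlow graph to UFF. The frozen-graph path and the
# output node name are placeholders for your model's actual values.
uff_model = uff.from_tensorflow_frozen_model(
    "frozen_graph.pb",
    output_nodes=["detection_out"],
    output_filename="model.uff",
)
```

Note that object detection graphs usually contain ops (preprocessing, NMS, and so on) that UFF cannot represent directly, so in practice the graph typically needs graphsurgeon preprocessing and TensorRT plugins for those pieces before it will parse.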