Is it possible to run a Triton Inference Server with the DALI backend on an AGX Xavier running JP4.6?

I have a Jetson AGX Xavier on JP4.6 that I cannot upgrade further, as my camera drivers are incompatible with JP5. I would like to run Triton on that Jetson as a local server, and I want to use DALI+TRT ensembles.

I am, however, facing the following problem. The DALI backend has shipped with Triton by default since release 20.11, which is more than two years old, yet according to the official Triton release notes it requires CUDA 11.1.

And JetPack 4.6 comes with CUDA 10.2.

I tried running a more up-to-date Triton container, 22.10, as a Docker image on the Jetson and added a simple DALI model as a test (a sketch follows after the traceback), but I get the following error:

Traceback (most recent call last):
  File "<string>", line 5, in <module>
  File "<frozen importlib._bootstrap>", line 553, in module_from_spec
AttributeError: 'NoneType' object has no attribute 'loader'
Traceback (most recent call last):
  File "<string>", line 8, in <module>
  File "/opt/tritonserver/backends/dali/conda/envs/dalienv/lib/python3.8/site-packages/nvidia/dali/_utils/autoserialize.py", line 70, in invoke_autoserialize
    dali_pipeline().serialize(filename=filename)
  File "/opt/tritonserver/backends/dali/conda/envs/dalienv/lib/python3.8/site-packages/nvidia/dali/pipeline.py", line 1166, in serialize
    self._init_pipeline_backend()
  File "/opt/tritonserver/backends/dali/conda/envs/dalienv/lib/python3.8/site-packages/nvidia/dali/pipeline.py", line 693, in _init_pipeline_backend
    self._pipe = b.Pipeline(self._max_batch_size,
RuntimeError: CUDA runtime API error cudaErrorInsufficientDriver (35):
CUDA driver version is insufficient for CUDA runtime version
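
For reference, the test model is just a DALI pipeline definition that the DALI backend auto-serializes when the model is loaded. Mine looks roughly like the sketch below; the operators, batch size, and input name here are illustrative placeholders rather than the exact pipeline:

import nvidia.dali.fn as fn
import nvidia.dali.types as types
from nvidia.dali import pipeline_def
from nvidia.dali.plugin.triton import autoserialize

# The DALI backend picks up an @autoserialize-decorated pipeline from the model
# version directory (e.g. model_repository/<model>/1/dali.py) and serializes it
# at server startup -- that is the invoke_autoserialize/serialize call in the
# traceback above, and constructing the GPU pipeline there is what raises
# cudaErrorInsufficientDriver.
@autoserialize
@pipeline_def(batch_size=8, num_threads=2, device_id=0)
def test_pipeline():
    # External source fed from Triton requests; the name must match config.pbtxt.
    raw = fn.external_source(device="cpu", name="DALI_INPUT_0")
    # GPU ("mixed") decode and resize to the size the downstream TRT model expects.
    images = fn.decoders.image(raw, device="mixed", output_type=types.RGB)
    images = fn.resize(images, resize_x=224, resize_y=224)
    return images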

So my question is: has anyone done this before? Can I run a DALI-backed Triton server on JP4.6? I'd love to hear your feedback before I spend a dozen hours on something that may be bound to fail.

Hi @alexander.soklev, that container isn't built for Jetson/JetPack, so it won't work as intended. Instead, you could try one of the deepstream-l4t:*-triton containers that is compatible with the version of JetPack-L4T that you are running.
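
For reference, launching one of those containers on the Jetson would look roughly like the sketch below; the tag and mounted paths are placeholders to adapt (pick the deepstream-l4t tag on NGC that matches your L4T release), and whether the standalone tritonserver binary is included is something to verify inside the image:

sudo docker run -it --rm --runtime nvidia --network host \
    -v /path/to/model_repository:/models \
    nvcr.io/nvidia/deepstream-l4t:6.0.1-triton /bin/bash
# inside the container, if the standalone server binary is available:
#   tritonserver --model-repository=/models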

I'm not sure whether those containers were built with the DALI backend, and I'm not personally experienced with building it, but I was able to find this about building it for aarch64:
