Hi,
Thanks a lot for your patience.
It turns out that the nvcr.io/nvidia/tritonserver container does work well on JetPack 5.
Please see the test steps below.
Server: tritonserver:24.02-py3-igpu
$ git clone -b r24.02 https://github.com/triton-inference-server/server.git
$ cd server/docs/examples/
$ ./fetch_models.sh
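If the script succeeds, model_repository/ should contain a layout roughly like the following (based on the r24.02 docs/examples; exact file names may differ slightly between releases):

```
model_repository/
├── densenet_onnx/
│   ├── config.pbtxt
│   ├── densenet_labels.txt
│   └── 1/
│       └── model.onnx
└── inception_graphdef/
    ├── config.pbtxt
    ├── inception_labels.txt
    └── 1/
        └── model.graphdef
```

The "simple_*" models listed in the server log below are also part of the examples directory and are picked up from the same repository.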
$ sudo docker run -it --rm --runtime nvidia --network host -v ${PWD}/model_repository:/models nvcr.io/nvidia/tritonserver:24.02-py3-igpu tritonserver --model-repository=/models
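Once the container is up, you can confirm the server is ready with Triton's standard HTTP health endpoint (port 8000 is Triton's default HTTP port; replace localhost with the device IP if checking remotely). This requires the server container above to be running:

```shell
# Returns HTTP 200 once the server and all loaded models are ready.
curl -s -o /dev/null -w "%{http_code}\n" localhost:8000/v2/health/ready
```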
You should see backend and model status logs like the following:
...
I0327 04:32:46.516401 1 server.cc:634]
+-------------+-----------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------+
| Backend | Path | Config |
+-------------+-----------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------+
| tensorflow | /opt/tritonserver/backends/tensorflow/libtriton_tensorflow.so | {"cmdline":{"auto-complete-config":"true","backend-directory":"/opt/tritonserver/backends","min-compute-capability":"5.300000", |
| | | "default-max-batch-size":"4"}} |
| onnxruntime | /opt/tritonserver/backends/onnxruntime/libtriton_onnxruntime.so | {"cmdline":{"auto-complete-config":"true","backend-directory":"/opt/tritonserver/backends","min-compute-capability":"5.300000", |
| | | "default-max-batch-size":"4"}} |
+-------------+-----------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------+
I0327 04:32:46.516911 1 server.cc:677]
+----------------------+---------+--------+
| Model | Version | Status |
+----------------------+---------+--------+
| densenet_onnx | 1 | READY |
| inception_graphdef | 1 | READY |
| simple | 1 | READY |
| simple_dyna_sequence | 1 | READY |
| simple_identity | 1 | READY |
| simple_int8 | 1 | READY |
| simple_sequence | 1 | READY |
| simple_string | 1 | READY |
+----------------------+---------+--------+
...
Client: tritonserver:24.02-py3-igpu-sdk
You should see the classification output after sending a query like the one below.
We tested this from another Xavier NX, but running the client on the same device should also work.
$ sudo docker run -it --rm --runtime nvidia --network host nvcr.io/nvidia/tritonserver:24.02-py3-igpu-sdk
# /workspace/install/bin/image_client -u [IP]:8000 -m densenet_onnx -c 3 -s INCEPTION /workspace/images/mug.jpg
Request 0, batch size 1
Image '/workspace/images/mug.jpg':
15.349564 (504) = COFFEE MUG
13.227465 (968) = CUP
10.424894 (505) = COFFEEPOT
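For reference, the -s INCEPTION flag tells image_client to apply Inception-style input scaling before inference. A minimal sketch of that normalization, assuming a uint8 RGB input (the exact resize/crop steps inside image_client may differ; this only illustrates the pixel scaling commonly used for Inception-trained models):

```python
import numpy as np

def inception_scale(img: np.ndarray) -> np.ndarray:
    """Map uint8 pixel values [0, 255] to float32 in [-1, 1],
    the Inception-style normalization."""
    return (img.astype(np.float32) / 127.5) - 1.0

# Example: a tiny dummy "image" with extreme pixel values.
img = np.array([[0, 255], [128, 64]], dtype=np.uint8)
out = inception_scale(img)
print(out.min(), out.max())  # -1.0 1.0
```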
Thanks.