DeepStream with Triton

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU) : NVIDIA GeForce RTX 3090
• DeepStream Version : 6.3
• JetPack Version (valid for Jetson only)
• TensorRT Version : 12.2
• NVIDIA GPU Driver Version (valid for GPU only) : 535.104.05
• Issue Type (questions, new requirements, bugs)
• How to reproduce the issue? (This is for bugs. Include which sample app is used, the configuration file contents, the command line used, and other details for reproducing.)

The deepstream-test3.py pipeline example uses PeopleNet object detection with DeepStream and Triton. The example works successfully, but when I tried to check whether the Triton server is running using the commands below:

Command1: ps aux | grep tritonserver
Output

root 1284 0.0 0.0 3304 720 pts/3 S+ 12:43 0:00 grep --color=auto tritonserver

Command2: curl -Is http://localhost:8000/v2/health/live
Output

Empty

Command3: netstat -tuln | grep 8000
Output

Empty

Command4: tritonserver
I1007 12:43:40.197500 1289 metrics.cc:747] Collecting metrics for GPU 0: NVIDIA GeForce RTX 3090
I1007 12:43:40.198636 1289 metrics.cc:640] Collecting CPU metrics
I1007 12:43:40.198792 1289 tritonserver.cc:2364]
+----------------------------------+------------------------------------------+
| Option                           | Value                                    |
+----------------------------------+------------------------------------------+
| server_id                        | triton                                   |
| server_version                   | 2.32.0                                   |
| server_extensions                | classification sequence model_repository |
|                                  | model_repository(unload_dependents)      |
|                                  | schedule_policy model_configuration      |
|                                  | system_shared_memory cuda_shared_memory  |
|                                  | binary_tensor_data parameters statistics |
|                                  | trace logging                            |
| model_control_mode               | MODE_NONE                                |
| strict_model_config              | 0                                        |
| rate_limit                       | OFF                                      |
| pinned_memory_pool_byte_size     | 268435456                                |
| min_supported_compute_capability | 6.0                                      |
| strict_readiness                 | 1                                        |
| exit_timeout                     | 30                                       |
| cache_enabled                    | 0                                        |
+----------------------------------+------------------------------------------+

I1007 12:43:40.199734 1289 server.cc:281] No server context available. Exiting immediately.
error: creating server: Invalid argument - --model-repository must be specified

Does the Triton server work on a different port?
I see that the Triton server is not up, so how can I make sure that DeepStream uses Triton as the backend for inference?

Appreciate your feedback

As the log shows, tritonserver failed to start because you did not pass the “--model-repository” parameter. The correct command is tritonserver --model-repository=xx (the engine/model repository path); see the example below.
Please refer to the deepstream-test3 README for Triton gRPC mode.
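
For example, a minimal start-and-verify sequence might look like this (a sketch; the repository path shown is the samples directory shipped with DeepStream and is an assumption, substitute your own):

# start Triton against a model repository (path is a placeholder)
tritonserver --model-repository=/opt/nvidia/deepstream/deepstream/samples/triton_model_repo
# from another shell, check liveness; a running server returns "HTTP/1.1 200 OK"
curl -Is http://localhost:8000/v2/health/live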

But how did deepstream-test3.py work without Triton, given that this example uses Triton as the backend?

The nvinfer plugin will not use Triton to do inference. Please refer to the documentation.

I used nvinferserver.

The nvinferserver plugin will use Triton to do inference. Please refer to the documentation.

Yes, I know. My question is: I ran deepstream-test3.py, which uses Triton, yet the Triton server is not up. How is that possible?

nvinferserver supports two Triton modes, CAPI and gRPC. If using gRPC, you need to start tritonserver.

  1. “--pgie nvinferserver” means using CAPI mode; you only need to prepare the engine for Triton. “--pgie nvinferserver-grpc” means using gRPC mode; besides the engines, you need to start tritonserver (see the example commands below).
  2. Noticing that you are using gRPC mode, did you manage to start tritonserver?
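
For reference, here is a sketch of how the two modes are typically selected when launching the sample (the input URI and config file names follow the deepstream-test3 README and are assumptions that may differ in your DeepStream version):

# CAPI (native) mode: nvinferserver loads Triton in-process, no separate server needed
python3 deepstream_test_3.py -i file:///opt/nvidia/deepstream/deepstream/samples/streams/sample_720p.h264 \
    --pgie nvinferserver -c config_triton_infer_primary_peoplenet.txt
# gRPC mode: connects to an already-running tritonserver instance
python3 deepstream_test_3.py -i file:///opt/nvidia/deepstream/deepstream/samples/streams/sample_720p.h264 \
    --pgie nvinferserver-grpc -c config_triton_grpc_infer_primary_peoplenet.txt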

I am using CAPI mode, so how can I verify that Triton is up in this mode?

If using CAPI mode, you don’t need to start tritonserver. The nvinferserver plugin calls the Triton CAPI interface to let Triton do the inference. You can set log_level in model_repo to check whether Triton is working; see the snippet below.
BTW, the nvinferserver plugin is open source in DS 6.3, so you can check the code if interested.
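
As an illustration, the log_level field sits under model_repo in the nvinferserver config file (a minimal sketch; the model name and repository root are placeholders):

infer_config {
  backend {
    triton {
      model_name: "peoplenet"       # placeholder model name
      version: -1                   # -1 selects the latest version
      model_repo {
        root: "./triton_model_repo" # placeholder repository path
        strict_model_config: true
        log_level: 2                # higher values print more verbose Triton logs
      }
    }
  }
}

With a non-zero log_level, Triton’s own startup and inference logs appear in the application output, which is one way to confirm the in-process (CAPI) Triton instance is active.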

Could you share a link that talks about the DeepStream CAPI mode so I can learn more details?

Thanks for your support

I’m closing this topic since there has been no update from you for a while, assuming this issue was resolved.
If you still need support, please open a new topic. Thanks.

Please find “native” in this link; it is also called CAPI in the Triton documentation.
Please refer to the CAPI samples in /opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app-triton/ (see the example below).
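
For instance, to browse and try those samples (the config file name passed to deepstream-app below is hypothetical; pick one from the listing):

# list the CAPI (native) sample configs shipped with DeepStream
cd /opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app-triton/
ls
# run one with the reference app (file name is illustrative; use one from the listing)
deepstream-app -c source1_primary_detector.txt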

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.