Triton server instance group configuration

Can we configure both a GPU and a CPU instance for a particular model, as below?

instance_group [
  {
    count: 1
    gpus: 0
    kind: KIND_GPU
  },
  {
    count: 2
    kind: KIND_CPU
  }
]

because I am getting this error:

E0322 06:45:55.883454 73 model_repository_manager.cc:1215] failed to load 'Vehicle_model' version 1: Invalid argument: instance group Vehicle_model_0 of model Vehicle_model must be KIND_GPU and must specify at least one GPU id

Please provide the setup info as below:

• Hardware Platform (Jetson / GPU): GPU
• DeepStream Version: 6.0
• TensorRT Version: 8.0.1
• NVIDIA GPU Driver Version (valid for GPU only): 495.29.05
• Issue Type( questions, new requirements, bugs): questions, bugs

• Hardware Platform (Jetson / GPU): Tesla T4
• DeepStream Version: 6.0
• TensorRT Version: 8.0.1
root@6b55a2214e5a:/opt/nvidia/deepstream/deepstream-6.0/sources/deepstream-apps/client/src/python/examples# nvidia-smi
Tue Mar 22 13:22:01 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 510.47.03    Driver Version: 510.47.03    CUDA Version: 11.6     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla T4            On   | 00000000:00:1E.0 Off |                    0 |
| N/A   41C    P0    26W /  70W |  13495MiB / 15360MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      6150      C                                   13445MiB |
+-----------------------------------------------------------------------------+

DS-Triton supports Triton's GPU + CPU multi-instance setup running together on the same model, but not all Triton models/backends can run in both GPU mode and CPU mode. For example, TensorRT models don't support CPU mode, some TensorFlow models are frozen into GPU-only mode, and some backends only support CPU data processing. You need to check whether your model has CPU support. Triton's multi-instance doc: https://github.com/triton-inference-server/server/blob/main/docs/model_configuration.md#instance-groups
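Given that the error says the instance group "must be KIND_GPU and must specify at least one GPU id", 'Vehicle_model' is most likely a TensorRT model, so the KIND_CPU instance group in your config is rejected. For reference, a minimal sketch of the two cases (the backend choices here are illustrative assumptions, not taken from your setup): a TensorRT model's config.pbtxt must use KIND_GPU with at least one GPU id, while a backend with CPU support, e.g. onnxruntime, can mix GPU and CPU instance groups.

# GPU-only instance group, as required for TensorRT models;
# gpus explicitly lists GPU id 0.
instance_group [
  {
    count: 1
    kind: KIND_GPU
    gpus: [ 0 ]
  }
]

# Mixed GPU + CPU instance groups, valid only for a backend that
# supports both devices (for example onnxruntime):
instance_group [
  {
    count: 1
    kind: KIND_GPU
    gpus: [ 0 ]
  },
  {
    count: 2
    kind: KIND_CPU
  }
]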
