Concurrent Model Execution In Same Client Script

skilic · September 20, 2021, 7:22am

Hello,

I have 4 models deployed on a Triton Inference Server and 10 images.

I need to use the Concurrent Model Execution feature exactly as described. (https://github.com/triton-inference-server/server/blob/main/docs/architecture.md#concurrent-model-execution)

All images will arrive on the same computer at 1-second intervals, and I need to send them to the Triton server from the same client script as they are received.

How should I do this?
Do I have to use parallel programming?
Or is there another way to do it?

Thanks for your help

nadeemm · September 29, 2021, 5:29pm

Please re-post your question on the Triton Inference Server GitHub page; the NVIDIA and other teams will be able to help you there.
Sorry for the inconvenience, thanks for your patience.
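
In the meantime, here is a minimal sketch of one possible client-side pattern for the scenario in the question. It is not an official answer. Triton can only run models concurrently if several requests are in flight at the same time, so the client has to submit requests without waiting for earlier ones to finish. The sketch uses a thread pool for that. The model names and the `infer` function are hypothetical placeholders: `infer` only simulates latency so the example runs without a server, and in a real script it would presumably wrap a request made through the `tritonclient` package (for example its blocking `infer` call, or `async_infer` without the thread pool).

```python
import time
from concurrent.futures import ThreadPoolExecutor

# Hypothetical model names; replace with the names in your model repository.
MODEL_NAMES = ["model_a", "model_b", "model_c", "model_d"]


def infer(model_name, image):
    """Placeholder for one blocking inference request (assumption).

    It only simulates request latency so the sketch runs without a
    Triton server; a real client would send the image to the server here.
    """
    time.sleep(0.05)
    return f"{model_name}:{image}"


def run(images, interval_s=1.0, infer_fn=infer):
    """Submit each image to every model as soon as the image arrives."""
    futures = []
    with ThreadPoolExecutor(max_workers=len(MODEL_NAMES)) as pool:
        for image in images:
            # One request per model, submitted without waiting for results,
            # so the four requests are in flight at the same time.
            for name in MODEL_NAMES:
                futures.append(pool.submit(infer_fn, name, image))
            # Stand-in for waiting until the next image is received.
            time.sleep(interval_s)
        # Leaving the "with" block waits for all outstanding requests.
    # Results are returned in submission order: image by image, model by model.
    return [future.result() for future in futures]
```

Calling `run` with a list of ten image identifiers and the default `interval_s=1.0` submits 40 requests in total, four per image. Whether the server actually executes them in parallel depends on the model and instance configuration on the Triton side.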
