Description
I am building a web app for visualizing neural-network inference results, using C++ / TensorRT / httplib.
When a user opens a project, the server calls cudaSetDevice(assigned GPU id), deserializes the corresponding TensorRT engine, and creates an execution context. This happens in whatever worker thread httplib happens to dispatch the request to.
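Roughly, the project-open path looks like the sketch below (names such as `ProjectContext` and `openProject` are just illustrative, and error checking / cleanup are omitted):

```cpp
// Simplified sketch of the project-open path. Names like ProjectContext and
// openProject are illustrative; error checking and cleanup are omitted.
#include <NvInfer.h>
#include <cuda_runtime_api.h>
#include <cstdio>
#include <fstream>
#include <iterator>
#include <mutex>
#include <string>
#include <vector>

class Logger : public nvinfer1::ILogger {
    void log(Severity severity, const char* msg) noexcept override {
        if (severity <= Severity::kWARNING) std::printf("%s\n", msg);
    }
};

// Everything the server keeps per opened project.
struct ProjectContext {
    int deviceId = 0;
    nvinfer1::ICudaEngine* engine = nullptr;
    nvinfer1::IExecutionContext* context = nullptr;
    std::mutex mtx;  // serializes execute()/enqueue() calls on this context
};

// Runs in whichever httplib worker thread receives the "open project" request.
ProjectContext* openProject(const std::string& planPath, int deviceId, Logger& logger) {
    cudaSetDevice(deviceId);  // bind this thread to the assigned GPU

    std::ifstream file(planPath, std::ios::binary);
    std::vector<char> plan((std::istreambuf_iterator<char>(file)),
                           std::istreambuf_iterator<char>());

    auto* runtime = nvinfer1::createInferRuntime(logger);
    auto* proj = new ProjectContext;
    proj->deviceId = deviceId;
    proj->engine = runtime->deserializeCudaEngine(plan.data(), plan.size());
    proj->context = proj->engine->createExecutionContext();
    return proj;
}
```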
When a user queries an image's inference result, the server looks up the corresponding context's pointer, runs the inference, and sends the result back, again in whatever worker thread handles that request. std::mutex and std::lock are used to make sure there are NO CONCURRENT CALLS to context->execute(), context->enqueue(), etc.
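The per-image query path is roughly the following (again just a sketch, reusing the `ProjectContext` from the previous snippet; how the input/output device buffers are prepared is left out):

```cpp
// Simplified sketch of the per-image query path, using the ProjectContext
// from the previous snippet. Binding setup and result handling are left out.
void runInference(ProjectContext* proj, void* const* deviceBindings, cudaStream_t stream) {
    // I assume each worker thread also has to select the project's GPU before
    // touching its context; this is part of what I am unsure about on multi-GPU.
    cudaSetDevice(proj->deviceId);

    std::lock_guard<std::mutex> guard(proj->mtx);  // no concurrent enqueue() on one context
    proj->context->enqueueV2(deviceBindings, stream, nullptr);
    cudaStreamSynchronize(stream);  // wait for the result before sending the HTTP response
}
```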
The above process works fine on a single-GPU server.
But I am worried that a multi-GPU environment, or some edge case, may cause problems.
Is it safe in TensorRT to create the engine & context in one thread and execute in another thread?
Environment
TensorRT Version: 8.4
GPU Type: RTX 2080 Ti
Nvidia Driver Version: 510.47.03
CUDA Version: 11.6
CUDNN Version: 8.3
Operating System + Version: Ubuntu 18.04