Run Inference on multiple models concurrently using TensorRT

cparks27 · January 30, 2020, 9:15pm

I am trying to run inference on two different models concurrently using TensorRT.

I have serialized both models as .engine files using ofstream, as several forum posts have shown, then attempted to deserialize them and run inference on them concurrently. I am running into two errors when attempting to do this:

ERROR: Cannot deserialize plugin FancyActivation
ERROR: getPluginCreator could not find plugin FancyActivation version 001 namespace

concurrencyTest: cuda/caskConvolutionLayer.cpp:153: virtual void nvinfer1::rt::task::caskConvolutionLayer::allocateResources(const nvinfer1::rt::CommonContext&): Assertion `configIsValid(context)’ failed.

How would you advise running inference on two models concurrently?

I am running this on an NVIDIA Jetson Xavier with TensorRT version 5.0.6. The code is written using the C++ TensorRT API.

Topic		Replies	Views
Concurrently run two or more engine on a tensorrt TensorRT	4	2450	April 14, 2020
Unable to run two TensorRT models in a cascade manner TensorRT tensorrt , python	7	5091	October 12, 2021
Multiple different engine inference simoultaneously with TensorRt c++ TensorRT cudnn	0	39	October 15, 2024
How can I infer sequentially using two independent TensorRT models? Jetson Nano tensorrt	2	410	March 30, 2022
Is it possible to run multiple TensorRT model inference on a GPU simultaneously and parallelly? TensorRT tensorrt , cuda	3	2170	August 23, 2022
TensorRT Concurrent inference in C++ TensorRT cudnn	4	700	February 6, 2024
Unable to do inference of multiple engines in parallel TensorRT tensorrt , nano	3	1827	May 6, 2022
Batch inference parallelization on tensorrt TensorRT tensorrt , cuda	5	1037	May 5, 2021
Not able to inference multiple input models using TRT TensorRT tensorrt , tensorflow , jetson-inference	1	486	August 12, 2021
Run multiple model(engine) with tensorrt without deepstream TensorRT	1	1175	April 20, 2020

Run Inference on multiple models concurrently using TensorRT

Related topics