Multi-GPU, multi-model, multi-thread

Q: How can I use multiple GPUs, multiple threads, and multiple models together?
A: Each ICudaEngine object is bound to a specific GPU when it is instantiated, either by the builder or on deserialization. To select the GPU, use cudaSetDevice() before calling the builder or deserializing the engine. Each IExecutionContext is bound to the same GPU as the engine from which it was created. When calling execute() or enqueue(), ensure that the thread is associated with the correct device by calling cudaSetDevice() if necessary.
A: Call the enqueue() function of each execution context on a different CUDA stream so that they can run in parallel.
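The two answers above can be sketched in C++ against the TensorRT API. This is a minimal illustration, not a complete program: it assumes TensorRT 8.x (`nvinfer1`), the CUDA runtime, and that `planData`/`planSize` hold a serialized engine built for the target GPU; the `bindings` array of device buffers is also a placeholder the caller would have to allocate.

```cpp
#include <NvInfer.h>
#include <cuda_runtime_api.h>
#include <cstdio>

using namespace nvinfer1;

// Minimal logger required by the TensorRT runtime.
class Logger : public ILogger {
    void log(Severity severity, const char* msg) noexcept override {
        if (severity <= Severity::kWARNING) std::printf("%s\n", msg);
    }
} gLogger;

// Run one model on one GPU; intended to be called from its own thread.
void runOnGpu(int gpuId, const void* planData, size_t planSize,
              void* const* bindings) {
    // Bind this thread to the target device *before* any TensorRT call,
    // so the deserialized engine and its context live on that GPU.
    cudaSetDevice(gpuId);

    IRuntime* runtime = createInferRuntime(gLogger);
    ICudaEngine* engine = runtime->deserializeCudaEngine(planData, planSize);
    IExecutionContext* context = engine->createExecutionContext();

    // One stream per context: contexts enqueued on different streams
    // (and different GPUs) can execute in parallel.
    cudaStream_t stream;
    cudaStreamCreate(&stream);
    context->enqueueV2(bindings, stream, nullptr);
    cudaStreamSynchronize(stream);

    cudaStreamDestroy(stream);
    delete context;
    delete engine;
    delete runtime;
}
```

A caller would typically launch one `std::thread` per GPU (e.g. `std::thread(runOnGpu, 0, planA, sizeA, bindingsA)`), each thread deserializing its own engine after its own `cudaSetDevice()` call.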

Hi @FreedomLiX ,
This seems to be a note/sharing about TensorRT. Do you need support from a DeepStream perspective?

Yeah!

Yes. How can this be implemented?

Can you describe your question for DeepStream?
Examples of using multiple models:

Binding a model to a specific GPU can be achieved with the gpu-id property in the nvinfer parameters: Gst-nvinfer — DeepStream 6.2 Release documentation (nvidia.com).
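For example, in a pipeline with two nvinfer instances, each instance reads its own config file and can be pinned to its own GPU via gpu-id. A minimal fragment for the second instance might look like this (the engine file name is a placeholder):

```ini
# Config file for the second nvinfer instance: pin this model to GPU 1.
[property]
gpu-id=1
model-engine-file=model_b.engine
batch-size=1
```

The first instance's config file would be identical in shape but with gpu-id=0 and its own engine file.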

You can share your requirements/pipeline for further discussion.

Without DeepStream, how can I implement parallel model inference? Is there any sample code?

Sorry, but this forum focuses on DeepStream topics; you may rephrase your question and ask it in the TensorRT forum.

Thank you!

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.