@SunilJB Thanks for the reply.
I found the following post: Questions about efficient memory management for TensorRT on TX2 - #6 by Beerend
In that thread, the OP suggested the following implementation:
```cpp
void ObjectDetector::runInference() {
    util::Logger log("ObjectDetector::runInference");
    trt_context->enqueue(batch_size, &trt_input_gpu, cuda_stream, nullptr);
    cudaStreamSynchronize(cuda_stream);
    cudaDeviceSynchronize();
}
```
Would this be the correct implementation? Also, how does the context know how many elements to read for the input tensor? Does it derive this from the binding dimensions stored in the engine, multiplied by the batch_size passed to enqueue()?
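For reference, here is a minimal sketch of how I currently picture the full call, assuming the implicit-batch enqueue() API and an engine with one input and one output binding. The names ("input", "output") and the float element type are my own placeholders, not taken from the linked post:

```cpp
// Minimal sketch, implicit-batch TensorRT API. Binding names, the float
// element type, and the single-input/single-output layout are assumptions.
#include <NvInfer.h>
#include <cuda_runtime_api.h>

void runInference(nvinfer1::ICudaEngine* engine,
                  nvinfer1::IExecutionContext* context,
                  cudaStream_t stream,
                  const float* host_input, float* host_output,
                  int batch_size)
{
    // enqueue() expects an array with one device pointer per binding,
    // ordered by binding index (inputs and outputs together).
    const int input_index  = engine->getBindingIndex("input");   // assumed name
    const int output_index = engine->getBindingIndex("output");  // assumed name

    // Per-sample element counts come from the binding dimensions baked into
    // the engine at build time; batch_size multiplies them at enqueue time.
    auto volume = [](const nvinfer1::Dims& d) {
        size_t v = 1;
        for (int i = 0; i < d.nbDims; ++i) v *= d.d[i];
        return v;
    };
    const size_t input_bytes =
        batch_size * volume(engine->getBindingDimensions(input_index)) * sizeof(float);
    const size_t output_bytes =
        batch_size * volume(engine->getBindingDimensions(output_index)) * sizeof(float);

    void* bindings[2];
    cudaMalloc(&bindings[input_index],  input_bytes);
    cudaMalloc(&bindings[output_index], output_bytes);

    cudaMemcpyAsync(bindings[input_index], host_input, input_bytes,
                    cudaMemcpyHostToDevice, stream);
    context->enqueue(batch_size, bindings, stream, nullptr);
    cudaMemcpyAsync(host_output, bindings[output_index], output_bytes,
                    cudaMemcpyDeviceToHost, stream);
    cudaStreamSynchronize(stream);

    cudaFree(bindings[input_index]);
    cudaFree(bindings[output_index]);
}
```

In particular, is building a bindings array covering all input and output buffers the right approach, rather than passing only &trt_input_gpu as in the snippet above?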