How do I convert a `cuda.mem_alloc` buffer to a PyTorch tensor without copying?

I want to speed up the feature-map extractor of a Faster R-CNN FPN model. The feature maps are large. I run the extractor with TensorRT, which gives me the output as a PyCUDA `mem_alloc` object, but I need a PyTorch tensor. My current conversion does a device-to-host `memcpy` and then re-uploads the data to the GPU, which takes far too long. How can I wrap the `mem_alloc` device memory in a PyTorch tensor without copying?

my code:
# Bindings are raw device pointers for TensorRT
binding = [int(d_input), int(d_output[0]), int(d_output[1]), int(d_output[2]), int(d_output[3])]

# Host -> device copy of the input
cuda.memcpy_htod_async(d_input, input_data_tensor.data.cpu().numpy().astype(NPDTYPE), stream)
context.execute(1, binding)

# Device -> host copies of the four feature maps -- this is the slow part
cuda.memcpy_dtoh_async(output1, d_output[0], stream)
cuda.memcpy_dtoh_async(output2, d_output[1], stream)
cuda.memcpy_dtoh_async(output3, d_output[2], stream)
cuda.memcpy_dtoh_async(output4, d_output[3], stream)
stream.synchronize()

# Host -> device again, just to get PyTorch tensors back on the GPU
ou1 = torch.tensor(output1, device="cuda")
ou2 = torch.tensor(output2, device="cuda")
ou3 = torch.tensor(output3, device="cuda")
ou4 = torch.tensor(output4, device="cuda")
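One common zero-copy pattern (a sketch, not tested against your model) is to skip `cuda.mem_alloc` for the outputs entirely: allocate the output buffers as PyTorch CUDA tensors up front and hand their raw device pointers to TensorRT as bindings via `Tensor.data_ptr()`. TensorRT then writes directly into memory PyTorch already owns, so no `memcpy_dtoh`/re-upload round trip is needed. The output shapes and the `context` object in the usage comment are placeholders for your setup:

```python
import torch

def trt_buffers(input_tensor, output_shapes, dtype=torch.float32):
    """Allocate TensorRT output buffers as PyTorch tensors and return
    (outputs, bindings). TensorRT writes straight into memory that
    PyTorch owns, so no device-to-host copy is ever made."""
    assert input_tensor.is_contiguous(), "TensorRT needs contiguous memory"
    outputs = [torch.empty(s, dtype=dtype, device=input_tensor.device)
               for s in output_shapes]
    # data_ptr() exposes the raw (device) address of each tensor -- no copy.
    bindings = [input_tensor.data_ptr()] + [o.data_ptr() for o in outputs]
    return outputs, bindings

# Hypothetical usage with TensorRT (requires a CUDA device and your engine):
#
#   x = input_data_tensor.contiguous().cuda()          # keep the input on GPU
#   outputs, bindings = trt_buffers(x, OUTPUT_SHAPES)  # your FPN map shapes
#   stream = torch.cuda.current_stream().cuda_stream
#   context.execute_async_v2(bindings, stream_handle=stream)
#   torch.cuda.current_stream().synchronize()
#   ou1, ou2, ou3, ou4 = outputs                       # already torch tensors
```

Running inference on PyTorch's current CUDA stream (rather than a separate PyCUDA stream) keeps ordering correct when later PyTorch ops consume the outputs. Note `execute_async_v2` takes the bindings list directly; if you stay on the implicit-batch `execute(1, binding)` API, the same `data_ptr()` bindings work there too.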