Transfer data from GPU->CPU takes too much time.

wangsc_up · May 23, 2019, 12:35pm

Hi guys,

I met data transferring problem on jetson TX2
When i run inference(data from CPU to GPU ,inference ,data from GPU to CPU) on jetson TX2 based on my deep learning network(onnx format) , i found that transferring data from GPU to CPU takes a lot of time. It took up about 80% of the time.

The size of data needed to transfer is 1x17x80x64. TensorRT version : 5.1; Linux version : ubantu 18.04. Copy function i using is cudaMemcpyAsync();

Maybe i can optimize this processing by following ways, but there still are some issues wanted to solve:

I can use pinned memory to improve memory copy times, is it an efficient way and how i can implement?
In fact ,i will process those data(1x17x80x64) to 1x2x17 by function which is implemented by “C++” after transfer data to CPU, i
might implement this function by cuda in order to run on GPU ,then just transfer small size data(1x2x17). Is it a efficient
way to optimize? Can you provide some links to help to implement my function in cuda?

I would appreciate it if you have any advices and help！

Topic		Replies	Views
Transfer data from GPU->CPU takes too much time. TensorRT	0	273	May 23, 2019
Transfer data from GPU->CPU takes too much time. TensorRT	0	341	May 23, 2019
Transfer data from GPU->CPU takes too much time. TensorRT	0	271	May 23, 2019
Transfer data from GPU->CPU takes too much time. TensorRT	0	579	May 23, 2019
transfer data from GPU->CPU takes too much time. TensorRT	0	319	May 23, 2019
Transfer data from GPU to CPU takes too much times on TX2 TensorRT	1	1320	August 9, 2019
Transfer data from GPU to CPU takes too much times on TX2 Jetson TX2	5	1032	October 18, 2021
TensorRT copy data cost a lot of time TensorRT	1	694	April 8, 2020
data transfer cost a lot of time Jetson TX2	2	790	October 18, 2021
GPU data speed Jetson TX2 cuda	8	1085	October 18, 2021

Transfer data from GPU->CPU takes too much time.

Related topics