Zero copy for Tensorflow

eduj · March 1, 2017, 2:26am

Hi, I am currently working on machine learning with tensorflow.

It seems that tensorflow allocates separate memory space for cpu and gpu, and copy data from cpu side to gpu side. Since TX1 has one unified memory space and it is shared by CPU and GPU, tensorflow requests 2x more memory and the copy from CPU memory to GPU memory is useless in TX1 (can be eliminated by zero copy).

Tensorflow does not support zero copy. Is there any way to force all memory copy between cpu and gpu to be zero copy? Or any other framework support zero copy (e.g., caffe, theano, …)?

AastaLLL · March 1, 2017, 7:56am

Hi,

Thanks for your question.

Zero-copy function needs to be specified when calling cudaMalloc, so modification is needed.
If you want to make your tensorflow support zero copy, you can follow this page:
http://arrayfire.com/zero-copy-on-tegra-k1/

More, I think if your framework support GPU input and then it’s possible to use zero copy.
For example:

Prepared shared pointer and create model that uses this pointer as input
Load image data to the shared pointer
Inference from GPU input layer directly

Topic		Replies	Views
Zero copy with tensorflow Jetson AGX Xavier	2	874	October 18, 2021
How can I figure out how much device (GPU) memory the TK1 has? Jetson TK1	4	4518	December 20, 2014
How to disable zero-copy on TX1? Jetson TX1	4	777	October 18, 2021
Performance of zero-copy on jetson TX1 Jetson TX1	9	2170	October 18, 2021
Tensor RT memory copy Jetson TX2	8	2624	October 18, 2021
zero-copy not working on tx1 Jetson TX1	4	973	November 29, 2016
CUDA Zero Copy On TX1 Jetson TX1	20	6921	October 18, 2021
Regarding Usage of Zero Copy on TX1 to improve performance Jetson TX1	1	3253	March 15, 2016
OpenCV Performance TK1 Jetson TK1	18	10618	October 18, 2021
Why zero copy in Jetson TX2 is so slow? Jetson TX2	2	1117	October 18, 2021

Zero copy for Tensorflow

Related topics