Hi, I am currently working on machine learning with TensorFlow.
It seems that TensorFlow allocates separate memory for the CPU and the GPU and copies data from the CPU side to the GPU side. Since the TX1 has a single unified memory that is physically shared by the CPU and GPU, TensorFlow ends up requesting twice as much memory for the same data, and the copy from CPU memory to GPU memory is wasted work on the TX1 (it could be eliminated with zero copy).
TensorFlow does not support zero copy. Is there any way to force all memory copies between the CPU and GPU to be zero copy? Or does any other framework support zero copy (e.g., Caffe, Theano, …)?
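
To make clear what I mean by zero copy, here is a minimal sketch in plain CUDA, outside of any framework. It assumes the standard CUDA runtime API for mapped pinned memory (`cudaHostAlloc` with `cudaHostAllocMapped`, then `cudaHostGetDevicePointer`). The kernel name `scale` and the buffer size are just placeholders I made up for illustration, and this is not how TensorFlow manages memory internally.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: doubles every element in place.
// The pointer it receives refers to mapped host memory, not a separate device buffer.
__global__ void scale(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

int main() {
    const int n = 1 << 20;  // arbitrary size chosen for the example
    float *host_ptr = nullptr;
    float *dev_ptr = nullptr;

    // Allow mapped host allocations; set before any other CUDA call creates the context.
    cudaSetDeviceFlags(cudaDeviceMapHost);

    // Single pinned allocation that is mapped into the GPU address space.
    if (cudaHostAlloc((void **)&host_ptr, n * sizeof(float),
                      cudaHostAllocMapped) != cudaSuccess) {
        fprintf(stderr, "cudaHostAlloc failed\n");
        return 1;
    }

    // The CPU fills the buffer directly.
    for (int i = 0; i < n; ++i) host_ptr[i] = 1.0f;

    // Get the device-side alias of the same memory; no cudaMemcpy is involved.
    if (cudaHostGetDevicePointer((void **)&dev_ptr, host_ptr, 0) != cudaSuccess) {
        fprintf(stderr, "cudaHostGetDevicePointer failed\n");
        cudaFreeHost(host_ptr);
        return 1;
    }

    // The GPU works on the buffer in place.
    scale<<<(n + 255) / 256, 256>>>(dev_ptr, n);
    cudaDeviceSynchronize();

    // The CPU reads the result from the same allocation.
    printf("host_ptr[0] = %f\n", host_ptr[0]);

    cudaFreeHost(host_ptr);
    return 0;
}
```

Here there is only one allocation and no explicit host-to-device copy, which is the behavior I would like to get from the framework on the TX1.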