I’m a graduating student involved in a performance analysis of a scientific application that I’m porting to OpenACC (I’m using PGI 14.1).
In some tests, I noticed that (thanks to NVPP) maximum chunk size of data transferred to device is 16 MB.
Is this an arch-dependent value? The device is a Kepler20s
Thanks in advance.