i am doing an explration between classic way of transfer data, pinned memory, unified memory, zero-copy memory and UVA memory type of data.
I observe that apart from the time of the data trasnfer, the execution time of the kernels are using the the data i have sent with the various types of transfer are changed.
I cannot imagine why, as i think that global memory is not cached.
Has anyone any idea?
I am using Tegra x1, where cpu and gpu shares a common memory.
Thank you in advance!