Why is there a memory transfer between two GPUs?

Unless they are connected by NVLink, the 2080s are not P2P-capable: device 0 cannot directly read device 1's memory.
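If you want to confirm this on your own machine, here is a minimal sketch that asks the CUDA runtime whether each pair of devices can access each other's memory. It only uses `cudaGetDeviceCount` and `cudaDeviceCanAccessPeer` from the runtime API. The output wording is my own, and I'm assuming a box with at least two GPUs, so treat it as a quick check rather than a definitive diagnostic.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int count = 0;
    cudaError_t err = cudaGetDeviceCount(&count);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaGetDeviceCount failed: %s\n",
                     cudaGetErrorString(err));
        return 1;
    }

    // Query every ordered pair (src, dst): can src directly access dst's memory?
    for (int src = 0; src < count; ++src) {
        for (int dst = 0; dst < count; ++dst) {
            if (src == dst) continue;

            int canAccess = 0;
            err = cudaDeviceCanAccessPeer(&canAccess, src, dst);
            if (err != cudaSuccess) {
                std::fprintf(stderr, "cudaDeviceCanAccessPeer(%d, %d) failed: %s\n",
                             src, dst, cudaGetErrorString(err));
                return 1;
            }

            std::printf("device %d -> device %d: P2P %s\n",
                        src, dst, canAccess ? "supported" : "not supported");
        }
    }
    return 0;
}
```

Build it with something like `nvcc p2p_check.cu -o p2p_check` (the file name is just a placeholder). If a pair reports "not supported", device 0 has no direct path into device 1's memory, so any data shared between them has to be copied by some other route.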

NVIDIA gimped the 2000 series by disallowing P2P over PCIe. I think this may be the first time I've seen low-level CUDA code actually demonstrate this shortcoming.

See: