I hope you are doing well. I am using a small piece of code (which I could not paste here, because the post would then not submit) as a test bench to find out whether I can use the P2P memory access routines to solve a problem on 8 GPUs in a single machine.
My code is called from a Fortran program, in which I set the device according to the MPI rank assigned to a node. The Fortran program then calls a C routine that looks for the GPUs in the system and determines which peers each one can access. Each GPU (uniquely identified by its MPI rank) then initiates a transfer to the next neighbor for which peer access is possible.
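To make the neighbor selection step concrete, here is a minimal sketch of the logic in plain C. It is not the attached code: the names `next_peer` and `can_access` are hypothetical, and the access table is passed in as ordinary data. In the real test bench I assume such a table would be filled from `cudaDeviceCanAccessPeer`, with the device index equal to the MPI rank.

```c
#include <assert.h>

/*
 * Return the next device after `self` (wrapping around) that `self` can
 * access as a peer, or -1 if no such device exists.
 *
 * can_access is a flattened n-by-n table (hypothetical layout):
 * can_access[i * n + j] != 0 means device i can access device j.
 * Assumption: in the real code this table would come from
 * cudaDeviceCanAccessPeer(); here it is plain input data.
 */
int next_peer(int self, int n, const int *can_access)
{
    assert(n > 0);
    assert(self >= 0 && self < n);

    /* Walk the ring self+1, self+2, ... and skip self itself. */
    for (int step = 1; step < n; ++step) {
        int candidate = (self + step) % n;
        if (can_access[self * n + candidate])
            return candidate;
    }
    return -1; /* no accessible peer */
}
```

With 4 devices where only the pairs (0,1) and (2,3) can access each other, rank 0 would pick device 1, and rank 1 would wrap around and pick device 0.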
What I notice is that, instead of copying from the peer, the GPUs seem to access their own pointers. I guess this is because all GPUs run the same code and hence use the same pointer names. Is this true?
If so, how can I use code that runs on each GPU (and has the same variable names on each GPU) to access peer memory?
Thanks in advance.
With kind regards
nvidia_query_p2p.txt (6.29 KB)