How to use NVSHMEM with PyTorch?

I am trying to use NVSHMEM to create an embedding that I need to transfer between PEs after each layer's forward pass.
I want to feed it through torch::nn::Linear, which means I need to turn the NVSHMEM array into a libtorch tensor.
I tried torch::from_blob on the NVSHMEM buffer, but it raises errors.
Is there any way to do that?
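For reference, torch::from_blob can wrap existing device memory, but the TensorOptions must say the pointer is CUDA memory on the right device; with the default (CPU) options it fails. A minimal sketch, assuming NVSHMEM is already initialized and the current CUDA device is set (wrap_symmetric and the sizes are illustrative names, not from any API):

```cpp
#include <cuda_runtime.h>
#include <nvshmem.h>
#include <torch/torch.h>

// Sketch: view an NVSHMEM symmetric buffer as a libtorch CUDA tensor.
torch::Tensor wrap_symmetric(int64_t rows, int64_t cols) {
  // Allocate from the symmetric heap (device memory on this PE).
  float* buf = static_cast<float*>(
      nvshmem_malloc(rows * cols * sizeof(float)));

  int dev = 0;
  cudaGetDevice(&dev);

  // from_blob defaults to CPU options; the options must mark the
  // memory as CUDA on this device, or the call misbehaves.
  auto opts = torch::TensorOptions()
                  .dtype(torch::kFloat32)
                  .device(torch::kCUDA, dev);

  // Note: from_blob does NOT take ownership. Keep the buffer alive
  // while the tensor is in use, and release it with nvshmem_free()
  // only after the tensor (and all views of it) are gone.
  return torch::from_blob(buf, {rows, cols}, opts);
}
```

The resulting tensor can be used in ordinary torch ops on this PE; NVSHMEM transfers still have to go through the raw pointer.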
I know I could implement the linear layer with cuBLAS directly, but I don't know how to handle autograd, especially in the distributed case.
I think it would be better if I could use NVSHMEM together with an existing framework.
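On the autograd question: libtorch lets you define a custom backward via torch::autograd::Function, so a hand-written forward (e.g. a cuBLAS GEMM over the NVSHMEM buffer) can still participate in autograd. A sketch under that assumption (MyLinear is an illustrative name; the matmul stands in for the cuBLAS call):

```cpp
#include <torch/torch.h>

// Sketch: custom autograd node for y = x W^T with a hand-written forward.
struct MyLinear : public torch::autograd::Function<MyLinear> {
  static torch::Tensor forward(torch::autograd::AutogradContext* ctx,
                               torch::Tensor input, torch::Tensor weight) {
    ctx->save_for_backward({input, weight});
    // Replace this matmul with your own cuBLAS kernel if desired.
    return input.matmul(weight.t());
  }

  static torch::autograd::tensor_list backward(
      torch::autograd::AutogradContext* ctx,
      torch::autograd::tensor_list grad_outputs) {
    auto saved = ctx->get_saved_variables();
    auto input = saved[0];
    auto weight = saved[1];
    auto grad = grad_outputs[0];
    // Gradients of y = x W^T:  dx = g W,  dW = g^T x.
    return {grad.matmul(weight), grad.t().matmul(input)};
  }
};

// Usage: auto y = MyLinear::apply(x, w);
```

For the distributed part, one common pattern (what DDP does) is to all-reduce the weight gradients across PEs after backward; the custom function itself stays single-process.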

By the way, when I call nvshmem_finalize(), it raises:

/dvs/p4/build/sw/rel/gpgpu/toolkit/r11.8/main_nvshmem/src/host/init/init.cu:1051: non-zero status: 1 Invalid context pointer passed to nvshmemx_host_finalize.

/dvs/p4/build/sw/rel/gpgpu/toolkit/r11.8/main_nvshmem/src/host/init/init.cu:nvshmemx_host_finalize:1128: aborting due to error in nvshmem_finalize 