When I run NVSHMEM I hit this problem: [image] I then found that if I call cudaFree(0) before nvshmem_init, nvshmem_barrier_all fails. Calling other CUDA runtime APIs such as cudaMalloc/cudaFuncSetAttribute before nvshmem_init also triggers the problem. Moving cudaFree(0) to right after nvshmem_init helps…

BUG: calling cudaFree(0) before nvshmem_init() makes nvshmem_barrier_all() fail

houqi1993 April 16, 2025, 10:24pm 4

I can't find any instruction like “call cudaSetDevice first, then call nvshmem_init” in the docs. Did I miss something?

And if cudaSetDevice is necessary, maybe it should live inside the nvshmem_init implementation, so that users don't have to add it manually?

Or a WARNING message would help a lot. It took me a long time to find the problem.
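For anyone else who lands here, this is a rough sketch of the ordering discussed in this thread: select the device with cudaSetDevice, then initialize NVSHMEM, and only afterwards make other CUDA runtime calls such as cudaFree(0). It is a sketch under assumptions, not the official recommendation. The helper name `init_nvshmem_with_device` is hypothetical, the local rank is derived via MPI (the thread does not say which launcher or bootstrap is used), it assumes at most one PE per GPU on each node, and `NVSHMEMX_INIT_ATTR_INITIALIZER` assumes a recent NVSHMEM release.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>
#include <mpi.h>
#include <nvshmem.h>
#include <nvshmemx.h>

// Abort with a message if a CUDA runtime call fails.
#define CUDA_CHECK(call)                                              \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            fprintf(stderr, "CUDA error %s at %s:%d\n",               \
                    cudaGetErrorString(err_), __FILE__, __LINE__);    \
            exit(EXIT_FAILURE);                                       \
        }                                                             \
    } while (0)

// Hypothetical helper (not part of NVSHMEM): pick the GPU first,
// then initialize NVSHMEM on top of an existing MPI communicator.
static void init_nvshmem_with_device(MPI_Comm comm) {
    // Find this process's rank among the processes on the same node.
    MPI_Comm node_comm;
    MPI_Comm_split_type(comm, MPI_COMM_TYPE_SHARED, 0, MPI_INFO_NULL, &node_comm);
    int local_rank = 0;
    MPI_Comm_rank(node_comm, &local_rank);
    MPI_Comm_free(&node_comm);

    // Assumption: no more PEs per node than GPUs, so local_rank is a valid device.
    CUDA_CHECK(cudaSetDevice(local_rank));

    // Assumption: NVSHMEMX_INIT_ATTR_INITIALIZER is available in this NVSHMEM version.
    nvshmemx_init_attr_t attr = NVSHMEMX_INIT_ATTR_INITIALIZER;
    attr.mpi_comm = &comm;
    nvshmemx_init_attr(NVSHMEMX_INIT_WITH_MPI_COMM, &attr);
}

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);

    init_nvshmem_with_device(MPI_COMM_WORLD);

    // Other CUDA runtime calls come only after nvshmem_init, as in the workaround above.
    CUDA_CHECK(cudaFree(0));

    nvshmem_barrier_all();
    printf("PE %d of %d passed the barrier\n", nvshmem_my_pe(), nvshmem_n_pes());

    nvshmem_finalize();
    MPI_Finalize();
    return 0;
}
```

Wrapping the two calls in one helper is one way to get the behavior asked for above without changing nvshmem_init itself: the rest of the application only ever calls the helper, so the device is always selected before NVSHMEM is initialized.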
