But when I call nvshmemx_mc_ptr() with NVSHMEM_DISABLE_CUDA_VMM=1 set on an H800 machine, the program core dumps. When I open the core dump in gdb, the backtrace points to nvshmemx_mc_ptr:
I think NVSHMEM could provide an API that tells me whether NVLS is used in a team. I don't want to implement the logic myself to check whether the hardware supports NVLS, whether the team supports NVLS, or whether the NVSHMEM_DISABLE_CUDA_VMM environment variable is set.
But when I call nvshmemx_mc_ptr() with NVSHMEM_DISABLE_CUDA_VMM=1 set on an H800 machine, the program core dumps. When I open the core dump in gdb, the backtrace points to nvshmemx_mc_ptr:
Thanks for pointing it out. This is a known defect in the 3.2 release when the user explicitly disables VMM (which is necessary to use the NVLS feature); we have identified the same issue internally and addressed it, and the fix should be available in our 3.3 release. The workaround in 3.2 is to call __device__ void *nvshmemx_mc_ptr(nvshmem_team_t team, const void *ptr) directly in the user kernel instead of calling this API on the host and passing the multicast address to the kernel, if that works for your use case.
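For reference, here is a minimal sketch of that workaround under stated assumptions: the kernel and buffer names are illustrative, only nvshmemx_mc_ptr, NVSHMEM_TEAM_WORLD, and the standard init/malloc/free calls are real NVSHMEM APIs, and the assumption that the device call returns NULL when no multicast mapping exists is mine, not from the original post.

```cuda
// Sketch: resolve the multicast address on the device, inside the kernel,
// instead of on the host (the 3.2 workaround described above).
#include <cstdio>
#include <nvshmem.h>
#include <nvshmemx.h>

__global__ void use_mc_ptr(float *sym_buf, nvshmem_team_t team) {
    // Device-side resolution of the multicast address for the symmetric buffer.
    float *mc = (float *)nvshmemx_mc_ptr(team, sym_buf);
    if (mc == nullptr) {
        // Assumed behavior: no NVLS/multicast mapping for this team.
        if (threadIdx.x == 0 && blockIdx.x == 0)
            printf("multicast mapping unavailable, falling back\n");
        return;
    }
    // ... use mc with multimem loads/stores here ...
}

int main() {
    nvshmem_init();
    float *sym_buf = (float *)nvshmem_malloc(1024 * sizeof(float));

    // Pass the team handle; the kernel resolves the multicast pointer itself.
    use_mc_ptr<<<1, 32>>>(sym_buf, NVSHMEM_TEAM_WORLD);
    cudaDeviceSynchronize();

    nvshmem_free(sym_buf);
    nvshmem_finalize();
    return 0;
}
```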
I think NVSHMEM could provide an API that tells me whether NVLS is used in a team. I don't want to implement the logic myself to check whether the hardware supports NVLS, whether the team supports NVLS, or whether the NVSHMEM_DISABLE_CUDA_VMM environment variable is set.
The intent of this API is to provide exactly the capability you are asking for, i.e., given a team and a symmetric buffer address, return the equivalent symmetric multicast address. Once this defect is fixed, it should work reliably for your use case.
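A minimal sketch of that intended host-side pattern, under the same caveats (illustrative kernel and buffer names; the NULL return when the team has no multicast mapping is my assumption):

```cuda
// Sketch: resolve the multicast address on the host and hand it to a kernel,
// which is the pattern the fixed release is meant to support.
#include <nvshmem.h>
#include <nvshmemx.h>

__global__ void consume_mc(float *mc_buf) {
    // ... multimem loads/stores on the multicast address ...
}

int main() {
    nvshmem_init();
    float *sym_buf = (float *)nvshmem_malloc(1024 * sizeof(float));

    // Given a team and a symmetric address, the host API returns the
    // equivalent symmetric multicast address (assumed NULL when the team
    // has no NVLS/multicast mapping).
    float *mc_buf = (float *)nvshmemx_mc_ptr(NVSHMEM_TEAM_WORLD, sym_buf);
    if (mc_buf != nullptr) {
        consume_mc<<<1, 32>>>(mc_buf);
        cudaDeviceSynchronize();
    }

    nvshmem_free(sym_buf);
    nvshmem_finalize();
    return 0;
}
```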