For NVSHMEM 3.2.5, when compiling bitcode we found a bug in nvshmem/src/include/non_abi/device/common/nvshmemi_common_device.cuh that prevents libnvshmem_device.bc from correctly handling a memcpy where src % 16 == 0, dst % 16 == 0, and size % 16 != 0.
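
To make the failing condition concrete, here is a minimal host-side sketch of what a correct copy has to do in that case: move as many 16-byte chunks as possible, then finish the remaining size % 16 bytes. This is not the NVSHMEM code. The names chunked_copy and copy_is_correct are hypothetical and exist only for illustration.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical host-side model, NOT the NVSHMEM implementation:
// copy whole 16-byte chunks first, then the (size % 16) tail bytes.
void chunked_copy(void* dst, const void* src, std::size_t size) {
    auto* d = static_cast<unsigned char*>(dst);
    const auto* s = static_cast<const unsigned char*>(src);
    const std::size_t chunks = size / 16;
    for (std::size_t i = 0; i < chunks; ++i) {
        std::memcpy(d + i * 16, s + i * 16, 16);  // 16-byte body
    }
    for (std::size_t i = chunks * 16; i < size; ++i) {
        d[i] = s[i];  // tail: the part that matters when size % 16 != 0
    }
}

// Checks one copy with 16-byte-aligned src and dst (size must be <= 64).
// Returns true if the first `size` bytes match and nothing past them changed.
bool copy_is_correct(std::size_t size) {
    alignas(16) unsigned char src[64];
    alignas(16) unsigned char dst[64];
    if (size > sizeof(src)) return false;
    for (std::size_t i = 0; i < sizeof(src); ++i) {
        src[i] = static_cast<unsigned char>(i + 1);
        dst[i] = 0xEE;  // sentinel to detect missing or stray writes
    }
    if (reinterpret_cast<std::uintptr_t>(src) % 16 != 0) return false;
    if (reinterpret_cast<std::uintptr_t>(dst) % 16 != 0) return false;
    chunked_copy(dst, src, size);
    if (std::memcmp(dst, src, size) != 0) return false;
    for (std::size_t i = size; i < sizeof(dst); ++i) {
        if (dst[i] != 0xEE) return false;
    }
    return true;
}
```

A size such as 40 (two full chunks plus an 8-byte tail) exercises the problematic case. The same check could be adapted into a device-side test against libnvshmem_device.bc.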
The bug is shown below:
