Cannot use Stream Ordered Async Memory Allocator with CUDA MPS

That is pretty much the final arbiter.

FWIW, I ran a test on a V100, and cudaDevAttrMemoryPoolsSupported reports true even under MPS (CUDA 11.4). So my guess is that the limitation has something to do with pre-Volta MPS. I don’t have any further information.

If you’d like to see a change to CUDA or CUDA docs, you can always file a bug.
