Expose NCCL primitives via explicit CUDA graph API

I don’t know what that means. I already pointed to the explicit graph API, and I don’t see anything there pertaining to NCCL. I was just sharing that observation.

I thought the most tangible suggestion here was the one made by striker159.

I didn’t say that. NVIDIA generally doesn’t announce plans or make forward-looking statements here on this forum. I don’t either (at least I try not to; it’s related to maintaining my employment at NVIDIA).

If you want to see a change in CUDA APIs, my suggestion would be to file a bug.