Sharing CUDA contexts among Linux processes

I’m not aware of any method to do this, and I generally can’t comment on future plans on these forums. The only “sharing” that is possible is via CUDA IPC, which does not enable an external process to make CUPTI calls.
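To illustrate what CUDA IPC does share, here is a minimal sketch using the runtime API calls `cudaIpcGetMemHandle`, `cudaIpcOpenMemHandle` and `cudaIpcCloseMemHandle`. It shares a device memory allocation between two processes, not a context: each process still creates its own context. The pipe-based handoff, the `CHECK` macro, and the variable names are my own choices for illustration, and I'm assuming a single Linux machine with one IPC-capable GPU.

```cuda
#include <cstdio>
#include <cstdlib>
#include <unistd.h>
#include <sys/wait.h>
#include <cuda_runtime.h>

// Hypothetical helper for this sketch: abort on any CUDA runtime error.
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            fprintf(stderr, "%s failed: %s\n", #call,                 \
                    cudaGetErrorString(err_));                        \
            exit(1);                                                  \
        }                                                             \
    } while (0)

int main() {
    int fds[2];
    if (pipe(fds) != 0) return 1;

    // Fork BEFORE any CUDA call so each process initializes its own context.
    pid_t pid = fork();
    if (pid < 0) return 1;

    if (pid == 0) {
        // Child: receive the handle and map the parent's allocation.
        close(fds[1]);
        cudaIpcMemHandle_t handle;
        if (read(fds[0], &handle, sizeof(handle)) != (ssize_t)sizeof(handle))
            _exit(1);

        int *d_shared = nullptr;
        CHECK(cudaIpcOpenMemHandle((void **)&d_shared, handle,
                                   cudaIpcMemLazyEnablePeerAccess));

        int value = 0;
        CHECK(cudaMemcpy(&value, d_shared, sizeof(int),
                         cudaMemcpyDeviceToHost));
        printf("child read %d from the parent's allocation\n", value);

        CHECK(cudaIpcCloseMemHandle(d_shared));
        _exit(0);
    }

    // Parent: allocate device memory, fill it, and export an IPC handle.
    close(fds[0]);
    int *d_buf = nullptr;
    int host_value = 42;
    CHECK(cudaMalloc((void **)&d_buf, sizeof(int)));
    CHECK(cudaMemcpy(d_buf, &host_value, sizeof(int),
                     cudaMemcpyHostToDevice));

    cudaIpcMemHandle_t handle;
    CHECK(cudaIpcGetMemHandle(&handle, d_buf));
    if (write(fds[1], &handle, sizeof(handle)) != (ssize_t)sizeof(handle))
        return 1;

    // Keep the allocation alive until the child has finished with it.
    int status = 0;
    waitpid(pid, &status, 0);
    CHECK(cudaFree(d_buf));
    return (WIFEXITED(status) && WEXITSTATUS(status) == 0) ? 0 : 1;
}
```

Note that the child only gets a pointer to the memory. It has no access to the parent's context, streams, or kernels, which is why this mechanism doesn't help with profiling another process through CUPTI.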

You can make feature requests through the bug reporting mechanism.