I am working on a product where we have many processes (also different threads per process) that may want to access GPU resources (memory requests, computation, access to other components like DLA PVA).
I understand that MPS is not available for the Xavier, but I wanted to know if a multi-process access pattern is a supported use case. Is there an MPS surrogate I can use to manage requests from different processes? Do I need to write a GPU arbiter to manage resources and requests for compute? What are the best practices for multi-process development for the Xavier (i.e. async streams for data transfer)?