I have a question regarding a CNN model that is loaded into VRAM.
The model is loaded by a process when it first starts (the process is a Linux application written in C++, compiled with gcc, and linked against the CUDA libraries).
Is it possible to share the model loaded in VRAM among different processes? I want to launch a second process that uses the same model without loading it into VRAM again (otherwise there would be two identical copies of the model, occupying twice the VRAM). I would like to use something like Linux shared memory IPC: one process creates a shared memory segment that somehow maps the memory already loaded in VRAM, and another process attaches to that segment and thereby gets access to the CNN model.
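To make the idea concrete, here is a rough sketch of the host-side half of what I have in mind. It only shows how one process could publish a small opaque handle through a shared file mapping and how a second process could read it back. The names `OpaqueHandle`, `publish_handle`, `fetch_handle` and `remove_handle` are hypothetical, and the 64-byte size is an assumption. As far as I can tell, the CUDA runtime functions `cudaIpcGetMemHandle` and `cudaIpcOpenMemHandle` are meant to produce and consume such a handle for a device allocation, but I have not confirmed that they fit this use case, so no CUDA calls appear below.

```cpp
#include <cassert>
#include <cstring>
#include <fcntl.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

// Hypothetical stand-in for a GPU memory handle: an opaque blob of bytes.
// The 64-byte size is an assumption made for this sketch.
struct OpaqueHandle {
    unsigned char bytes[64];
};

// Owner process: create (or overwrite) the file at `path`, map it shared,
// and copy the handle into it. Returns true on success.
bool publish_handle(const char* path, const OpaqueHandle& h) {
    assert(path != nullptr);
    int fd = open(path, O_CREAT | O_RDWR | O_TRUNC, 0600);
    if (fd < 0) return false;
    if (ftruncate(fd, sizeof(OpaqueHandle)) != 0) { close(fd); return false; }
    void* p = mmap(nullptr, sizeof(OpaqueHandle), PROT_READ | PROT_WRITE,
                   MAP_SHARED, fd, 0);
    close(fd);  // the mapping stays valid after the descriptor is closed
    if (p == MAP_FAILED) return false;
    std::memcpy(p, &h, sizeof(OpaqueHandle));
    munmap(p, sizeof(OpaqueHandle));
    return true;
}

// Second process: map the same file read-only and copy the handle out.
// Returns false if the file is missing or too small.
bool fetch_handle(const char* path, OpaqueHandle& out) {
    assert(path != nullptr);
    int fd = open(path, O_RDONLY);
    if (fd < 0) return false;
    struct stat st;
    if (fstat(fd, &st) != 0 ||
        st.st_size < static_cast<off_t>(sizeof(OpaqueHandle))) {
        close(fd);
        return false;
    }
    void* p = mmap(nullptr, sizeof(OpaqueHandle), PROT_READ, MAP_SHARED, fd, 0);
    close(fd);
    if (p == MAP_FAILED) return false;
    std::memcpy(&out, p, sizeof(OpaqueHandle));
    munmap(p, sizeof(OpaqueHandle));
    return true;
}

// Cleanup once no process needs the handle any more.
void remove_handle(const char* path) {
    unlink(path);
}
```

The first process would call `publish_handle` once after loading the model, and each later process would call `fetch_handle` with the same path (for example a file under `/dev/shm`) instead of loading the model itself. The part I do not know how to do is turning that handle into a usable pointer to the model's VRAM in the second process.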