Hi,
We are running a SLURM environment and we often see that GPU’s are used but not a 100%. For example running Jupyter notebooks. In the past there was MPS that supported multiple users on one GPU. Nowdays this seem only to work for 1 user per GPU. Without MPS it is possible to run multiple user programs but as long as not all memory is used. Containing memory seems then again only possible by the user itself in their program, but in a multiuser setup you want to do an ‘overall’ containment of memory so as an admin you are in control. Any thought on this by anyone?