The GPU offers a default compute mode which generally allows this with the caveat that all users are actually sharing the GPU, and if each user attempts to allocate most of the memory, it won’t work. To have non-shared batched exclusive usage, you should use a job scheduler or similar tool.