Shared NVIDIA Tesla K80?

john_hosie · October 22, 2018, 9:40pm

I have a single Tesla K80 (not sure what else you need to know) running on a VM provided by AWS. The planned application will include both Gromacs and NAMD. Individual invocations of both programs could be run at any time. The big question has been whether the Tesla K80 GPU can be shared between applications (concurrent execution). Can it? My impression is that there will be limits - possibly even that only one invocation would be permitted at a time. But I need verification.

Also, how would you go about restricting the usage so that whether it is shareable or not, users could not overload it by running too many jobs at one time. My initial thought is that jobs should be submitted through a queuing system or resource manager (SGE, UGE, Torque, Slurm, MOAB, etc) to prevent users from stepping on each others’ applications. But, again, I would like verification.

It seems that no matter what, some facility would have to manage the GPU resources to avoid applications from stepping on each other. But this is territory I haven’t hit with currently available technology, so I’d like to know what you all know - especially if you have a reference to tell me categorically whether it can be done, and if so, how.

Thank you in advance.

Topic		Replies	Views
Multi-user-systems und multi-gpu-usage CUDA Programming and Performance	9	6386	July 15, 2008
manage jobs in multi-gpu system with compute exclusive mode or not CUDA Programming and Performance	14	4329	September 3, 2010
Gpu and multiple processes CUDA Programming and Performance	6	1794	September 16, 2010
Multiple threads using single Tesla CUDA Programming and Performance	3	3834	March 27, 2009
Sharing a GPU server for CUDA programming in a multi-user operating system CUDA Programming and Performance	4	18564	January 3, 2019
What happening when two users are sending a job to V100 and A100 GPU? CUDA Programming and Performance	1	364	May 10, 2023
How to limit number of cores in GPU to be used for processing CUDA Setup and Installation	2	2868	July 28, 2014
(GP)GPU batched jobs CUDA Setup and Installation	1	535	April 28, 2016
Is Tesla K40 clashing with K20 in a multi-GPU system? Teaching & Curriculum Support	0	1263	June 10, 2014
Run LLM in K80 CUDA Programming and Performance	3	8471	July 21, 2023

Shared NVIDIA Tesla K80?

Related topics