Maximizing GROMACS Throughput with Multiple Simulations per GPU Using MPS and MIG

akshaychenna · July 20, 2022, 4:49am

Hello Dr Alan,

I appreciate your response to my queries.

1)
No, I am not setting the CUDA_VISIBLE_DEVICES environment variable.
(Though I had also tried running mdrun after setting this as detailed in your blog.)

The simulation.sh file solely consists of:

module load apps/gromacs/2021.4/gnu
export OMP_NUM_THREADS=1

mpirun -np 1 gmx_mpi mdrun -v -s md.tpr -o md.trr -x md.xtc -cpo md.cpt -e md.edr -g md.log -c md.gro -ntomp 1 -nstlist 150 -nb gpu -bonded gpu -pme gpu -update gpu

2)
I have tried launching jobs with multiple GPUs and used the CUDA_VISIBLE_DEVICES variable. This had worked as expected without errors. The simulations were running on GPU_ID 0 or 1 based on our CUDA_VISIBLE_DEVICES variable used with gmx mdrun.

Some observations:

No user is able to use the second GPU using -nb gpu -bonded gpu -pme gpu -update gpu when MPS was activated by someone on the first GPU.
GROMACS only uses CPUs when -nb gpu -bonded gpu -pme gpu -update gpu flags are skipped on the second GPU jobs when MPS is already running on the first GPU. Therefore we don’t see the “no GPU is detected” error.

I am attaching the tpr file, in case you would like to test them at your end.
md.tpr (6.1 MB)

Thank you,
Akshay.

Topic		Replies	Views
Maximizing OpenMM Molecular Dynamics Throughput with NVIDIA Multi-Process Service Technical Blog	1	84	June 4, 2025
Creating Faster Molecular Dynamics Simulations with GROMACS 2020 Technical Blog	15	2548	January 10, 2023
Guidance on setting MPS_PIPE_DIRECTORY for multiple jobs in loop CUDA Programming and Performance	0	118	May 8, 2025
Is these processes are computed parallelly using MPS? General	3	755	November 22, 2019
Monte Carlo simulations on GPU CUDA Programming and Performance	6	6305	November 28, 2009
A Guide to CUDA Graphs in GROMACS 2023 Technical Blog	1	783	July 18, 2023
accelerate a single loop with mpi and gpu Legacy PGI Compilers	21	16148	July 19, 2013
Delivering up to 9X the Throughput with NAMD v3 and NVIDIA A100 GPU Technical Blog	0	501	August 25, 2020
MPI running issue using NVIDIA MPS Service on Multi-GPU nodes CUDA Programming and Performance	4	2318	September 16, 2016
GROMACS Molecular Dynamics simulations run increasingly slower as simulation progresses CUDA Programming and Performance cuda , ubuntu	3	550	August 25, 2024

Maximizing GROMACS Throughput with Multiple Simulations per GPU Using MPS and MIG

Related topics