Invalid Device when using open mpi to run multiple processes on a machine with 8 gpus

1515_group · August 3, 2017, 11:25pm

I am executing my code on an 8 gpu node with MPS on. I am trying to overload the GPUs by running 21 processes through MPI in this fashion:

mpirun -np 21 ./a.out

This run results in the following error:
call to cuDevicePrimaryCtxRetain returned error 101: Invalid device

When I run this on a machine with only a single gpu, no issues occur and it executes (inefficiently) through MPS correctly.

I am certain that it has to do with how I am calling ACC_INIT

  ACC_NUM = ACC_GET_NUM_DEVICES(ACC_DEVICE_NVIDIA)
  GPUNUM  = MOD(MYID,ACC_NUM)
  CALL ACC_SET_DEVICE(GPUNUM,ACC_DEVICE_NVIDIA)
  CALL ACC_INIT(ACC_DEVICE_NVIDIA)
  ACC_DEV = ACC_GET_DEVICE_NUM(ACC_DEVICE_NVIDIA)

Any help would be appreciated.

Robert_Crovella · August 4, 2017, 12:26am

Is this a CUDA programming question? It doesn’t look like it. If you are using PGI OpenACC, you might get more expert help by posting your question on the PGI forum. [url]http://www.pgroup.com/userforum/index.php[/url]

There is also an OpenACC section on this board. [url]https://devtalk.nvidia.com/default/board/56/openacc-toolkit/[/url]

Topic		Replies	Views
Invalid Device when using open mpi to run multiple processes Legacy PGI Compilers	1	2446	August 4, 2017
problem with multi gpu using mpi Legacy PGI Compilers	2	2192	December 2, 2015
Multi-GPU MPI launch failing when UVM enabled Legacy PGI Compilers	5	3802	January 2, 2019
cudaSetDevice failing Legacy PGI Compilers	5	7298	December 11, 2018
call to cuLaunchKernel returned error 400: Invalid handle Legacy PGI Compilers	2	4351	May 13, 2019
Failure when using OpenACC after MPI_Init nvc, nvc++ and nvfortran	7	1601	April 23, 2021
MPI mixing host and gpu devices with PGI accelerator Legacy PGI Compilers	5	3946	December 7, 2011
no devices detected Legacy PGI Compilers	6	8438	July 16, 2013
Invalid context error with OMP & GPU Legacy PGI Compilers	4	5453	October 15, 2010
CUDA & OpenACC interoperability: Device selection Legacy PGI Compilers	1	3898	July 6, 2017

Invalid Device when using open mpi to run multiple processes on a machine with 8 gpus

Related topics