I’m trying to start a work using mpi + cuda, but I’m having some problems at the execution.
I found other thread in this forum related to the problem, but on it’s case, two source codes were used. In this case the source code is unique, só I can’t use nvcc to compile only the kernel and mpicc to compile the rest.
The command I’m using to compile:
nvcc -o mpicuda template.cu -I /usr/lib/openmpi/include -L /usr/lib/openmpi/lib -lmpi
The command I’m using to run:
mpirun -l -np 1 ./mpicuda
I receive the following error:
[ubuntu:02598] [[INVALID],INVALID] ORTE_ERROR_LOG: A system-required executable either could not be found or was not executable by this user in file ../../../../../../orte/mca/ess/singleton/ess_singleton_module.c at line 269
[ubuntu:02598] [[INVALID],INVALID] ORTE_ERROR_LOG: A system-required executable either could not be found or was not executable by this user in file ../../../../../../orte/mca/ess/singleton/ess_singleton_module.c at line 143
[ubuntu:02598] [[INVALID],INVALID] ORTE_ERROR_LOG: A system-required executable either could not be found or was not executable by this user in file ../../../orte/runtime/orte_init.c at line 132
--------------------------------------------------------------------------
It looks like orte_init failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):
orte_ess_set_name failed
--> Returned value A system-required executable either could not be found or was not executable by this user (-127) instead of ORTE_SUCCESS
--------------------------------------------------------------------------
--------------------------------------------------------------------------
It looks like MPI_INIT failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during MPI_INIT; some of which are due to configuration or environment
problems. This failure appears to be an internal failure; here's some
additional information (which may only be relevant to an Open MPI
developer):
ompi_mpi_init: orte_init failed
--> Returned "A system-required executable either could not be found or was not executable by this user" (-127) instead of "Success" (0)
--------------------------------------------------------------------------
*** An error occurred in MPI_Init
*** before MPI was initialized
*** MPI_ERRORS_ARE_FATAL (your MPI job will now abort)
[ubuntu:2598] Abort before MPI_INIT completed successfully; not able to guarantee that all other processes were killed!
Problably it is a trivial error at the compilation commands.
I would appreciate if someone could help me.
And sorry about the bad english.
PS. This thread is copy of one that I started at a wrong place